4

Ask HN: Open source LLM for commercial use?

 1 year ago
source link: https://news.ycombinator.com/item?id=35512338
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

Ask HN: Open source LLM for commercial use?

Ask HN: Open source LLM for commercial use?
37 points by LewisDavidson 1 hour ago | hide | past | favorite | 16 comments
Working on a ML project and looking for an open source LLM that can be used in a commercial environment. As far as I'm aware, products cannot be built on LLAMA.

I don't want to use GPT since the project will be using personal information to train/fine tune the models.

Others have answered your question, but I'll add that the market for high quality AI models is not similar to the software marketplace, where there is always an open source alternative (and where open source is often the state of the art).

LLMs take so much engineering effort, research, and compute that it's unlikely there will be good open source alternatives in the near future. Right now your only real option is OpenAI (or maybe Anthropic) and that seems unlikely to change anytime soon.

The only reason we have LLAMA is because Meta threw us a bone. They might not do that again.

I think you might be confusing the GPT software (a generative pre trained transformer) with the finished product, an LLM (large language model.)

A GPT has no training until you give it materials. I do believe Google released the code for theirs ages ago. Even without source, you can run a GPT against your own data locally, or on a cloud service setup for that purpose.

This is how Bloomberg, for example, created a financial LLM. They used a GPT to train on their own financial data.

s.gif
Any examples of doing that process cost effectively?
s.gif
Not what you're asking but Vicuna did cost merely 300$ to fine-tune on top of LLaMA https://www.marktechpost.com/2023/04/02/meet-vicuna-an-open-...

AFAIK full model training should be a couple order magnitudes higher probably?

Just in case you were not aware: "OpenAI does not use data submitted by customers via our API to train OpenAI models or improve OpenAI’s service offering." It does for ChatGPT though.

Source: https://help.openai.com/en/articles/5722486-how-your-data-is...

What exactly do you want to do? There are various alternatives, but they are not as general as OpenAI's GPT, but, they can be finetuned more cheaply to solve a specific task.
I think https://github.com/BlinkDL/RWKV-LM could be used, but not all versions (namely instruction fine-tuned models trained on alpaca data)
s.gif
Just don't let it convince you to "reduce your carbon footprint" like the last guy did.
s.gif
Wait is this a reference to the belgian case of someone offing themselves?

Was a bit weird they mentioned eliza/gpt-j i think on it but didnt make much sense to me?

did that happen or just hallucinated?

s.gif
Yes that's the one. There hasn't been much news coverage so I suspect that it wasn't quite as convincing a case as reported. Still a little worrying though, and even if not accurate, the fact it could be is definitely worrying.
s.gif
Applications are open for YC Summer 2023
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search:

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK