TradePoint.io

2023 was a great year for open-source LLMs

December 26, 2023
in AI & Technology



The advent of ChatGPT in late 2022 set off a competitive sprint among AI companies and tech giants, each vying to dominate the burgeoning market for large language model (LLM) applications. Partly as a result of this intense rivalry, most firms opted to offer their language models as proprietary services, selling API access without revealing the underlying model weights or the specifics of their training datasets and methodologies. 

Despite this trend towards private models, 2023 witnessed a surge within the open-source LLM ecosystem, marked by the release of models that can be downloaded, run on your own servers, and customized for specific applications. The open-source ecosystem has kept pace with private models and cemented its role as a pivotal player within the LLM enterprise landscape.

Here is how the open-source LLM ecosystem evolved in 2023.

Is bigger better?

Before 2023, the prevailing belief was that improving the performance of LLMs required scaling up model size. Open-source models like BLOOM and OPT, comparable in size to OpenAI’s GPT-3 with its 175 billion parameters, symbolized this approach. Although publicly accessible, these large models needed the computational resources and specialized knowledge of large-scale organizations to run effectively.

This paradigm shifted in February 2023, when Meta introduced Llama, a family of models with sizes varying from 7 to 65 billion parameters. Llama demonstrated that smaller language models could rival the performance of larger LLMs. 

The key to Llama’s success was training on a significantly larger corpus of data. While GPT-3 had been trained on approximately 300 billion tokens, Llama’s models ingested up to 1.4 trillion tokens. This strategy of training more compact models on an expanded token dataset proved to be a game-changer, challenging the notion that size was the sole driver of LLM efficacy.
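To make the shift concrete, the token and parameter counts quoted above can be turned into a rough "training tokens per parameter" ratio. The sketch below is only illustrative arithmetic: pairing the 1.4 trillion token figure with the 65-billion-parameter Llama model is an assumption made here (the article says only "up to 1.4 trillion tokens"), and the names `MODELS` and `tokens_per_parameter` are hypothetical.

```python
# Figures as quoted in the article. Pairing 1.4T tokens with the 65B model
# is an assumption for illustration ("up to 1.4 trillion tokens").
MODELS = {
    "GPT-3 (175B)": {"params": 175e9, "tokens": 300e9},
    "Llama (65B)": {"params": 65e9, "tokens": 1.4e12},
}


def tokens_per_parameter(params: float, tokens: float) -> float:
    """Return how many training tokens were seen per model parameter."""
    return tokens / params


# Compute the ratio for each model and print it with one decimal place.
ratios = {
    name: tokens_per_parameter(m["params"], m["tokens"])
    for name, m in MODELS.items()
}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f} training tokens per parameter")
```

Under these assumptions the smaller model sees roughly an order of magnitude more data per parameter, which is the trade-off the paragraph above describes.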

The benefits of open-source models

Llama’s appeal hinged on two key features: its capacity to operate on a single or a handful of GPUs, and its open-source release. This enabled the research community to quickly build on its findings and architecture. The release of Llama catalyzed the emergence of a series of open-source LLMs, each contributing novel facets to the open-source ecosystem.
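A back-of-the-envelope calculation suggests why parameter count matters for running on "a single or a handful of GPUs." The sketch below estimates the memory needed just to hold the model weights. It assumes 16-bit weights (2 bytes per parameter) and a hypothetical accelerator with 80 GB of memory; neither figure comes from the article, and the estimate ignores activations, caches, and framework overhead, so real requirements are higher.

```python
import math

BYTES_PER_PARAM_FP16 = 2  # assumption: weights stored in 16-bit precision
GPU_MEMORY_GB = 80        # assumption: hypothetical 80 GB accelerator


def weight_memory_gb(params: float, bytes_per_param: int = BYTES_PER_PARAM_FP16) -> float:
    """Memory needed to hold the weights alone, in gigabytes (10**9 bytes)."""
    return params * bytes_per_param / 1e9


def min_gpus(params: float, gpu_memory_gb: float = GPU_MEMORY_GB) -> int:
    """Lower bound on GPU count: weights only, no activations or overhead."""
    return math.ceil(weight_memory_gb(params) / gpu_memory_gb)


# Model sizes mentioned in the article: Llama at 7B and 65B, GPT-3 at 175B.
for label, params in [("7B", 7e9), ("65B", 65e9), ("175B", 175e9)]:
    print(f"{label}: ~{weight_memory_gb(params):.0f} GB of weights, "
          f"at least {min_gpus(params)} GPU(s)")
```

Lower-precision formats would shrink these numbers further, but the ordering between model sizes stays the same.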

Notable among these were Cerebras-GPT by Cerebras, Pythia by EleutherAI, MosaicML’s MPT, XGen by Salesforce, and Falcon by the Technology Innovation Institute (TII).

In July, Meta released Llama 2, which quickly became the basis for numerous derivative models. Mistral AI made a significant impact with the release of two models, Mistral and Mixtral. The latter, in particular, has been lauded for its capabilities and cost-effectiveness.

“Since the release of the original Llama by Meta, open-source LLMs have seen an accelerated growth of progress and the latest open-source LLM, Mixtral, is ranked as the third most helpful LLM in human evaluations behind GPT-4 and Claude,” Jeff Boudier, head of product and growth at Hugging Face, told VentureBeat.

Other models such as Alpaca, Vicuna, Dolly, and Koala were developed on top of these foundation models, each fine-tuned for specific downstream applications. 

According to data from Hugging Face, a hub for machine learning models, developers have created thousands of forks and specialized versions of these models.

There are over 14,500 model results for “Llama,” 3,500 for “Mistral,” and 2,400 for “Falcon” on Hugging Face. Mixtral, despite its December release, has already become the basis for 150 projects. 

The open-source nature of these models not only facilitates the creation of new models but also enables developers to combine them in various configurations, enhancing the versatility and utility of LLMs in practical applications. 

The future of open-source models

While proprietary models advance and compete, the open-source community will remain a steadfast contender. This dynamic is even recognized by tech giants, who are increasingly integrating open-source models into their products.

Microsoft, the main financial backer of OpenAI, has not only released two open-source models, Orca and Phi-2, but has also enhanced the integration of open-source models on its Azure AI Studio platform. Similarly, Amazon, one of the main investors in Anthropic, has introduced Bedrock, a cloud service designed to host both proprietary and open-source models.

“In 2023, most enterprises were taken by surprise by the capabilities of LLMs through the introduction and popular success of ChatGPT,” Boudier said. “With every CEO asking their team to define what their Generative AI use cases should be, companies experimented and quickly built proof of concept applications using closed model APIs.”

Yet, the reliance on external APIs for core technologies poses significant risks, including the exposure of sensitive source code and customer data. This is not a sustainable long-term strategy for companies that prioritize data privacy and security.

The burgeoning open-source ecosystem presents a unique proposition for businesses aiming to integrate generative AI while meeting their data privacy and security requirements.

“As AI is the new way of building technology, AI just like other technologies before it will need to be created and managed in-house, with all the privacy, security and compliance that customer information and regulation requires,” Boudier said. “And if the past is any indication, that means with open source.”
