• bitcoinBitcoin(BTC)$76,363.00-0.57%
  • ethereumEthereum(ETH)$2,289.970.11%
  • tetherTether(USDT)$1.000.00%
  • rippleXRP(XRP)$1.38-0.71%
  • binancecoinBNB(BNB)$623.46-0.02%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$83.72-0.66%
  • tronTRON(TRX)$0.323109-0.83%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.040.95%
  • dogecoinDogecoin(DOGE)$0.0994801.55%
  • whitebitWhiteBIT Coin(WBT)$54.03-0.46%
  • USDSUSDS(USDS)$1.000.17%
  • leo-tokenLEO Token(LEO)$10.370.21%
  • HyperliquidHyperliquid(HYPE)$39.91-3.74%
  • cardanoCardano(ADA)$0.2475420.95%
  • bitcoin-cashBitcoin Cash(BCH)$450.890.62%
  • moneroMonero(XMR)$381.48-0.46%
  • chainlinkChainlink(LINK)$9.240.19%
  • CantonCanton(CC)$0.1487760.87%
  • zcashZcash(ZEC)$335.33-4.76%
  • stellarStellar(XLM)$0.162405-1.57%
  • USD1USD1(USD1)$1.000.01%
  • daiDai(DAI)$1.000.04%
  • MemeCoreMemeCore(M)$3.35-11.36%
  • litecoinLitecoin(LTC)$55.470.31%
  • avalanche-2Avalanche(AVAX)$9.180.03%
  • hedera-hashgraphHedera(HBAR)$0.088926-0.32%
  • Ethena USDeEthena USDe(USDE)$1.000.00%
  • suiSui(SUI)$0.920.01%
  • shiba-inuShiba Inu(SHIB)$0.0000060.64%
  • RainRain(RAIN)$0.0074933.35%
  • paypal-usdPayPal USD(PYUSD)$1.000.00%
  • the-open-networkToncoin(TON)$1.290.13%
  • crypto-com-chainCronos(CRO)$0.069136-0.58%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • tether-goldTether Gold(XAUT)$4,593.09-1.82%
  • BittensorBittensor(TAO)$256.564.20%
  • Global DollarGlobal Dollar(USDG)$1.000.01%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.0738762.29%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • pax-goldPAX Gold(PAXG)$4,592.31-1.81%
  • mantleMantle(MNT)$0.63-0.60%
  • polkadotPolkadot(DOT)$1.230.95%
  • uniswapUniswap(UNI)$3.250.86%
  • SkySky(SKY)$0.086795-2.63%
  • Pi NetworkPi Network(PI)$0.1950436.92%
  • Falcon USDFalcon USD(USDF)$1.00-0.10%
  • nearNEAR Protocol(NEAR)$1.35-0.46%
  • okbOKB(OKB)$82.76-0.73%
  • AsterAster(ASTER)$0.650.88%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Can LLM Already Serve as A Database Interface? Meet BIRD: A Big Bench for Large-scale Database Grounded Text-to-SQLs

May 15, 2023
in AI & Technology
Reading Time: 4 mins read
A A
Can LLM Already Serve as A Database Interface? Meet BIRD: A Big Bench for Large-scale Database Grounded Text-to-SQLs
ShareShareShareShareShare

Text-to-SQL parsing, which focuses on converting spoken English into SQL queries, has piqued the interest of both academics and business leaders. This interest is due to its ability to enable novice data analysts to automatically extract needed information using natural language from prevalent relational databases. Recent developments in neural modeling, notably those using large language models (LLMs), have produced outstanding results on popular benchmarks like Spider and WikiSQL. For instance, during the past three years, the execution accuracy of the top-performing model in Spider Leaderboard has improved from 53.5% to 85.3%. 

They found that modern, cutting-edge models still need help extrapolating to more complex, realistic scenarios that include noisy material and vast database volumes. In addition, it takes outside expertise and logic to unravel the secrets concealed beneath the enormous database values. Additionally, current benchmarks do not consider SQL execution performance, which is very important in real-world applications, particularly in the case of big databases. The large language model (LLM)’s strong comprehension and coding skills are utilized by the most recent SOTA parser in Spider, and this parser’s exceptional performance begs the question: Can LLM already be used as a database interface? 

These findings led them to create a new text-to-SQL benchmark that more closely resembles actual circumstances and reduces the gap between experimental and real-world conditions. Researchers from the University of Hong Kong, DAMO Academy of Alibaba Group, The Chinese University of Hong Kong (Shenzhen), Massachusetts Institute of Technology, and the University of Illinois suggest BIRD, a Big Bench for Large-Scale Database Grounded in Text-to-SQLs, in this study for use in practical applications. A total of 95 large databases totaling 33.4 GB in size and 12,751 complicated instances of information searching are contained in BIRD, which covers 37 different professional disciplines. Then gathered 80 open-source relational databases for training from legitimate analytic platforms (Kaggle, Relation. vit) and handpicked 15 more relational databases for assessment. They rely on crowdsourcing to get natural language commands and the associated SQLs given these databases. 

🚀 JOIN the fastest ML Subreddit Community

To assist annotators in better grasping the database contents, their database specialists first generate a description file for each database that lists all column names, shortened values, value kinds, and external knowledge. Then they employ a SQL annotation team of data engineers and database students to create SQLs to answer inquiries. At the same time, on the other side, they hire and train native speakers to ask questions about these databases. They provide a brand-new statistic called Valid Efficiency Score (VES) to measure efficiency and the usual execution correctness for created SQLs. To their knowledge, BIRD is the first text-to-SQL benchmark that considers efficiency, encouraging the use of more effective query techniques in the setting of large and noisy database contents. 

Modern text-to-SQL parsers are evaluated using two widely used methodologies: in-context learning using large language models (LLMs) like Codex (code-DaVinci-002) and ChatGPT (get-3.5-turbo) and fine-tuning with T5. Their experimental findings show that the present models need help with generalizing effectively. Particularly, on the development and test sets, the Spider SOTA model, which simply relies on the database schema, only manages execution accuracies of 25.88% and 28.95%, respectively. Compared to human performance, which they also give in this benchmark, the performance still needs to catch up. They urge more studies to address the more practical circumstances shown in this benchmark. 


Check out the Paper and Project. Don’t forget to join our 21k+ ML SubReddit, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more. If you have any questions regarding the above article or if we missed anything, feel free to email us at [email protected]

🚀 Check Out 100’s AI Tools in AI Tools Club


YOU MAY ALSO LIKE

Texas Instruments made a new flagship graphing calculator: the TI-84 Evo

NVIDIA starts offering a 12GB version of the 5070 for laptops

Aneesh Tickoo is a consulting intern at MarktechPost. He is currently pursuing his undergraduate degree in Data Science and Artificial Intelligence from the Indian Institute of Technology(IIT), Bhilai. He spends most of his time working on projects aimed at harnessing the power of machine learning. His research interest is image processing and is passionate about building solutions around it. He loves to connect with people and collaborate on interesting projects.


Credit: Source link

ShareTweetSendSharePin

Related Posts

Texas Instruments made a new flagship graphing calculator: the TI-84 Evo
AI & Technology

Texas Instruments made a new flagship graphing calculator: the TI-84 Evo

April 28, 2026
NVIDIA starts offering a 12GB version of the 5070 for laptops
AI & Technology

NVIDIA starts offering a 12GB version of the 5070 for laptops

April 28, 2026
Amazon brings dark mode to Kindle Colorsoft and Scribe Colorsoft
AI & Technology

Amazon brings dark mode to Kindle Colorsoft and Scribe Colorsoft

April 28, 2026
Mistral AI launches Workflows, a Temporal-powered orchestration engine already running millions of daily executions
AI & Technology

Mistral AI launches Workflows, a Temporal-powered orchestration engine already running millions of daily executions

April 28, 2026
Next Post
How To Short A Stock As A Beginner (Step-By-Step)

How To Short A Stock As A Beginner (Step-By-Step)

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
My Wife’s Spending Scares Me

My Wife’s Spending Scares Me

April 23, 2026
Deadline nears to charge husband in Bahamas disappearance

Deadline nears to charge husband in Bahamas disappearance

April 25, 2026
Astronauts hoisted by helicopter from splashdown site

Astronauts hoisted by helicopter from splashdown site

April 26, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!