• bitcoinBitcoin(BTC)$65,343.00-0.77%
  • ethereumEthereum(ETH)$1,765.30-1.55%
  • tetherTether(USDT)$1.000.00%
  • binancecoinBNB(BNB)$603.82-0.55%
  • usd-coinUSDC(USDC)$1.00-0.01%
  • rippleXRP(XRP)$1.20-1.69%
  • solanaSolana(SOL)$73.23-0.67%
  • tronTRON(TRX)$0.3205620.91%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.02-1.02%
  • HyperliquidHyperliquid(HYPE)$74.200.52%
  • dogecoinDogecoin(DOGE)$0.086695-0.75%
  • USDSUSDS(USDS)$1.000.01%
  • RainRain(RAIN)$0.0146053.39%
  • leo-tokenLEO Token(LEO)$9.67-0.63%
  • zcashZcash(ZEC)$493.47-2.12%
  • stellarStellar(XLM)$0.2237212.43%
  • whitebitWhiteBIT Coin(WBT)$54.01-0.20%
  • CantonCanton(CC)$0.1642980.79%
  • cardanoCardano(ADA)$0.168594-4.00%
  • moneroMonero(XMR)$334.22-3.38%
  • chainlinkChainlink(LINK)$8.20-0.96%
  • USD1USD1(USD1)$1.000.02%
  • Ethena USDeEthena USDe(USDE)$1.00-0.01%
  • the-open-networkGram (prev. Toncoin)(GRAM)$1.65-1.09%
  • bitcoin-cashBitcoin Cash(BCH)$214.31-2.08%
  • daiDai(DAI)$1.000.00%
  • LABLAB(LAB)$13.07-0.76%
  • MemeCoreMemeCore(M)$2.96-5.53%
  • hedera-hashgraphHedera(HBAR)$0.080593-0.80%
  • litecoinLitecoin(LTC)$45.34-0.36%
  • suiSui(SUI)$0.790.16%
  • Circle USYCCircle USYC(USYC)$1.130.00%
  • nearNEAR Protocol(NEAR)$2.29-1.94%
  • avalanche-2Avalanche(AVAX)$6.85-0.06%
  • shiba-inuShiba Inu(SHIB)$0.0000050.05%
  • Global DollarGlobal Dollar(USDG)$1.000.01%
  • crypto-com-chainCronos(CRO)$0.059663-1.12%
  • paypal-usdPayPal USD(PYUSD)$1.000.03%
  • tether-goldTether Gold(XAUT)$4,278.15-0.96%
  • BittensorBittensor(TAO)$253.820.63%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • worldcoin-wldWorldcoin(WLD)$0.661.20%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.13-0.28%
  • uniswapUniswap(UNI)$3.271.68%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.0614102.20%
  • pax-goldPAX Gold(PAXG)$4,292.28-0.87%
  • AsterAster(ASTER)$0.7310.97%
  • OndoOndo(ONDO)$0.3747191.81%
  • mantleMantle(MNT)$0.55-1.01%
  • polkadotPolkadot(DOT)$1.020.82%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Google AI Releases DeepPolisher: A New Deep Learning Tool that Improves the Accuracy of Genome Assemblies by Precisely Correcting Base-Level Errors

August 7, 2025
in AI & Technology
Reading Time: 8 mins read
A A
Google AI Releases DeepPolisher: A New Deep Learning Tool that Improves the Accuracy of Genome Assemblies by Precisely Correcting Base-Level Errors
ShareShareShareShareShare

Google AI, in collaboration with the UC Santa Cruz Genomics Institute, has introduced DeepPolisher, a cutting-edge deep learning tool designed to substantially improve the accuracy of genome assemblies by correcting base-level errors. Its notable efficacy was recently demonstrated in advancing the Human Pangenome Reference, a major milestone in genomics research.

The Challenge of Accurate Genome Assembly

A reference genome is an essential foundation for understanding genetic diversity, heredity, disease mechanisms, and evolutionary biology. Modern sequencing technologies, including those developed by Illumina and Pacific Biosciences, have dramatically improved sequencing accuracy and throughput—but even with technological breakthroughs, assembling an error-free human genome (comprising over 3 billion nucleotides) remains immensely challenging. Even a minuscule per-base error rate can result in thousands of errors which can obscure key genetic variations or mislead downstream analyses.

YOU MAY ALSO LIKE

Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Files Mapped to Capabilities

SanDisk’s New 8TB PS5 SSD Costs More Than Three Times As Much As The PS5 Pro

What Is DeepPolisher?

DeepPolisher is an open-source, transformer-based sequencing correction tool. Building on advances from DeepConsensus, it takes advantage of transformer deep learning architectures to further reduce errors in genome assembly, particularly insertion and deletion (indel) errors, which have a profound impact by shifting reading frames and can cause important genes or regulatory elements to be missed during annotation.

  • Technology: Encoder-only transformer, adapting proven techniques in natural language processing for genomics.
  • Training data: Leveraged a human cell line extensively characterized by NIST and NHGRI, sequenced with various platforms to ensure near-complete accuracy (~99.99999% correctness, between 300–1,000 errors in 6 billion bases).

How Does It Work? (Technical Overview)

  1. Input Alignment: Takes aligned PacBio HiFi reads against a haplotype-resolved genome assembly as input.
  2. Error Site Detection: Scans the assembly in 25kb windows; identifies candidate error sites where read evidence deviates from the assembly.
  3. Data Encoding: For each window containing putative errors (<100bp), it creates a multi-channel tensor representation of read alignment features such as base, base quality, mapping quality, and match/mismatch status.
  4. Model Inference: Feeds these tensors into the transformer, which predicts corrected sequences for these regions.
  5. Output Correction: Outputs differences in VCF format, which are then applied to the assembly to produce a polished, highly accurate sequence using tools like bcftools.

Performance and Impact

DeepPolisher delivers substantial improvements:

  • Total error reduction: ~50%
  • Indel error reduction: >70%
  • Error rates: Achieves an error rate as low as one base error per 500,000 assembled bases in real-world deployment with the Human Pangenome Reference Consortium (HPRC).
  • Genomic Q-score improvement: Raises assembly quality from Q66.7 to Q70.1 on average (Q-score is a logarithmic measure of per-base error rate; higher is better. Q70.1 implies <1 error per 12 million nucleotides)
  • Every sample tested by HPRC showed improvement.

These advances directly impact the reliability and accuracy of derived references, such as in the Human Pangenome Reference, which saw a fivefold data expansion and substantial error reduction due to DeepPolisher.

Deployment and Applications

  • Integrated in major projects: Used in HPRC’s second data release, providing high-accuracy reference assemblies for 232 individuals, ensuring broad ancestral diversity in genomic references.
  • Open-source access: Available via GitHub, with case studies and Dockerized workflows for use on assemblies produced by tools like HiFiasm and sequenced with PacBio HiFi reads.
  • Generalizability: While initially focused on human genomes, the structure and approach are adaptable to other organisms and sequencing platforms, fostering accuracy across the genomics community.

Practical Workflow Example

A typical workflow using DeepPolisher might involve:

  • Input: HiFiasm diploid assembly and PacBio HiFi reads, phase-aligned using the PHARAOH pipeline.
  • Running: Dockerized commands for image creation, inference, and correction application.
  • Output: Separate VCF files for maternal and paternal assemblies, polished FASTAs after bcftools consensus step.
  • Assessment: Use of benchmarking tools (e.g., dipcall, Hap.py) to quantify improvements in error rates and variant accuracy.

Conclusion and Future Directions

DeepPolisher represents a leap forward in genome polishing technology—sharply reducing error rates and unlocking higher resolution for functional genomics, rare variant discovery, and clinical applications. By targeting the remaining barrier to perfect genome assemblages, it enables more accurate diagnosis, population-level genetic studies, and paves the way for next-generation reference projects benefiting biomedical research and medicine.


Check out the Technical details, GitHub Page and Paper. Feel free to check out our GitHub Page for Tutorials, Codes and Notebooks. Also, feel free to follow us on Twitter and don’t forget to join our 100k+ ML SubReddit and Subscribe to our Newsletter.


Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence Media Platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts of over 2 million monthly views, illustrating its popularity among audiences.

Credit: Source link

ShareTweetSendSharePin

Related Posts

Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Files Mapped to Capabilities
AI & Technology

Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Files Mapped to Capabilities

June 17, 2026
SanDisk’s New 8TB PS5 SSD Costs More Than Three Times As Much As The PS5 Pro
AI & Technology

SanDisk’s New 8TB PS5 SSD Costs More Than Three Times As Much As The PS5 Pro

June 17, 2026
Lego Unveils Its First Working Pinball Machine
AI & Technology

Lego Unveils Its First Working Pinball Machine

June 17, 2026
The All-New Google Home Speaker Has Finally Arrived For 0
AI & Technology

The All-New Google Home Speaker Has Finally Arrived For $100

June 17, 2026
Next Post
Skydance CEO David Ellison takes the reins of a ‘new Paramount’ after merger saga

Skydance CEO David Ellison takes the reins of a 'new Paramount' after merger saga

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
These Are The Two 24%+ Yielding Weekly Cash Machines I'd Buy

These Are The Two 24%+ Yielding Weekly Cash Machines I'd Buy

June 14, 2026
A Custodial Roth IRA Turns Your Kid’s First Job Into a Head Start

A Custodial Roth IRA Turns Your Kid’s First Job Into a Head Start

June 10, 2026
Early Stage Investing in the Age of Huge Pre-IPO Rounds

Early Stage Investing in the Age of Huge Pre-IPO Rounds

June 13, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!