• Kinza Babylon Staked BTCKinza Babylon Staked BTC(KBTC)$83,270.000.00%
  • Steakhouse EURCV Morpho VaultSteakhouse EURCV Morpho Vault(STEAKEURCV)$0.000000-100.00%
  • Stride Staked InjectiveStride Staked Injective(STINJ)$16.51-4.18%
  • Vested XORVested XOR(VXOR)$3,404.231,000.00%
  • FibSwap DEXFibSwap DEX(FIBO)$0.0084659.90%
  • ICPanda DAOICPanda DAO(PANDA)$0.003106-39.39%
  • TruFin Staked APTTruFin Staked APT(TRUAPT)$8.020.00%
  • bitcoinBitcoin(BTC)$105,449.00-0.04%
  • ethereumEthereum(ETH)$2,483.41-1.14%
  • VNST StablecoinVNST Stablecoin(VNST)$0.0000400.67%
  • tetherTether(USDT)$1.000.01%
  • rippleXRP(XRP)$2.231.54%
  • binancecoinBNB(BNB)$649.03-0.10%
  • Wrapped SOLWrapped SOL(SOL)$143.66-2.32%
  • solanaSolana(SOL)$150.100.06%
  • usd-coinUSDC(USDC)$1.000.01%
  • dogecoinDogecoin(DOGE)$0.181061-1.58%
  • tronTRON(TRX)$0.284629-0.49%
  • cardanoCardano(ADA)$0.66-0.13%
  • staked-etherLido Staked Ether(STETH)$2,482.78-1.14%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$105,468.00-0.05%
  • Gaj FinanceGaj Finance(GAJ)$0.0059271.46%
  • Content BitcoinContent Bitcoin(CTB)$24.482.55%
  • USD OneUSD One(USD1)$1.000.11%
  • HyperliquidHyperliquid(HYPE)$34.93-0.51%
  • SuiSui(SUI)$3.19-1.48%
  • Wrapped stETHWrapped stETH(WSTETH)$2,995.80-0.98%
  • UGOLD Inc.UGOLD Inc.(UGOLD)$3,042.460.08%
  • ParkcoinParkcoin(KPK)$1.101.76%
  • chainlinkChainlink(LINK)$13.60-1.07%
  • leo-tokenLEO Token(LEO)$9.28-0.18%
  • avalanche-2Avalanche(AVAX)$20.340.16%
  • stellarStellar(XLM)$0.264583-0.98%
  • bitcoin-cashBitcoin Cash(BCH)$413.851.70%
  • ToncoinToncoin(TON)$3.170.22%
  • shiba-inuShiba Inu(SHIB)$0.000012-1.71%
  • hedera-hashgraphHedera(HBAR)$0.1685480.54%
  • USDSUSDS(USDS)$1.000.00%
  • Yay StakeStone EtherYay StakeStone Ether(YAYSTONE)$2,671.07-2.84%
  • litecoinLitecoin(LTC)$87.27-0.99%
  • wethWETH(WETH)$2,485.77-1.01%
  • Wrapped eETHWrapped eETH(WEETH)$2,658.80-0.99%
  • polkadotPolkadot(DOT)$3.98-1.36%
  • Pundi AIFXPundi AIFX(PUNDIAI)$16.000.00%
  • moneroMonero(XMR)$328.21-1.51%
  • Binance Bridged USDT (BNB Smart Chain)Binance Bridged USDT (BNB Smart Chain)(BSC-USD)$1.00-0.02%
  • PengPeng(PENG)$0.60-13.59%
  • Ethena USDeEthena USDe(USDE)$1.000.02%
  • Bitget TokenBitget Token(BGB)$4.64-0.66%
  • MurasakiMurasaki(MURA)$4.32-12.46%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Microsoft Researchers Introduce StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

February 5, 2024
in AI & Technology
Reading Time: 4 mins read
A A
Microsoft Researchers Introduce StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
ShareShareShareShareShare

YOU MAY ALSO LIKE

Why Meta’s Biggest AI Bet Isn’t on Models—It’s on Data

Xbox handheld, Resident Evil Requiem and more

Natural Language Processing (NLP) is one area where Large transformer-based Language Models (LLMs) have achieved remarkable progress in recent years. Also, LLMs are branching out into other fields, like robotics, audio, and medicine.

Modern approaches allow LLMs to produce visual data using specialized modules like VQ-VAE and VQ-GAN, which convert continuous visual pixels into discrete grid tokens. The LLM then processes these altered grid tokens similarly to how textual word processing works, which helps with the generative modeling process of LLMs. On the other hand, LLMs aren’t as good as diffusion models. 

By applying an alternate image format and vector graphics, a new study by Soochow University, Microsoft Research Asia, and Microsoft Azure AI presents a fresh method that essentially preserves the semantic concepts of images. Vector graphics readily capture the semantic concepts of the image, unlike pixel-based formats, which conceal the creation of objects. In their suggested “stroke” token system, for instance, the dolphin is divided into a series of linked strokes containing full semantic information in each stroke unit.

The team highlights that they are not arguing for the inherent superiority of vector graphics over raster images; rather, we are presenting a new way of looking at visual representation. The “stroke” token idea has several benefits, such as:

  1. Each stroke token has visual semantics built-in, making semantic segmentation of image content more intuitive.
  2. Vector graphics are inherently compatible with LLMs because their creation process is sequential and interconnected, similar to how LLMs process information. Put another way, LLMs can digest the strokes more naturally since each one is formed about the ones that came before and after it. 
  3. Highly compressing vector graphics strokes can drastically reduce data size without sacrificing quality or semantic integrity. This makes it possible for each stroke token to encompass a rich, compressed representation of the visual information. 

Based on the analysis above, they present StrokeNUWA, a model that generates vector graphics independently of the visual module. An Encoder-Decoder model plus a VQ-Stroke module makeup StrokeNUWA. The VQ-Stroke may condense serialized vector graphic data into several SVG tokens; it is based on the design of the residual quantizer model. The Encoder-Decoder model mostly uses a pre-trained LLM to generate SVG tokens in response to textual instructions. 

The researchers evaluate StrokeNUWA with optimization-based approaches for the text-guided SVG production job. By improving CLIPScore measures, the proposed method demonstrates that stroke tokens can produce visually semantically richer material. Stroke tokens can be successfully integrated with LLMs since their solution outperforms LLM-based baselines on all criteria. Lastly, the approach achieves speed improvements of up to 94 times, demonstrating great efficiency in generation, thanks to the compression capabilities inherent in vector graphics. 

This study highlights the immense possibilities of using stroke tokens for vector graphic creation. The team’s long-term goal is to refine stroke token quality further using LLM-specific advanced visual tokenization techniques. They also plan to expand stroke tokens to further domains (3D), tasks (SVG Understanding), and creating SVGs from real-world photos.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and Google News. Join our 36k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our Telegram Channel


Dhanshree Shenwai is a Computer Science Engineer and has a good experience in FinTech companies covering Financial, Cards & Payments and Banking domain with keen interest in applications of AI. She is enthusiastic about exploring new technologies and advancements in today’s evolving world making everyone’s life easy.


🎯 [FREE AI WEBINAR] ‘Using ANN for Vector Search at Speed & Scale (Demo on AWS)’ (Feb 5, 2024)


Credit: Source link

ShareTweetSendSharePin

Related Posts

Why Meta’s Biggest AI Bet Isn’t on Models—It’s on Data
AI & Technology

Why Meta’s Biggest AI Bet Isn’t on Models—It’s on Data

June 9, 2025
Xbox handheld, Resident Evil Requiem and more
AI & Technology

Xbox handheld, Resident Evil Requiem and more

June 9, 2025
Lumines Arise combines that addictive puzzling flow with a killer soundtrack
AI & Technology

Lumines Arise combines that addictive puzzling flow with a killer soundtrack

June 9, 2025
13 Cool Amazon Gadgets You’ll Want in 2025
AI & Technology

13 Cool Amazon Gadgets You’ll Want in 2025

June 8, 2025
Next Post
This AI Paper from UNC-Chapel Hill Proposes ReGAL: A Gradient-Free Method for Learning a Library of Reusable Functions via Code Refactorization

This AI Paper from UNC-Chapel Hill Proposes ReGAL: A Gradient-Free Method for Learning a Library of Reusable Functions via Code Refactorization

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Blockchain is the future of currency, expert says

Blockchain is the future of currency, expert says

June 8, 2025
Jail escapee appears to post social media video claiming he was ‘let out’

Jail escapee appears to post social media video claiming he was ‘let out’

June 7, 2025
Trump was ‘very surprised’ by Elon Musk’s criticism of GOP spending bill

Trump was ‘very surprised’ by Elon Musk’s criticism of GOP spending bill

June 6, 2025

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!