Close Menu
  • News
    • Bitcoin
    • Altcoins
    • DeFi
    • Market Cap
  • Blockchain
  • Web 3
    • NFT
    • Metaverse
  • Regulation
  • Analysis
  • Learn
  • Blog
What's Hot

Chainlink brings Samsung, Toyota and Sony prices on-chain with APAC stock streams

2026-06-24

BNO Developments is making energy class A the standard for shortlisted new construction projects in Cyprus

2026-06-24

Securitize Tokenizes Roubini-Linked ETF under Dubai VARA Framework

2026-06-24
Facebook X (Twitter) Instagram
  • Contact
  • Terms & Conditions
  • Privacy Policy
  • DMCA
  • Advertise
Facebook X (Twitter) Instagram
Bitcoin Platform – Bitcoin | Altcoins | Blockchain | News Stories Updated Daily
  • News
    • Bitcoin
    • Altcoins
    • DeFi
    • Market Cap
  • Blockchain

    Chainlink brings Samsung, Toyota and Sony prices on-chain with APAC stock streams

    2026-06-24

    Aztec reaches L2Beat Phase 2 after Governance revokes ownership of the rollup contract

    2026-06-24

    What is MEV? Maximal Extractable Value, the invisible tax on crypto

    2026-06-24

    Orix AI partners with PAYGO to enable AI-powered Web3 payments

    2026-06-23

    How the network processed $309 million in stablecoins last month

    2026-06-23
  • Web 3
    • NFT
    • Metaverse
  • Regulation

    Stablecoins in Britse ponden gemaximeerd op $53 miljard, terwijl de Bank of England stablecoin-regels vastlegt

    2026-06-22

    De Amerikaanse toekomst van crypto-daders zal worden bepaald door hoe toezichthouders besluiten ze te noemen

    2026-06-22

    De MiCA-deadline zal waarschijnlijk kleinere crypto-apps naar gelicentieerde bewaarrails verplaatsen

    2026-06-22

    dollar liquidity may already be too far ahead

    2026-06-22

    Kraken Fed-accountgevecht zou kunnen bepalen hoe cryptobedrijven directe betalingstoegang krijgen

    2026-06-21
  • Analysis

    Ethereum Foundation bezuinigt met 20% op personeel, terwijl ETH YTD met 44% daalt ondanks recordgebruik

    2026-06-24

    CZ noemde het no-KYC-model van Hyperliquid “geweldig”

    2026-06-24

    South Korea’s KOSPI crashes 10% as regulator admits ETF error

    2026-06-23

    Trumps quantum computing-push zet 449 miljard dollar aan ‘blootgestelde Bitcoin’ weer in de schijnwerpers

    2026-06-23

    Solana subsidizes large traders before the markets in the chain prove that the activity can continue to exist

    2026-06-23
  • Learn

    Most Profitable Crypto to Mine in 2026: Best Altcoins for Mining

    2026-06-23

    Bitcoin Alternatives: Our Top Altcoin Picks for You in 2026

    2026-06-23

    What Is a Bull Flag Pattern in Crypto and How to Use It

    2026-06-20

    What Is OTC Trading? Over-the-Counter Trading Explained

    2026-06-20

    The Top 10 Bitcoin Wallets in 2026

    2026-06-20
  • Blog
Bitcoin Platform – Bitcoin | Altcoins | Blockchain | News Stories Updated Daily
Home»NFT»If AI Image Generators Are So Smart, Why Do They Struggle to Write and Count?
If AI Image Generators Are So Smart, Why Do They Struggle to Write and Count?
NFT

If AI Image Generators Are So Smart, Why Do They Struggle to Write and Count?

2023-07-29No Comments4 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

Generative AI tools such as Midjourney, Stable Diffusion, and DALL-E 2 have astounded us with their ability to produce remarkable images in a matter of seconds.

Despite their achievements, however, there remains a puzzling disparity between what AI image generators can produce and what we can. For instance, these tools often won’t deliver satisfactory results for seemingly simple tasks such as counting objects and producing accurate text.

If generative AI has reached such unprecedented heights in creative expression, why does it struggle with tasks even a primary school student could complete?

Exploring the underlying reasons helps sheds light on the complex numerical nature of AI, and the nuance of its capabilities.

AI’s limitations with writing

Humans can easily recognize text symbols (such as letters, numbers, and characters) written in various different fonts and handwriting. We can also produce text in different contexts, and understand how context can change meaning.

Current AI image generators lack this inherent understanding. They have no true comprehension of what text symbols mean. These generators are built on artificial neural networks trained on massive amounts of image data, from which they “learn” associations and make predictions.

Combinations of shapes in the training images are associated with various entities. For example, two inward-facing lines that meet might represent the tip of a pencil or the roof of a house.

But when it comes to text and quantities, the associations must be incredibly accurate, since even minor imperfections are noticeable. Our brains can overlook slight deviations in a pencil’s tip or a roof – but not as much when it comes to how a word is written, or the number of fingers on a hand.

See also  How smart contracts can streamline processes across industries

As far as text-to-image models are concerned, text symbols are just combinations of lines and shapes. Since text comes in so many different styles – and since letters and numbers are used in seemingly endless arrangements – the model often won’t learn how to effectively reproduce text.

AI-generated image produced in response to the prompt ‘KFC logo.’ | Credit: The Conversation

The main reason for this is insufficient training data. AI image generators require much more training data to accurately represent text and quantities than they do for other tasks.

The tragedy of AI hands

Issues also arise when dealing with smaller objects that require intricate details, such as hands.

Two AI-generated images produced in response to the prompt ‘young girl holding up ten fingers, realistic.’ | Credit: The Conversation

In training images, hands are often small, holding objects, or partially obscured by other elements. It becomes challenging for AI to associate the term “hand” with the exact representation of a human hand with five fingers.

Consequently, AI-generated hands often look misshapen, have additional or fewer fingers, or have hands partially covered by objects such as sleeves or purses.

We see a similar issue when it comes to quantities. AI models lack a clear understanding of quantities, such as the abstract concept of “four.” As such, an image generator may respond to a prompt for “four apples” by drawing on learning from myriad images featuring many quantities of apples – and return an output with the incorrect amount.

In other words, the huge diversity of associations within the training data impacts the accuracy of quantities in outputs.

Three AI-generated images produced in response to the prompt ‘5 soda cans on a table.’ | Credit: The Conversation

Will AI ever be able to write and count?

It’s important to remember text-to-image and text-to-video conversion is a relatively new concept in AI. Current generative platforms are “low-resolution” versions of what we can expect in the future.

See also  Top NFT Blockchains by Trading Volume

With advancements being made in training processes and AI technology, future AI image generators will likely be much more capable of producing accurate visualizations.

It’s also worth noting most publicly accessible AI platforms don’t offer the highest level of capability. Generating accurate text and quantities demands highly optimized and tailored networks, so paid subscriptions to more advanced platforms will likely deliver better results.


This article is republished from The Conversation under a Creative Commons license. Read the original article by Seyedali Mirjalili, Professor, Director of Centre for Artificial Intelligence Research and Optimisation, Torrens University Australia.



Source link

andCount Generators Image Smart Struggle write
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Why are Pudgy Penguins (PENGU) popular? What you need to know

2026-06-21

Top 10 NFT Artists by Trading Volume, Courtyard Outranks

2026-06-21

Pudgy Penguins is expanding its retail footprint with the rollout of Target trading cards

2026-06-20

Circle Unveils Arc Privacy to Bring Confidential Smart Contracts to Institutions

2026-06-13
Add A Comment
Leave A Reply Cancel Reply

Top Posts

What Are Nodes in Crypto?

2024-05-09

Coinbase unveils the Bitcoin yield fund for global institutional investors

2025-04-28

Are AI and Blockchain set to rewrite the federal expenses?

2025-02-08
Editors Picks

Bitcoin Sellers in Disbelief! Here’s why this could lead to a short squeeze

2025-10-20

Top NFT Airdrops and Giveaways for May 2024

2024-05-02

Bitcoin Enters ‘Most Frustrating Phase,’ Says CryptoQuant: A Look at What’s to Come

2026-03-12

Tether’s USDT on TRON Network Surpasses Visa’s Daily Average Volume of $42,000,000,000: Lookonchain

2024-06-23

Our mission is to develop a community of people who try to make financially sound decisions. The website strives to educate individuals in making wise choices about Cryptocurrencies, Defi, NFT, Metaverse and more.

We're social. Connect with us:

Facebook X (Twitter) Instagram Pinterest YouTube
Top Insights

Chainlink brings Samsung, Toyota and Sony prices on-chain with APAC stock streams

BNO Developments is making energy class A the standard for shortlisted new construction projects in Cyprus

Securitize Tokenizes Roubini-Linked ETF under Dubai VARA Framework

Get Informed

Subscribe to Updates

Get the latest news and Update from Bitcoin Platform about Crypto, Metaverse, NFT and more.

  • Contact
  • Terms & Conditions
  • Privacy Policy
  • DMCA
  • Advertise
© 2026 Bitcoinplatform.com - All rights reserved.

Type above and press Enter to search. Press Esc to cancel.