Hallucination Rates of ChatGPT, Gemini, and Other AI Models by VISUALCAPITALIST.com

Generative AI models keep getting version bumps: Gemini 3, ChatGPT 5.1, Claude Opus 4.5, and so on. Whether the update is small or large, each one shows off how powerful AI has become. Yet even as these models seem to get smarter, we still can't fully trust their answers, because they sometimes say strange things through what is known as AI hallucination. Occasionally most of an answer is false, but more often a few falsehoods are mixed into a mostly true answer, which is even more confusing.

 

Does AI Lie? Comparing Hallucination Rates Across Major AI Models

 

Have you ever wondered which AI models hallucinate the most? This infographic, put together by TERZO and VISUAL CAPITALIST from tests the Columbia Journalism Review ran in March 2025, compares hallucination rates across the major AI models. The data is a few months old and the experiment was limited, so it's hardly the final word, but it's worth a look.

 

 

AI Search Has a Citation Problem

We compared eight AI search engines. They're all bad at citing news.

www.cjr.org

 

AI spits out an answer in seconds no matter what you ask, but how often is that answer untrue? The worst offender was xAI's Grok-3, with a hallucination rate of a whopping 94%. Grok-2 came in at 77%, making the two siblings the most hallucination-prone models in the comparison. Google's Gemini followed with a fairly high 76%.

 

DeepSeek, the Chinese model that drew attention for its value for money, came in at 68%, OpenAI's ChatGPT at 67%, Perplexity Pro at 45%, and Microsoft's Copilot at 40%. Perplexity had the lowest rate in the comparison at 37%, and interestingly its Pro version scored worse.
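If you'd like to play with these numbers yourself, here is a minimal Python sketch that ranks the figures quoted in this post and works out the gap between Perplexity Pro and regular Perplexity. The dictionary just copies the percentages above; the names `rates`, `rank_by_rate`, and `pro_gap` are made up for illustration and don't come from the study or the infographic.

```python
# Hallucination rates (percent) exactly as quoted in this post.
rates = {
    "Grok-3": 94,
    "Grok-2": 77,
    "Gemini": 76,
    "DeepSeek": 68,
    "ChatGPT": 67,
    "Perplexity Pro": 45,
    "Copilot": 40,
    "Perplexity": 37,
}


def rank_by_rate(rates):
    """Return (model, rate) pairs sorted from highest to lowest rate."""
    return sorted(rates.items(), key=lambda item: item[1], reverse=True)


ranking = rank_by_rate(rates)

# Difference between the Pro and standard Perplexity figures, in percentage points.
pro_gap = rates["Perplexity Pro"] - rates["Perplexity"]

for position, (model, rate) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {rate}%")
print(f"Perplexity Pro minus Perplexity: {pro_gap} percentage points")
```

Keep in mind this only re-sorts the published percentages. It says nothing about how each response was graded.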

 

The researchers reportedly gave each model excerpts from news articles, asked it to identify the original article, its URL, and so on, and judged the accuracy of the responses by comparing them with traditional Google search results. The rates are high enough to make you wonder whether the models really get that much wrong. They may be getting better, but AI is still not free of hallucination, so please don't trust its answers blindly or make important decisions on them alone. Even in the age of AI, remember to look before you leap. @_@/


 

Ranked: AI Hallucination Rates by Model

Key Takeaways Many of today's AI models struggled when asked to identify and cite news sources from an excerpt, producing frequent errors. The highest over…

www.voronoiapp.com
