PolyU research finds that improving AI large language models helps them better align with human brain activity

HONG KONG, May 27, 2024 /PRNewswire/ -- With generative artificial intelligence (GenAI) transforming the social interaction landscape in recent years, large language models (LLMs), which use deep-learning algorithms to train GenAI platforms to process language, have been put in the spotlight. A recent study by The Hong Kong Polytechnic University (PolyU) found that LLMs behave more like the human brain when they are trained in ways that more closely resemble how humans process language, a finding that offers important insights for brain research and the development of AI models.

Current LLMs mostly rely on a single type of pretraining: contextual word prediction. This simple learning strategy has achieved surprising success when combined with massive training data and model parameters, as shown by popular LLMs such as ChatGPT. Recent studies also suggest that word prediction in LLMs can serve as a plausible model for how humans process language. However, humans do not simply predict the next word; they also integrate high-level information in natural language comprehension.

A research team led by Prof. Li Ping, Dean of the Faculty of Humanities and Sin Wai Kin Foundation Professor in Humanities and Technology at PolyU, has incorporated the next sentence prediction (NSP) task, which simulates one central process of discourse-level comprehension in the human brain by evaluating whether a pair of sentences is coherent, into model pretraining and examined the correlation between the model's data and brain activation. The study was recently published in the academic journal Science Advances.
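To make the NSP task concrete, the following is a minimal sketch of how coherent and incoherent sentence pairs could be built from an ordered text. It is an illustration only and not the research team's code: the function name `make_nsp_pairs`, the 50/50 split between true and random continuations, and the seed are all assumptions.

```python
import random


def make_nsp_pairs(sentences, seed=0):
    """Build (first, second, is_coherent) examples from an ordered list of sentences.

    Hypothetical helper: roughly half of the pairs keep the true next sentence,
    the rest swap in a sentence drawn from elsewhere in the text.
    """
    if len(sentences) < 3:
        raise ValueError("need at least three sentences to sample a negative pair")
    rng = random.Random(seed)
    pairs = []
    for i in range(len(sentences) - 1):
        if rng.random() < 0.5:
            # Coherent pair: the sentence that actually follows.
            pairs.append((sentences[i], sentences[i + 1], True))
        else:
            # Incoherent pair: any sentence except the current one and its true continuation.
            candidates = [j for j in range(len(sentences)) if j not in (i, i + 1)]
            j = rng.choice(candidates)
            pairs.append((sentences[i], sentences[j], False))
    return pairs


if __name__ == "__main__":
    text = [
        "She opened the door.",
        "The room was dark.",
        "She reached for the light switch.",
        "Prices rose sharply last quarter.",
    ]
    for first, second, is_coherent in make_nsp_pairs(text):
        print(is_coherent, "|", first, "->", second)
```

A model pretrained with NSP would be asked to predict the third element of each tuple from the first two, alongside its usual word prediction objective.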

The research team trained two models, one with NSP enhancement and the other without; both also learned word prediction. Functional magnetic resonance imaging (fMRI) data were collected from people reading connected or disconnected sentences. The research team examined how closely the patterns from each model matched the brain patterns in the fMRI data.
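As a rough illustration of what "matching" model patterns to brain patterns can mean, the sketch below scores two models by the Pearson correlation between their pattern values and a brain pattern. The release does not specify the study's actual analysis, so the use of Pearson correlation is an assumption, and every number here is a made-up toy value rather than data from the study.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length sequences of numbers."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have the same length, at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Toy values only (hypothetical): one number per sentence pair.
brain_pattern = [0.2, 0.5, 0.1, 0.9]
model_with_nsp = [0.1, 0.4, 0.2, 0.8]
model_without_nsp = [0.9, 0.1, 0.5, 0.2]

if __name__ == "__main__":
    print("with NSP:   ", round(pearson(brain_pattern, model_with_nsp), 3))
    print("without NSP:", round(pearson(brain_pattern, model_without_nsp), 3))
```

In this toy setup, the model whose values rise and fall with the brain pattern receives the higher score; a real analysis would work with high-dimensional fMRI and model representations.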

It was clear that training with NSP provided benefits. The model with NSP matched human brain activity in multiple areas much better than the model trained only on word prediction. Its mechanism also maps nicely onto established neural models of human discourse comprehension. The results gave new insights into how our brains process full discourse such as conversations. For example, parts of the right side of the brain, not just the left, contributed to understanding longer discourse. The model trained with NSP could also better predict how fast someone read, showing that simulating discourse comprehension through NSP helped AI understand humans better.

Recent LLMs, including ChatGPT, have relied on vastly increasing the training data and model size to achieve better performance. Prof. Li Ping said, "There are limitations in just relying on such scaling. Advances should also be aimed at making the models more efficient, relying on less rather than more data. Our findings suggest that diverse learning tasks such as NSP can improve LLMs to be more human-like and potentially closer to human intelligence."

He added, "More importantly, the findings show how neurocognitive researchers can leverage LLMs to study higher-level language mechanisms of our brain. They also promote interaction and collaboration between researchers in the fields of AI and neurocognition, which will lead to future studies on AI-informed brain studies as well as brain-inspired AI."

Media Contact
Ms Annie Wong
Senior Manager, Public Affairs
Tel: +852 3400 3853
Email: anniewy.wong@polyu.edu.hk

source: The Hong Kong Polytechnic University
