Linq's AI Retrieval Model Achieves the Top Spot on the HuggingFace MTEB Leaderboard

BOSTON, June 5, 2024 /PRNewswire/ -- Linq, a generative AI startup, announced that its large embedding model "Linq-Embed-Mistral" ranked first in the text retrieval evaluation on HuggingFace's "Massive Text Embedding Benchmark (MTEB)" leaderboard, outpacing competitors like NVIDIA, Salesforce, Google, OpenAI, and Cohere. This evaluation is run by HuggingFace, the world's largest machine learning platform.

Linq's embedding model achieved a score of 60.2 points in the text retrieval category, securing the top position. This placed Linq ahead of NVIDIA, which scored 59.4 points, and Voyage AI, which scored 58.3 points. Google's model followed with a score of 55.7, while OpenAI and Cohere scored 55.4 and 55.0 points, respectively.

The MTEB leaderboard by HuggingFace ranks the performance of embedding models across seven categories: classification, clustering, pair classification, reranking, retrieval, semantic textual similarity (STS), and summarization. Linq's embedding model performed well not only in the text retrieval category but also in the other categories, earning an overall rank of third.

The MTEB leaderboard lists more than 300 embedding models, a sign of how competitive the field has become. Linq's first-place ranking in the retrieval category reflects the strength of its embedding model technology.

Embedding models are critical in generative AI, particularly for addressing the hallucination problem of large language models (LLMs) through retrieval-augmented generation (RAG). RAG allows models to produce more reliable outputs by retrieving the latest data or internal documents that are not available within the LLM itself.
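The retrieval step that RAG relies on can be illustrated with a minimal sketch. It is not Linq's implementation: the function names (`cosine_similarity`, `retrieve`), the document names, and the three-dimensional toy vectors are all hypothetical, standing in for the much larger vectors an embedding model would produce. The idea is only that documents whose embeddings lie closest to the query embedding are the ones handed to the LLM as context.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve(query_vec, doc_vecs, top_k=2):
    """Return the ids of the top_k documents most similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), doc_id)
              for doc_id, vec in doc_vecs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:top_k]]

# Toy vectors (assumption): a real embedding model would generate these
# from the document text and the user's question.
doc_vecs = {
    "q2_earnings_memo": [0.9, 0.1, 0.0],
    "office_lunch_menu": [0.0, 0.2, 0.9],
    "annual_report": [0.7, 0.3, 0.1],
}
query_vec = [1.0, 0.0, 0.0]

top_docs = retrieve(query_vec, doc_vecs)
print(top_docs)
```

In a full RAG pipeline, the text of the retrieved documents would then be added to the LLM prompt, so the quality of the embedding model directly affects which documents the LLM sees.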

Dr. Junseong Kim, who led the project, stated, "Our research demonstrates that due to the broad topic diversity and challenging difficulty of retrieval data, GPT-generated data is not perfect and requires thorough verification and refinement. Through these processes, we can achieve quality comparable to human-labeled data, ultimately attaining the best retrieval performance based on the MTEB benchmark dataset. This study shows that through elaborate data crafting and filtering using GPT, we can create models optimized for retrieval-augmented generation (RAG) and maximize performance in specific fields." He added, "Not only is refined data crucial, but optimized training methodologies and rapid experimental cycles are also key to maximizing retrieval performance."

Linq's Co-founder & CEO, Jacob Choi, emphasized, "Accurate search is crucial for generative AI enterprises' adoption. We're proud to have developed the core embedding model to achieve this, and we'll keep expanding and refining it to ensure precise text searches in specialized fields like finance and legal." Choi noted that while 2023 saw the rise of B2C (business-to-consumer) use cases for generative AI with the advent of ChatGPT, 2024 will witness the growth of B2B (business-to-business) applications with improved accuracy and security technologies.

Massive Text Embedding Benchmark (MTEB) BEIR retrieval scores on HuggingFace, as of May 30, 2024.

[Company Description]

Linq (Wecover Platforms Inc) was founded in 2022 by MIT Electrical and Computer Engineering graduate Jacob Choi and MIT Computational Science and Engineering Ph.D. Subeen Pang. In 2021, Choi was named to Forbes' "30 Under 30" list in the science category for his AI neuromorphic computing research. Linq received early investments from KakaoVentures, Smilegate Investment, and Yellowdog in 2022. In 2023, Linq won the Samsung Open Collaboration hosted by Samsung Financial Networks and was selected for the Fintech cohort of MassChallenge, the largest non-equity accelerator in the U.S., where it continued its collaboration with KPMG US.

Contact: Jacob Choi (jacob.choi@getlinq.com)

source: Linq (Wecover Platforms Inc)
