Karya Partners with Microsoft Research for Extensive Evaluation of Indian LLMs

The project involved over 90,000 human evaluations of 30 models in 10 Indian languages, completed in under three weeks.

Karya has partnered with Microsoft Research to conduct a large-scale human evaluation of large language models (LLMs). The project involved over 90,000 human evaluations of 30 models in 10 Indian languages, completed in under three weeks. This effort is one of the largest multilingual human evaluations of LLMs to date.

The evaluation addressed challenges such as linguistic diversity, benchmark contamination, and the incorporation of local cultural nuances. To ensure a comprehensive assessment, Karya engaged a broad community of workers representative of the general Indian population.

The evaluation results, including detailed leaderboards of model performance, are available in the published paper. Karya’s evaluation services cover a range of benchmarks, including linguistic acceptability, cultural sensitivity, and reasoning.

A presentation of Karya’s LLM Evaluation Services is also available. The project was notably complex, and Karya’s team delivered accurate results.

Multi-Turn Evaluation: Karya workers assessed models using various multi-turn conversation benchmarks, including tasks such as recollection, expansion, refinement, and follow-ups. The evaluation measured model performance in perception, reasoning, and creativity.

Model Comparison: The evaluation also involved comparing multiple models against a common benchmark to determine their relative performance. This analysis aimed to identify the best-performing model for specific tasks.

The execution of this project was led by Jeevitha and included Bipin, Victor, Sakshi, Deepsikha, Swarup, Rizwan, Praveen, Hari, Iqbal, Rupal, Neha, Anand, Monali, Aishwarya, and Anushree. Karya aims to expand such high-complexity digital tasks, potentially increasing earning opportunities for workers and advancing pathways out of poverty.

Founded by Manu Chopra and Vivek Seshadri in 2021, Karya provides a variety of AI services, including generating high-quality training data, labelling multimodal data, and performing culturally sensitive language model evaluations. By offering industry-leading wages and investing in worker welfare, Karya is pioneering an equitable model in the AI data industry.

Siddharth Jindal

Siddharth is a media graduate who loves exploring tech through journalism and putting forward ideas worth pondering in the era of artificial intelligence.