IIT Guwahati showcases method to detect, correct Wikipedia name errors at AI Impact Summit

ANI Feb 20, 2026 23:05 IST

Guwahati (Assam) [India], February 20 (ANI): Researchers at the Indian Institute of Technology Guwahati have developed a multilingual and scalable method to identify and correct Surface Name Errors (SNEs) in Wikipedia, helping improve information reliability for both human users and artificial intelligence systems.
Wikipedia is a free, multilingual online encyclopaedia created and maintained by a global community of volunteers through open collaboration.
In a press statement, IIT Guwahati stated that a surface name refers to the text used in Wikipedia articles to mention or link to another entity.
"A Surface Name Error (SNE) occurs when this text is incorrect. For example, using a misspelt word like 'Parise' to link to the page for Paris. A study conducted by the IIT Guwahati research team found that about 3% to 6% of all entity mentions in Wikipedia contain Surface Name Errors. While these errors may appear minor, they have significant implications," said the press statement.
For human users, an incorrect surface name can reduce the perceived credibility and reliability of the information provided.
Similarly, many machine learning and deep learning models use Wikipedia as a core dataset. Such errors in surface names can negatively impact AI tasks and model performance.
To address this challenge, Amit Awekar, Associate Professor in the Department of Computer Science and Engineering, along with then M.Tech student Anuj Khare (batch of 2022), built a method that relies on mathematical frequency patterns rather than language-specific rules, making it adaptable across languages.
The first step involved scanning Wikipedia and converting every link into a quadruplet comprising the page where the link appears, the page it points to, the surface name used in the link, and the surrounding textual context.
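The quadruplet described above can be illustrated with a minimal Python sketch. This is not the researchers' code: the `LinkRecord` type, the `extract_links` function, the regular expression for `[[Target|surface]]` wikitext links, and the 30-character context window are all assumptions made for illustration.

```python
import re
from typing import NamedTuple


class LinkRecord(NamedTuple):
    """One link as a quadruplet (field names are hypothetical)."""
    source_page: str   # page where the link appears
    target_page: str   # page the link points to
    surface_name: str  # text shown for the link
    context: str       # text surrounding the link


# Matches simple wikitext links: [[Target]] or [[Target|surface name]].
# Real wikitext has more cases (templates, files, nested markup) that
# this illustrative pattern does not handle.
WIKILINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]|]+))?\]\]")


def extract_links(source_page: str, wikitext: str, window: int = 30) -> list[LinkRecord]:
    """Turn every simple link in `wikitext` into a LinkRecord."""
    records = []
    for match in WIKILINK.finditer(wikitext):
        target = match.group(1).strip()
        # With no explicit surface name, the target title is displayed.
        surface = (match.group(2) or target).strip()
        start = max(0, match.start() - window)
        end = min(len(wikitext), match.end() + window)
        records.append(LinkRecord(source_page, target, surface, wikitext[start:end]))
    return records


records = extract_links("France", "The capital is [[Paris|Parise]], near [[Versailles]].")
```

Here the first record pairs the target page "Paris" with the misspelt surface name "Parise", which is the kind of mention the later steps are meant to catch.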
In the next step, the method reviewed each surface name and considered it correct only if it appeared at least 10 times and accounted for at least 5% of all links pointing to a specific page.
Surface names that did not meet these criteria were flagged as potential errors.
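The frequency rule reported above can be sketched in a few lines of Python. The thresholds (10 occurrences, 5% share) come from the press statement; everything else is an assumption, including the function name `flag_surface_names`, the choice to count occurrences per target page, and the case-sensitive comparison of surface names.

```python
from collections import Counter, defaultdict

MIN_COUNT = 10    # minimum number of occurrences (from the press statement)
MIN_SHARE = 0.05  # minimum share of all links to the page (from the press statement)


def flag_surface_names(links):
    """Return {target_page: [flagged surface names]}.

    `links` is an iterable of (target_page, surface_name) pairs.
    Counting per target page is an assumption of this sketch.
    """
    counts = defaultdict(Counter)
    for target, surface in links:
        counts[target][surface] += 1

    flagged = {}
    for target, surface_counts in counts.items():
        total = sum(surface_counts.values())
        suspects = sorted(
            surface
            for surface, n in surface_counts.items()
            if n < MIN_COUNT or n / total < MIN_SHARE
        )
        if suspects:
            flagged[target] = suspects
    return flagged


links = (
    [("Paris", "Paris")] * 95
    + [("Paris", "City of Light")] * 12
    + [("Paris", "Parise")] * 2
)
flagged = flag_surface_names(links)
```

In this made-up sample, "Parise" appears only twice and is flagged, while "City of Light" passes both thresholds. A rule like this needs no dictionary or grammar for any particular language, which is consistent with the adaptability the team describes.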
In the final step, it categorised the detected errors into "typing mistakes", such as "Gawahati" instead of "Guwahati", or "entity span errors", where extra or incorrect words are mistakenly included in the link.
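The press statement does not say how the two categories are told apart. One plausible heuristic, offered only as an assumption, is to compare a flagged surface name with an accepted name for the same page: if the accepted name sits inside the flagged one as whole words, treat it as an entity span error; if the two strings are merely very similar, treat it as a typing mistake. The `categorise` function and the 0.8 similarity threshold below are hypothetical.

```python
from difflib import SequenceMatcher


def _contains_span(longer, shorter):
    """True if the token list `shorter` occurs contiguously inside `longer`."""
    n = len(shorter)
    return any(longer[i:i + n] == shorter for i in range(len(longer) - n + 1))


def categorise(flagged, accepted, typo_threshold=0.8):
    """Label a flagged surface name relative to an accepted one.

    Heuristic sketch only; not the researchers' classifier.
    """
    flagged_tokens = flagged.lower().split()
    accepted_tokens = accepted.lower().split()

    # Extra words wrapped around an otherwise correct name.
    if len(flagged_tokens) > len(accepted_tokens) and _contains_span(
        flagged_tokens, accepted_tokens
    ):
        return "entity span error"

    # Character-level similarity as a stand-in for typo detection.
    similarity = SequenceMatcher(None, flagged.lower(), accepted.lower()).ratio()
    if similarity >= typo_threshold:
        return "typing mistake"

    return "unclassified"


typo_label = categorise("Gawahati", "Guwahati")
span_label = categorise("city of Guwahati", "Guwahati")
```

With these inputs, "Gawahati" is labelled a typing mistake and "city of Guwahati" an entity span error. Span errors that add incorrect words without containing the accepted name would fall through to "unclassified" in this sketch.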
The researchers tested the method on eight languages (English, Sanskrit, German, Italian, Urdu, Hindi, Marathi, and Gujarati) and reported accurate results.
Speaking about the real-world application of the developed method, Awekar said, "This work shows us that we should not be trusting the data from the web blindly, both for human use and training AI models. Good data is the beginning of any good AI model and downstream application."
To validate the developed method, the research team compared snapshots of English Wikipedia from 2018 and 2022 and found that about 30% of the errors predicted by the method had been corrected on Wikipedia over four years, confirming its accuracy.
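The snapshot comparison amounts to asking what fraction of the links flagged in the earlier snapshot no longer appear in the later one. A minimal sketch follows, assuming each link is identified by a (source page, target page, surface name) triple; the `corrected_fraction` function and the sample data are hypothetical, and a link that disappears for other reasons, such as a deleted paragraph, would also count as corrected here.

```python
def corrected_fraction(predicted_errors, later_links):
    """Share of predicted errors that are absent from a later snapshot.

    Both arguments are iterables of (source_page, target_page, surface_name)
    triples. Absence is used as a proxy for correction in this sketch.
    """
    predicted = set(predicted_errors)
    if not predicted:
        return 0.0
    gone = predicted - set(later_links)
    return len(gone) / len(predicted)


# Ten made-up predictions from an earlier snapshot ...
predicted_2018 = [("Page%d" % i, "Paris", "Parise") for i in range(10)]
# ... of which seven still exist unchanged in the later snapshot.
links_2022 = predicted_2018[:7]

fraction = corrected_fraction(predicted_2018, links_2022)
```

In this toy example three of the ten flagged links are gone from the later snapshot, giving a fraction of 0.3.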
Wikipedia is maintained by volunteers worldwide, and the developed method can help editors identify hidden typos and linking errors that might otherwise remain unnoticed for years.
As further validation, the Wikipedia community has accepted more than 99% of the manual corrections suggested by the researchers.
By combining scalable data processing with practical validation through the Wikipedia community, the IIT Guwahati team has demonstrated an effective approach to strengthening digital knowledge systems. (ANI)
