Item response theory in AI: Analysing machine learning classifiers at the instance level

Topics

Item Response Theory, Machine Learning, Classifier, AI Systems, Instance Level, Instance Hardness Measures, Classification Task, Supervised Learning, Artificial Intelligence, Latent Variables

78 Citations

Item Response Theory for Evaluating Regression Algorithms

João V. C. Moraes, Jessica T. S. Reinaldo, R. Prudêncio, Telmo de Menezes e Silva Filho

Computer Science

2020 International Joint Conference on Neural…

2020

A new IRT model is proposed, designed specifically for nonnegative unbounded responses, which makes it suitable for modelling the absolute errors of regression algorithms.

Explanation-by-Example Based on Item Response Theory

Lucas F. F. Cardoso, Joseph Ribeiro, Ronnie Alves

Computer Science, Psychology

BRACIS

2022

This research explores Item Response Theory (IRT) as a tool for explaining models and measuring the level of reliability of the Explanation-by-Example approach.

Training on the Test Set: Mapping the System-Problem Space in AI

J. Hernández-Orallo, Wout Schellaert, Fernando Martínez-Plumed

Computer Science

AAAI

2022

This paper introduces the concept of an assessor model, \hat{R}(r|\pi,\mu), a conditional probability estimator trained on test data, and proposes accompanying every deployed AI system with its own assessor.

Item Response Theory Based Ensemble in Machine Learning

Ziheng Chen, H. Ahn

Computer Science, Mathematics

International Journal of Automation and Computing

2020

A novel probabilistic framework to improve the accuracy of a weighted majority voting algorithm by introducing the item response theory (IRT) framework to evaluate the samples’ difficulty and classifiers’ ability simultaneously.

31 References

Making Sense of Item Response Theory in Machine Learning

Fernando Martínez-Plumed, R. Prudêncio, Adolfo Martínez Usó, J. Hernández-Orallo

Computer Science, Education

ECAI

2016

In this paper, a series of experiments with a range of datasets and classification methods is performed to fully understand how IRT works and what its parameters really mean in the context of machine learning.

Analysis of instance hardness in machine learning using item response theory

R. Prudêncio, J. Hernández-Orallo, A. Martínez-Usó

Computer Science, Mathematics

2015

A case study is developed in which instance hardness is measured by fitting the responses of Random Forests with different numbers of trees. It reveals several insights about the different levels of discrimination among instances, the adequate number of trees in RF, and anomalous situations related to noisy instances.

An Analysis of Machine Learning Intelligence

John P. Lalor, Hao Wu, Tsendsuren Munkhdalai, Hong Yu

Computer Science

ArXiv

2017

This paper investigates how training size and the incorporation of noise affect a DNN's ability to generalize and learn, and finds that different DNN models exhibit different strengths in learning and are robust to noise in training data.

An instance level analysis of data complexity

Michael R. Smith, T. Martinez, C. Giraud-Carrier

Computer Science

Machine Learning

2013

This paper identifies instances that are hard to classify correctly (instance hardness) by classifying over 190,000 instances from 64 data sets with 9 learning algorithms and finds that class overlap is a principal contributor to instance hardness.

Learning Instance-Specific Predictive Models

S. Visweswaran, G. Cooper

Computer Science, Mathematics

J. Mach. Learn. Res.

2010

The ISMB algorithm was evaluated on 21 UCI data sets using five different performance measures and its performance was compared to that of several commonly used predictive algorithms, including naive Bayes, C4.5 decision tree, logistic regression, neural networks, k-Nearest Neighbor, Lazy Bayesian Rules, and AdaBoost.

An experimental comparison of performance measures for classification

C. Ferri, J. Hernández-Orallo, R. Modroiu

Computer Science

Pattern Recognit. Lett.

2009

Evaluation in artificial intelligence: from task-oriented to ability-oriented measurement

J. Hernández-Orallo

Computer Science

Artificial Intelligence Review

2016

This paper critically assesses the different ways AI systems are evaluated, and the role of components and techniques in these systems, and identifies three kinds of evaluation: human discrimination, problem benchmarks and peer confrontation.

Feature subset selection using Thornton’s separability index and its applicability to a number of sparse proximity-based classifiers

J. Greene

Computer Science, Mathematics

2001

This work proposes the use of Thornton’s Separability Index as a simple measure of subset merit which is fast and easy to calculate, but gives results which are identical to the asymptotic result of multiple testing with random data splits.

Towards UCI+: A mindful repository design

Núria Macià, Ester Bernadó-Mansilla

Computer Science

Inf. Sci.

2014

A review of instance selection methods

J. A. Olvera-López, J. A. Carrasco-Ochoa, José Francisco Martínez Trinidad, J. Kittler

Computer Science

Artificial Intelligence Review

2010

This work surveys the main instance selection methods reported in the literature and shows how reducing the training set shortens runtimes in the classification and/or training stages of classifiers.

...

Item response theory in AI: Analysing machine learning classifiers at the instance level | Semantic Scholar (2024)
