Topics
Item Response Theory (opens in a new tab)Machine Learning (opens in a new tab)Classifier (opens in a new tab)AI Systems (opens in a new tab)Instance Level (opens in a new tab)Instance Hardness Measures (opens in a new tab)Classification Task (opens in a new tab)Supervised Learning (opens in a new tab)Artificial Intelligence (opens in a new tab)Latent Variables (opens in a new tab)
78 Citations
- João V. C. MoraesJessica T. S. ReinaldoR. PrudêncioTelmo de Menezes e Silva Filho
- 2020
Computer Science
2020 International Joint Conference on Neural…
A new IRT model, particularly designed for dealing with nonnegative unbounded responses, which is adequate for modelling the absolute errors of regression algorithms is proposed.
- 3
- PDF
- Lucas F. F. CardosoJoseph Ribeiro Ronnie Alves
- 2022
Computer Science, Psychology
BRACIS
This research explores the Item Response Theory (IRT) as a tool to explaining the models and measuring the level of reliability of the Explanation-by-Example approach.
- 3
- Highly Influenced[PDF]
- J. Hernández-OralloWout SchellaertFernando Martínez-Plumed
- 2022
Computer Science
AAAI
This paper introduces the concept of an assessor model, \hat{R}(r|\pi,\mu), a conditional probability estimator trained on test data, and proposes accompanying every deployed AI system with its own assessor.
- 7
- PDF
- Ziheng ChenH. Ahn
- 2020
Computer Science, Mathematics
International Journal of Automation and Computing
A novel probabilistic framework to improve the accuracy of a weighted majority voting algorithm by introducing the item response theory (IRT) framework to evaluate the samples’ difficulty and classifiers’ ability simultaneously.
- 33 [PDF]
- Onder Coban
- 2022
Computer Science
Concurr. Comput. Pract. Exp.
Comparisons with the most popular filter‐based FS methods show that it is possible to obtain better results with this new modified selector or one of its variants on the majority of both binary and real‐world datasets compared to its well‐known peers.
- 4
- Raül Fabra-BoludaC. FerriFernando Martínez-PlumedM. J. Ramírez-Quintana
- 2022
Computer Science
EBeM@IJCAI
The robustness of different families of machine learning models are evaluated, which are selected and characterised according to their behaviour and a novel taxonomy is defined based on the robusts of the different models and the difficulty of the instances addressed.
- PDF
- Adrienne S. KlineT. KlineZahra Shakeri Hossein AbadJoon Lee
- 2020
Medicine, Computer Science
Journal of medical Internet research
This demonstration shows that using the item response theory (IRT) to stratify the data based on how difficult each case is to classify, independent of the outcome measure of interest (eg, accuracy), and highlights how well classifiers differentiate cases of varying difficulty.
- 10
- PDF
- Adrienne S. KlineT. KlineZahra Shakeri Hossein AbadJoon Lee
- 2020
Medicine, Computer Science
This study aims to demonstrate how the item response theory can be used to stratify the data based on how difficult each case is to classify, independent of the outcome measure of interest (eg, accuracy), and highlights how well classifiers differentiate cases of varying difficulty.
- PDF
- Lucas F. F. CardosoVitor SantosR. S. K. FrancêsR. PrudêncioRonnie Alves
- 2020
Computer Science
BRACIS
A new evaluation methodology based on IRT as well as the tool decodIRT, developed to guide IRT estimation over ML benchmarks are presented.
- Lucas F. F. CardosoVitor SantosR. S. K. FrancêsR. PrudêncioRonnie Alves
- 2021
Computer Science
ArXiv
This work proposes a new assessment methodology based on the combination of Item Response Theory (IRT) and Glicko-2, a rating system mechanism generally adopted to assess the strength of players, which identified the Random Forest as the algorithm with the best innate ability.
...
...
31 References
- Fernando Martínez-PlumedR. PrudêncioAdolfo Martínez UsóJ. Hernández-Orallo
- 2016
Computer Science, Education
ECAI
In this paper, a series of experiments with a range of datasets and classification methods are performed to fully understand how IRT works and what their parameters really mean in the context of machine learning.
- 68
- PDF
- R. PrudêncioJ. Hernández-OralloA. Mart́ınez-Usó
- 2015
Computer Science, Mathematics
A case study in which instance hardness is measured by fitting the responses of Random Forests with different number of trees is developed, which reveals several insights about different levels of discrimination among instances, the adequate number of Trees in RF and anomalous situations that were related to noisy instances.
- 17
- PDF
- John P. LalorHao WuTsendsuren MunkhdalaiHong Yu
- 2017
Computer Science
ArXiv
This paper investigates how training size and the incorporation of noise affect a DNN's ability to generalize and learn, and finds that different DNN models exhibit different strengths in learning and are robust to noise in training data.
- Michael R. SmithT. MartinezC. Giraud-Carrier
- 2013
Computer Science
Machine Learning
This paper identifies instances that are hard to classify correctly (instance hardness) by classifying over 190,000 instances from 64 data sets with 9 learning algorithms and finds that class overlap is a principal contributor to instance hardness.
- 335
- PDF
- S. VisweswaranG. Cooper
- 2010
Computer Science, Mathematics
J. Mach. Learn. Res.
The ISMB algorithm was evaluated on 21 UCI data sets using five different performance measures and its performance was compared to that of several commonly used predictive algorithms, including nave Bayes, C4.5 decision tree, logistic regression, neural networks, k-Nearest Neighbor, Lazy Bayesian Rules, and AdaBoost.
- 20
- PDF
- C. FerriJ. Hernández-OralloR. Modroiu
- 2009
Computer Science
Pattern Recognit. Lett.
- 733
- PDF
- J. Hernández-Orallo
- 2016
Computer Science
Artificial Intelligence Review
This paper critically assess the different ways AI systems are evaluated, and the role of components and techniques in these systems, and identifies three kinds of evaluation: human discrimination, problem benchmarks and peer confrontation.
- 131
- PDF
- J. Greene
- 2001
Computer Science, Mathematics
This work proposes the use of Thornton’s Separabilit y Index as a simple measure of subset merit which is fast and easy to calculate, but gives results which are identical to the asymptotic result of multiple testing with random data splits.
- 40
- PDF
- Núria MaciàEster Bernadó-Mansilla
- 2014
Computer Science
Inf. Sci.
- 56
- J. A. Olvera-LópezJ. A. Carrasco-OchoaJosé Francisco Martínez TrinidadJ. Kittler
- 2010
Computer Science
Artificial Intelligence Review
This work is focused on presenting a survey of the main instance selection methods reported in the literature, and shows how the training set is reduced which allows reducing runtimes in the classification and/or training stages of classifiers.
- 360
- PDF
...
...
Related Papers
Showing 1 through 3 of 0 Related Papers