
Notes on the bias of dissimilarity indices for incomplete data sets: the case of archaeological classification.

English title Notes on the bias of dissimilarity indices for incomplete data sets: the case of archaeological classification.
Spanish title Notas sobre el sesgo de índices de disimilaridad para conjuntos de datos incompletos: el caso de clasificación arqueológica.
Author(s) Montanari, Angela ; Mignani, Stefania
Affiliation Dip. Stat. Univ. Studi Bologna, Bologna, Italy
Journal (ISSN) 0210-8054
Publication 1994, 18 (1): 39-49, 15 refs.
Document type Article
Language English
English abstract Missing data are a particularly common problem in archaeological research, where the fragmentary state of the finds means that only some of the characteristics of the whole object can be observed. The performance of various dissimilarity indices that weight missing values differently is studied on archaeological data via a simulation. An alternative solution, which consists of randomly substituting missing values with character sets, is also examined. Gower's dissimilarity coefficient appears to be the least biased, both with 25% and with 49% missing values; the sign of its bias, however, is not constant. The simulation experiment also shows that, when average linkage cluster analysis is performed on an incomplete data set, both using Gower's index and randomly substituting missing values give satisfactory results, while the modified indices fail to detect the cluster structure.
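The abstract does not state the formulas used, so the following is only a minimal sketch of the two approaches it names, under stated assumptions: Gower's coefficient is taken in its usual form for categorical characters (the share of mismatches among the characters observed in both objects), and random substitution is assumed to draw uniformly from the possible states of each character. The names `MISSING`, `gower_dissimilarity`, `impute_randomly` and the toy pottery records are hypothetical and do not come from the article.

```python
import random

MISSING = None  # assumed marker for a character that could not be observed


def gower_dissimilarity(a, b):
    """Share of mismatching characters among those observed in both records.

    Characters missing in either record are skipped, which is the usual
    way Gower's coefficient handles incomplete categorical data.
    """
    compared = 0
    mismatches = 0
    for x, y in zip(a, b):
        if x is MISSING or y is MISSING:
            continue  # this character gets weight 0
        compared += 1
        if x != y:
            mismatches += 1
    if compared == 0:
        return float("nan")  # no character observed in both records
    return mismatches / compared


def impute_randomly(record, states, rng):
    """Replace each missing character with a state drawn uniformly at random.

    Uniform sampling is an assumption; the article may use another scheme.
    """
    return [
        rng.choice(states[k]) if value is MISSING else value
        for k, value in enumerate(record)
    ]


# Hypothetical finds described by four categorical characters.
states = [["round", "oval"], ["red", "black"], ["handle", "no_handle"], ["glazed", "plain"]]
complete = ["round", "red", "handle", "glazed"]
fragment = ["round", MISSING, "no_handle", MISSING]

d = gower_dissimilarity(complete, fragment)  # 1 mismatch out of 2 compared characters
filled = impute_randomly(fragment, states, random.Random(0))
```

In this sketch the fragment is compared only on shape and handle, so the missing colour and glaze neither raise nor lower the dissimilarity, whereas the imputed record can be compared on all four characters at the cost of added random noise.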
UNESCO classification 120909
Spanish keywords Cluster analysis ; Multivariate analysis ; Data analysis