Authors
Elena Erosheva, Stephen Fienberg, John Lafferty
Publication date
2004/4/6
Journal
Proceedings of the National Academy of Sciences
Volume
101
Issue
suppl 1
Pages
5220-5227
Publisher
National Academy of Sciences
Description
PNAS is one of the world's most cited multidisciplinary scientific journals. The official PNAS subject classification is reflected in topic labels submitted by the authors of articles, largely related to traditionally established disciplines. These include broad field classifications into physical sciences, biological sciences, and social sciences, with further subtopic classifications within each field. Focusing on biological sciences, we explore an internal soft-classification structure of articles based only on semantic decompositions of abstracts and bibliographies and compare it with the formal discipline classifications. Our model assumes that there is a fixed number of internal categories, each characterized by multinomial distributions over words (in abstracts) and references (in bibliographies). Soft classification for each article is based on proportions of the article's content coming from each category. We discuss the …
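The generative idea in the description can be illustrated with a minimal sketch. It is not the authors' implementation: the Dirichlet prior on membership proportions, the function name `sample_article`, and all parameter names are assumptions made here for illustration. The description itself only states that each category has multinomial distributions over words and references and that articles mix categories in proportions.

```python
import numpy as np

def sample_article(word_dists, ref_dists, alpha, n_words, n_refs, rng):
    """Draw one synthetic article from a mixed-membership sketch.

    word_dists: (K, V) array, row k is category k's distribution over words.
    ref_dists:  (K, R) array, row k is category k's distribution over references.
    alpha:      (K,) Dirichlet parameter (an assumption of this sketch).
    """
    k = word_dists.shape[0]
    # Membership proportions: the share of the article from each category.
    theta = rng.dirichlet(alpha)
    # Each word and each reference first picks a category according to theta ...
    word_cats = rng.choice(k, size=n_words, p=theta)
    ref_cats = rng.choice(k, size=n_refs, p=theta)
    # ... and is then drawn from that category's multinomial distribution.
    words = np.array([rng.choice(word_dists.shape[1], p=word_dists[c])
                      for c in word_cats])
    refs = np.array([rng.choice(ref_dists.shape[1], p=ref_dists[c])
                     for c in ref_cats])
    return theta, words, refs
```

In this sketch, `theta` plays the role of the soft classification: it gives the proportion of an article's content attributed to each category, rather than a single hard label.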
Total citations
[Chart: citations per year, 2004 to 2021]
Google Scholar articles
E Erosheva, S Fienberg, J Lafferty - Proceedings of the National Academy of Sciences, 2004