MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. MALLET includes sophisticated tools for document classification: efficient routines for converting text to ”features”, a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics. In addition to classification, MALLET includes tools for sequence tagging for applications such as named-entity extraction from text. Algorithms include Hidden Markov Models, Maximum Entropy Markov Models, and Conditional Random Fields. These methods are implemented in an extensible system for finite state transducers. ..

References in zbMATH (referenced in 23 articles )

Showing results 1 to 20 of 23.
Sorted by year (citations)

1 2 next

  1. Jacobs, Kayla; Itai, Alon; Wintner, Shuly: Acronyms: identification, expansion and disambiguation (2020)
  2. Chen, Zhe; Doss, Hani: Inference for the number of topics in the latent Dirichlet allocation model via Bayesian mixture modeling (2019)
  3. Aggarwal, Charu C.: Machine learning for text (2018)
  4. George, Clint P.; Doss, Hani: Principled selection of hyperparameters in the latent Dirichlet allocation model (2018)
  5. Gudivada, Venkat N.; Arbabifard, Kamyar: Open-source libraries, application frameworks, and workflow systems for NLP (2018)
  6. Magnusson, Måns; Jonsson, Leif; Villani, Mattias; Broman, David: Sparse partially collapsed MCMC for parallel inference in topic models (2018)
  7. Du, Rundong; Kuang, Da; Drake, Barry; Park, Haesun: DC-NMF: nonnegative matrix factorization based on divide-and-conquer for fast clustering and topic modeling (2017)
  8. Mirylenka, Katsiaryna; Giannakopoulos, George; Do, Le Minh; Palpanas, Themis: On classifier behavior in the presence of mislabeling noise (2017)
  9. Lim, Kar Wai; Buntine, Wray: Bibliographic analysis on research publications using authors, categorical labels and the citation network (2016)
  10. Costa-jussà, Marta R.; Grivolla, Jens; Mellebeek, Bart; Benavent, Francesc; Codina, Joan; Banchs, Rafael E.: Using annotations on mechanical turk to perform supervised polarity classification of Spanish customer comments (2014) ioport
  11. Elloumi, Mourad; Zomaya, Albert Y.: Biological knowledge discovery handbook. Preprocessing, Mining and postprocessing of biological data (2014)
  12. Tellex, Stefanie; Thaker, Pratiksha; Joseph, Joshua; Roy, Nicholas: Learning perceptually grounded word meanings from unaligned parallel data (2014)
  13. Xu, Kaiquan; Liao, Stephen Shaoyi; Lau, Raymond Y. K.; Leon Zhao, J.: Effective active learning strategies for the use of large-margin classifiers in semantic annotation: an optimal parameter discovery perspective (2014)
  14. Das, Shubhomoy; Moore, Travis; Wong, Weng-Keen; Stumpf, Simone; Oberst, Ian; McIntosh, Kevin; Burnett, Margaret: End-user feature labeling: supervised and semi-supervised approaches based on locally-weighted logistic regression (2013)
  15. Zou, Jie; Le, Daniel; Thoma, George R.: Locating and parsing bibliographic references in HTML medical articles (2010) ioport
  16. Biemann, Chris: Unsupervised part-of-speech tagging in the large (2009) ioport
  17. Smith, M.; Giraud-Carrier, C.; Purser, N.: Implicit affinity networks and social capital (2009) ioport
  18. Cesario, Eugenio; Folino, Francesco; Locane, Antonio; Manco, Giuseppe; Ortale, Riccardo: Boosting text segmentation via progressive classification (2008) ioport
  19. Dietterich, Thomas G.; Hao, Guohua; Ashenfelter, Adam: Gradient tree boosting for training conditional random fields (2008)
  20. Rokach, Lior; Romano, Roni; Maimon, Oded: Negation recognition in medical narrative reports (2008) ioport

1 2 next