MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. MALLET includes sophisticated tools for document classification: efficient routines for converting text to "features", a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics. In addition to classification, MALLET includes tools for sequence tagging for applications such as named-entity extraction from text. Algorithms include Hidden Markov Models, Maximum Entropy Markov Models, and Conditional Random Fields. These methods are implemented in an extensible system for finite state transducers.
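
As a rough illustration of the classification workflow described above (turning text into features, then training a Naïve Bayes classifier), the following is a minimal, library-free sketch. It is not MALLET's API and does not reflect how MALLET is implemented: the names `featurize` and `NaiveBayes`, the whitespace tokenizer, the add-one smoothing, and the toy training data are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def featurize(text):
    """Turn raw text into bag-of-words features: lowercase token -> count."""
    return Counter(text.lower().split())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, docs, labels):
        self.label_counts = Counter(labels)       # documents per label
        self.word_counts = defaultdict(Counter)   # label -> token counts
        self.vocab = set()
        for text, label in zip(docs, labels):
            feats = featurize(text)
            self.word_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, text):
        feats = featurize(text)
        total_docs = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.label_counts.items():
            # log prior plus smoothed log likelihood of each token
            score = math.log(n_docs / total_docs)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for word, count in feats.items():
                score += count * math.log((self.word_counts[label][word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data, invented for this sketch.
train_docs = [
    "cheap pills buy now",
    "buy cheap watches",
    "meeting agenda attached",
    "project meeting tomorrow",
]
train_labels = ["spam", "spam", "ham", "ham"]
model = NaiveBayes().fit(train_docs, train_labels)
```

Calling `model.predict(...)` on a new string returns whichever label has the highest smoothed log-probability. A real toolkit such as MALLET adds configurable tokenization and feature pipelines, further algorithms, and evaluation metrics on top of this basic idea.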

References in zbMATH (referenced in 22 articles)

Showing results 1 to 20 of 22.

  1. Jacobs, Kayla; Itai, Alon; Wintner, Shuly: Acronyms: identification, expansion and disambiguation (2020)
  2. Aggarwal, Charu C.: Machine learning for text (2018)
  3. Dragoni, Mauro; Petrucci, Giulio: A fuzzy-based strategy for multi-domain sentiment analysis (2018)
  4. George, Clint P.; Doss, Hani: Principled selection of hyperparameters in the latent Dirichlet allocation model (2018)
  5. Gudivada, Venkat N.; Arbabifard, Kamyar: Open-source libraries, application frameworks, and workflow systems for NLP (2018)
  6. Du, Rundong; Kuang, Da; Drake, Barry; Park, Haesun: DC-NMF: nonnegative matrix factorization based on divide-and-conquer for fast clustering and topic modeling (2017)
  7. Mirylenka, Katsiaryna; Giannakopoulos, George; Do, Le Minh; Palpanas, Themis: On classifier behavior in the presence of mislabeling noise (2017)
  8. Lim, Kar Wai; Buntine, Wray: Bibliographic analysis on research publications using authors, categorical labels and the citation network (2016)
  9. Costa-jussà, Marta R.; Grivolla, Jens; Mellebeek, Bart; Benavent, Francesc; Codina, Joan; Banchs, Rafael E.: Using annotations on Mechanical Turk to perform supervised polarity classification of Spanish customer comments (2014)
  10. Elloumi, Mourad; Zomaya, Albert Y.: Biological knowledge discovery handbook. Preprocessing, mining and postprocessing of biological data (2014)
  11. Tellex, Stefanie; Thaker, Pratiksha; Joseph, Joshua; Roy, Nicholas: Learning perceptually grounded word meanings from unaligned parallel data (2014)
  12. Xu, Kaiquan; Liao, Stephen Shaoyi; Lau, Raymond Y. K.; Leon Zhao, J.: Effective active learning strategies for the use of large-margin classifiers in semantic annotation: an optimal parameter discovery perspective (2014)
  13. Das, Shubhomoy; Moore, Travis; Wong, Weng-Keen; Stumpf, Simone; Oberst, Ian; McIntosh, Kevin; Burnett, Margaret: End-user feature labeling: supervised and semi-supervised approaches based on locally-weighted logistic regression (2013)
  14. Zou, Jie; Le, Daniel; Thoma, George R.: Locating and parsing bibliographic references in HTML medical articles (2010)
  15. Biemann, Chris: Unsupervised part-of-speech tagging in the large (2009)
  16. Smith, M.; Giraud-Carrier, C.; Purser, N.: Implicit affinity networks and social capital (2009)
  17. Cesario, Eugenio; Folino, Francesco; Locane, Antonio; Manco, Giuseppe; Ortale, Riccardo: Boosting text segmentation via progressive classification (2008)
  18. Dietterich, Thomas G.; Hao, Guohua; Ashenfelter, Adam: Gradient tree boosting for training conditional random fields (2008)
  19. Rokach, Lior; Romano, Roni; Maimon, Oded: Negation recognition in medical narrative reports (2008)
  20. Michelson, Matthew; Knoblock, Craig A.: Unsupervised information extraction from unstructured, ungrammatical data sources on the World Wide Web (2007)
