BLEU

BLEU (bilingual evaluation understudy) - BLEU: a Method for Automatic Evaluation of Machine Translation. Human evaluations of machine translation are extensive but expensive. Human evaluations can take months to finish and involve human labor that can not be reused. We propose a method of automatic ma- chine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evalu- ation, and that has little marginal cost per run. We present this method as an automated understudy to skilled human judges which substitutes for them when there is need for quick or frequent evaluations.


References in zbMATH (referenced in 71 articles )

Showing results 1 to 20 of 71.
Sorted by year (citations)

1 2 3 4 next

  1. Dognin, Pierre; Melnyk, Igor; Mroueh, Youssef; Padhi, Inkit; Rigotti, Mattia; Ross, Jarret; Schiff, Yair; Young, Richard A.; Belgodere, Brian: Image captioning as an assistive technology: Lessons learned from VizWiz 2020 challenge (2022)
  2. Fan, Angela; Bhosale, Shruti; Schwenk, Holger; Ma, Zhiyi; El-Kishky, Ahmed; Goyal, Siddharth; Baines, Mandeep; Celebi, Onur; Wenzek, Guillaume; Chaudhary, Vishrav; Goyal, Naman; Birch, Tom; Liptchinsky, Vitaliy; Edunov, Sergey; Auli, Michael; Joulin, Armand: Beyond English-centric multilingual machine translation (2021)
  3. Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu: CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation (2021) arXiv
  4. Tao Gui, Xiao Wang, Qi Zhang, Qin Liu, Yicheng Zou, Xin Zhou, Rui Zheng, Chong Zhang, Qinzhuo Wu, Jiacheng Ye, Zexiong Pang, Yongxin Zhang, Zhengyan Li, Ruotian Ma, Zichu Fei, Ruijian Cai, Jun Zhao, Xinwu Hu, Zhiheng Yan, Yiding Tan, Yuan Hu, Qiyuan Bian, Zhihua Liu, Bolin Zhu, Shan Qin, Xiaoyu Xing, Jinlan Fu, Yue Zhang, Minlong Peng, Xiaoqing Zheng, Yaqian Zhou, Zhongyu Wei, Xipeng Qiu, Xuanjing Huang: TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing (2021) arXiv
  5. Tripathy, Jatin Karthik; Sethuraman, Sibi Chakkaravarthy; Cruz, Meenalosini Vimal; Namburu, Anupama; P., Mangalraj; R., Nandha Kumar; S., Sudhakar Ilango; Vijayakumar, Vaidehi: Comprehensive analysis of embeddings and pre-training in NLP (2021)
  6. Yu Li, Josh Arnold, Feifan Yan, Weiyan Shi, Zhou Yu: LEGOEval: An Open-Source Toolkit for Dialogue System Evaluation via Crowdsourcing (2021) arXiv
  7. Zhang, Meishan; Li, Zhenghua; Fu, Guohong; Zhang, Min: Dependency-based syntax-aware word representations (2021)
  8. Amosov, O. S.; Amosova, S. G.; Zhiganov, S. V.; Ivanov, Yu. S.; Pashchenko, F. F.: Computational method for recognizing situations and objects in the frames of a continuous video stream using deep neural networks for access control systems (2020)
  9. Huang, Feicheng; Li, Zhixin; Wei, Haiyang; Zhang, Canlong; Ma, Huifang: Boost image captioning with knowledge reasoning (2020)
  10. Jialun Cao, Meiziniu Li, Yeting Li, Ming Wen, Shing-Chi Cheung: SemMT: A Semantic-based Testing Approach for Machine Translation Systems (2020) arXiv
  11. Kool, Wouter; van Hoof, Herke; Welling, Max: Ancestral Gumbel-top-(k) sampling for sampling without replacement (2020)
  12. Tom B Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al.: Language Models are Few-Shot Learners (2020) arXiv
  13. Zhang, Jiajun; Zhou, Long; Zhao, Yang; Zong, Chengqing: Synchronous bidirectional inference for neural sequence generation (2020)
  14. Aakur, Sathyanarayanan N.; Dias Moreira de Souza, Fillipe; Sarkar, Sudeep: Generating open world descriptions of video using common sense knowledge in a pattern theory framework (2019)
  15. Nguyen, Van Duc; Son, Tran Cao; Pontelli, Enrico: Natural language generation for non-expert users (2019)
  16. Kostin, A. V.; Smirnov, Vitaliĭ V.: Functionality evaluation model for machine translation systems (2018)
  17. Liu, Sensen; Ching, Shinung: Recurrent information optimization with local, metaplastic synaptic dynamics (2017)
  18. Xiong, Deyi; Meng, Fandong; Liu, Qun: Topic-based term translation models for statistical machine translation (2016) ioport
  19. Costa-Jussà, Marta R.; Farrús, Mireia: Statistical machine translation enhancements through linguistic levels: a survey (2014)
  20. Patterson, Genevieve; Xu, Chen; Su, Hang; Hays, James: The SUN attribute database: beyond categories for deeper scene understanding (2014) ioport

1 2 3 4 next