Yifan Zhang

Senior Software Engineer

Office location

Research Complex B1-1130

Yifan Zhang

Senior Software Engineer

Educational Qualifications

M.S. in Science (with Distinction)

B.Sc. in Engineering

Entity

Qatar Computing Research Institute

Division

Qatar Center for Artificial Intelligence

Biography

Yifan Zhang has 20+ years of experience in research and development in speech and language-related products. Before joining QCRI, he built an information discovery engine for news discovery for Wavii (Google), developed a speech recognition engine serving millions of customers for Nuance and Autonomy. In QCRI, as part of the QCAI team, he works on Fanar, QCRI’s Generative AI project on text, speech, and image.
 

M.S. in Science (with Distinction)

Cardiff University, UK

2004

B.Sc. in Engineering

Xi’an Jiaotong University, China

2001

  • Generative AI - text, speech, image
  • Speech recognition
  • Text mining
  • Software engineering

Senior Software Engineer

Qatar Center for Artificial Intelligence, Qatar Computing Research Institute, HBKU

2013 - Present

Senior Software Engineer

Wavii (Google), US

2012 - 2013

Senior Research Engineer

SpinVox (Nuance), UK

2008 - 2012

R&D Engineer

Autonomy, UK

2005 - 2008

Lead Developer

Tianchuang Software, China

2001 - 2002

Da San Martino, G., Shaar, S., Zhang, Y., Yu, S., Barrón-Cedeño, A., & Nakov, P. (2020). PRTA: A system to support the analysis of propaganda techniques in the news. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations (pp. 287–293). Online.

Zhang, Y., Da San Martino, G., Barrón-Cedeño, A., Romeo, S., An, J., Kwak, H., Staykovski, T., Jaradat, I., Karadzhov, G., Baly, R., Darwish, K., & Glass, P. N. (2019). Tanbih: Get to know what you are reading. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (pp. 223–228). Hong Kong, China.

Ali, A., Bell, P., Gales, M., Kotti, M., Kuo, H.-K. J., Liu, X., Messaoudi, A., Nadejde, M., Renals, S., Taha, Y., & Trmal, J. (2016). The MGB-2 Challenge: Arabic multi-dialect broadcast media recognition. In Proceedings of the IEEE Spoken Language Technology Workshop (SLT 2016). California, USA.

Ali, A., Hamza, W., Lee, J., & Vogel, S. (2015). QATS – The QCRI advanced transcription and translation system. In Proceedings of INTERSPEECH 2015. Dresden, Germany.

Ali, A., Bell, P., & Renals, S. (2014). A complete Kaldi recipe for building Arabic speech recognition systems. In Proceedings of the IEEE Spoken Language Technology Workshop (SLT 2014). Nevada, USA.

  • Best Demo (Honorable Mention), ACL 2020  
  • BBC NewsHack ‘Best Audience Award’ Winner, UK, 2018  
  • BBC NewsHack ‘Best in Show’ Winner, UK, 2014,  
  • Lion Laboratories Prize for Best Project, UK, 2001