Michael Gubanov

Postdoctoral Associate
Computer Science and Artifical Intelligence Laboratory (CSAIL)
Massachusetts Institute of Technology

The Stata Center
32 Vassar St., 32-G904B
Cambridge, MA 02139

I am currently interviewing. Let me know if you need the job application materials


I am a Postdoc at MIT CSAIL with Professor Michael Stonebraker. My current research in Big Data is on Large-scale Data Management, Data Integration and Fusion, Web-search, and Biomedical Informatics

Before MIT, I earned my PhD in Computer Science from the University of Washington working with IBM Almaden Research Center. During my PhD program I also spent some time at Google doing research on large-scale machine learning and Web-search that has been successfully deployed on production clusters at Google. I completed my undergraduate education at St. Petersburg National Research University ITMO (ACM-ICPC World Champions 2013).

Selected publications

  1. Web-scale Synonym Resolution
    Michael Gubanov, Michael Stonebraker MIT NEDB 2014, Cambridge, Massachusetts
  2. Large-scale Semantic Profile Extraction  [bib] [pdf]
    Michael Gubanov, Michael Stonebraker EDBT 2014, Athens, Greece
  3. Text and Structured Data Fusion in DataTamer at Scale  [bib]
    Michael Gubanov, Michael Stonebraker, Daniel Bruckner IEEE ICDE 2014, Chicago, Illinois
  4. Bootstraping Synonym Resolution at Web Scale
    Michael Gubanov, Michael Stonebraker DIMACS/CCICADA Workshop on Big Data Integration 2013, New Brunswick, New Jersey
  5. ReadFast: High-relevance Search-engine for EMR
    Michael Gubanov, Anna Pyayt. MIT Innovations in Health Care Conference 2013, Cambridge, Massachussetts
  6. ReadFast: High-relevance Search-engine for Big Text [bib] [pdf]
    Michael Gubanov, Anna Pyayt. ACM CIKM 2013, San Francisco, California
  7. BigDB: Automatic machine learning optimizer [pdf]
    Michael Gubanov, Anna Pyayt. arXiv:1301.1575 [cs.DB], 2013
  8. Using Unified Famous Objects (UFO) to Automate Alzheimer's Disease Diagnostics. [bib] [pdf]
    Michael Gubanov, Linda Shapiro. IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2010, Atlanta, Georgia
  9. IBM UFO Repository: Object-oriented Data Integration [bib] [pdf] [book]
    Michael Gubanov, Lucian Popa, Howard Ho, Hamid Pirahesh, Jeng-Yih Chang, Shr-Chang Chen. VLDB 2009, Lyon, France
  10. Metadata Management Engine for Data Integration with Reverse-Engineering Support [bib][pdf]
    Michael Gubanov, Phil Berstein, Alex Moshchuk. IEEE ICDE 2008, Cancun, Mexico

Selected talks

  1. Towards gaining control over Big medical data.
    University of California, Irvine, CA, 2012.
  2. Towards gaining control over information overflow.
    University of Central Florida, Orlando, FL, 2012.
  3. Towards gaining control over information overflow.
    University of Kentucky, Lexington, KY, 2012.
  4. Simplifying access to structured and unstructured data.
    Stanford University, Stanford, CA, 2011.
  5. Object-oriented management of structured and unstructured data.
    University of Washington, Seattle, WA, 2010.
  6. Simplifying information integration using Unified Famous Objects (UFO).
    University of Washington, Seattle, WA, 2010.
  7. IBM UFO Repository.
    IBM Almaden Research Center, San Jose, CA, 2007.
  8. Improving local search ranking with NLP.
    Mcrosoft, Seattle, WA, 2006.
  9. Feature engineering architecture for Web-search ranking.
    Mcrosoft, Seattle, WA, 2006.
  10. Improving ranking of product search results.
    Google, Mountain View, CA, 2005.


© Copyright 2014 Michael Gubanov. All rights reserved.