|
|
Michael Gubanov
| Assistant Professor |
News: Three more papers accepted by ACM CIKM 2024, EDBT 2025, and ICDE 2025!
Awarded a grant from FL Department of Health Casey DeSantis Florida Cancer Innovation Fund as a PI for constructing a hybrid Polystore/LLM simplifying access to Cancer Data Lakes!
Awarded an NSF grant as a PI for constructing Web-scale Knowledge Graphs and LLMs for Data Science!
Received the AWS AI Amazon Research Award (ARA)! Thanks to Amazon for supporting BigLab! research!
One more paper by BigLab! accepted to The Web Conference (WWW) 2023, held in Austin, TX this year!
Two more papers accepted, total 3 papers presented at EDBT by BigLab! this year!!
Our COVIDKG.ORG paper was accepted by EDBT 2023!
Our COVID-19 Web-scale vizualization paper was accepted to ACM CIKM 2022!
Our tabular profiling paper was accepted to ACM SIGMOD 2022!
Watch my talk at MIT on our Hybrid Linear Relational Engine
|
| Florida State University |
Computer Science Department
|
| 1017 Academic Way |
| Tallahassee, FL 32304 |
gubanov at cs.fsu.edu
|
|
Publications in DBLP (external publication tracking system)
|
|
|
|
|
|
|
For more information please consult Google Scholar and DBLP
2025-2023
- "Scalable Tabular Hierarchical Metadata Classification in Heterogeneous Structured
Large-scale Datasets using Contrastive Learning"
Bhim Kandibedala, Gyanendra Shrestha, Anna Pyayt, Todor Ivanov, Michael Gubanov, in ICDE, 2025 [pdf]
- "Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting"
Gyanendra Shrestha, Chutian Jiang, Sai Akula, Vivek Yannam, Anna Pyayt, Michael Gubanov, in EDBT, 2025 [pdf]
- "CancerKG.ORG - a Web-scale, Interactive, Verifiable Knowledge Graph-LLM Hybrid for Assisting with Optimal Cancer Treatment and Care"
Michael Gubanov, Anna Pyayt, Aleksandra Karolack, in ACM CIKM, 2024 [pdf]
- "Learning Topical Structured Interfaces from Medical Research Literature"
Maitry Chauhan, Anna Pyayt, Michael Gubanov, in The Web Conference (WWW), 2023 [pdf]
- "COVIDKG.ORG - a Web-scale COVID-19 Interactive, Trustworthy Knowledge Graph, Constructed and Interrogated for Bias using Deep-Learning"
Bhim Kandibedala, Anna Pyayt, Nick Piraino, Chris Caballero, Michael Gubanov, in EDBT, 2023 [pdf]
- "Learning Circular Tabular Embeddings for Heterogeneous Large-scale Structured Datasets"
Michael Gubanov, Anna Pyayt, Sophie Pavia, in EDBT, DOLAP 2023 [pdf]
- "Scalable Metadata Classification in Heterogeneous Large-scale Datasets"
Bhim Kandibedala, Anna Pyayt, Michael Gubanov, in EDBT, DOLAP 2023 [pdf]
|
2022
- "Visualizing and Querying Large-scale Structured Datasets by Learning Multi-layered 3D Meta-Profiles"
Michael Gubanov, Anna Pyayt, Sophie Pavia, in IEEE BigData, 2022 [pdf]
- "Leveraging Scalable Profiling to Learn and Visualize the Latest Trustworthy COVID-19 Medical Research Findings"
Michael Gubanov, Sophie Pavia, Anna Pyayt, William Goble, in ACM CIKM, 2022[pdf]
- "Simplifying Access to Large-scale Structured Datasets by Meta-Profiling with Scalable Training Set Enrichment"
Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in ACM SIGMOD, 2022[pdf]
- "Hybrid Metadata Classification in Large-scale Structured Datasets"
Sophie Pavia, Nick Piraino, Kazi Islam, Anna Pyayt, Michael Gubanov, invited paper in the journal of Data Intelligence, Rinton Press, Special Issue on "Best of DEXA", 2022 [pdf]
2021
- "Scalable Tabular Metadata Location and Classification in Large-scale Structured Datasets"
Kazi Islam, Michael Gubanov, in DEXA, Springer Nature, 2021, online[pdf]
- "Towards Unveiling Dark Web Structured Data"
Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in IEEE BigData, 2021[pdf]
- "Learning Tabular Embeddings at Web Scale"
Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in IEEE BigData, 2021[pdf]
2020
- WebLens: Towards Interactive Large-scale Structured Data Profiling
Rituparna Khan, Michael Gubanov, in ACM CIKM 2020, online [pdf]
- Scalable Linear Algebra on a Relational Database System
Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, in the Communications of the ACM (CACM), 08/2020, Research Highlight[html]
- WebLens: Towards Interactive Web-scale Data Integration, Training the Models
Rituparna Khan, Michael Gubanov, in IEEE BigData 2020, online
[pdf]
- Towards Tabular Embeddings, Training the Relational Models
Rituparna Khan, Michael Gubanov, in IEEE BigData 2020, online
[pdf]
- Rapid Antibiotic Susceptibility Analysis Using Microscopy and Machine Learning
Anna Pyayt, Rituparna Khan, Robert Brzozowski, Prahathees Eswara, Michael Gubanov, in IEEE BigData 2020, online
[pdf]
2019
- Hybrid.Poly: A Consolidated Interactive Analytical Polystore System
Maksim Podkorytov, Michael Gubanov, in ICDE 2019, Macao, China SAR [pdf]
- Scalable Linear Algebra on a Relational Database System
[pdf]
Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, Extended Journal Version, to appear in IEEE Transactions on Knowledge and Data Engineering (TKDE), Special Issue on "Best of ICDE"
2018
- Nested Dolls: Towards Unsupervised Web Tables Clustering [pdf]
Rituparna Khan, Michael Gubanov, in IEEE Bigdata 2018, Seattle, WA
- Hybrid.Poly: Performance Evaluation of Linear Algebra Analytical Extensions [pdf]
Maksim Podkorytov, Michael Gubanov, in IEEE Bigdata 2018, Seattle, WA
- Hybrid.AI: A Learning Search Engine for Large-scale Structured Data
[pdf]
Sean Soderman, Anusha Kola, Maxim Podkorytov, Michael Geyer, Michael Gubanov , in WWW, Search, 2018, Lyon, France
- Scalable Linear Algebra on a Relational Database System
[pdf]
Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, ACM SIGMOD Record, March 2018, special issue for the "2017 ACM SIGMOD Research Highlights"
2017
- CognitiveDB: An Intelligent Navigator for Large-scale Dark Structured Data
[pdf]
Michael Gubanov, Manju Priya, Maxim Podkorytov, in WWW 2017, Perth, Australia
- PolyFuse: A Large-scale Hybrid Data Integration System
[pdf]
Michael Gubanov, in IEEE ICDE DESWEb 2017, San Diego, CA
- Scalable Linear Algebra on a Relational Database System
[pdf]
Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, IEEE ICDE 2017, San Diego, CA Best Paper Award
- Hybrid: A Large-scale In Memory Image Analytics System
[pdf]
Michael Gubanov, in CIDR 2017, Chaminade, CA
- Hybrid.poly: An Interactive Large-scale In-memory Analytical Polystore
[pdf]
Maxim Podkorytov, Dylan Soderman, Michael Gubanov, in ICDM DSBDA 2017, New Orleans, LA
- mHealth Dipstick Analyzer For Monitoring of Pregnancy Complications
Karthik raj Konnaiyan, Surya Cheemalapati, Michael Gubanov and Anna Pyayt, in IEEE Sensors 2017
- Hybrid.JSON: High-velocity Parallel In-Memory Polystore JSON Ingest
[pdf]
Steven Ortiz, Caner Enbatan, Maksim Podkorytov, Dylan Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
- Scalable Spam Classifier for Web Tables
[pdf]
Santiago Villasenor, Tom Nguyen, Anusha Kola, Sean Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
- Hybrid.media: High Velocity Video Ingestion in an In-Memory Scalable Analytical Polystore
[pdf]
Mark Simmons, Daniel Armstrong, Dylan Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
- Generating UFOs from the Classified Object Tables
[pdf]
Anusha Kola, Harshal More, Sean Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
2016-2013
- Type-aware Web search
[pdf]
Michael Gubanov, Anna Pyayt , in EDBT 2016, Bordeaux, France
- mHealth Dipstick Analyzer for Monitoring of Pregnancy Complications
[pdf]
Karthik Konnaiyan, Surya Cheemalapati, Michael Gubanov, Anna Pyayt in IEEE Sensors 2016, Orlando, FL, Best Paper Award
- Real Time Fear Detection Using Wearable Single Channel Electroencephalogram
[pdf]
Surya Cheemalapati, Prashanth Chetlur Adithya, Michael Del Valle, Michael Gubanov, Anna Pyayt, in International Journal of Sensor Networks and Data Communications, 2016
- DataXFormer: Leveraging the Web for Semantic Transformations
[bib] [pdf]
Zia Abedjan, John Morcos, Michael Gubanov, Ihab Ilyas, Michael Stonebraker, Paolo Papotti, Mourad Ouzanni, in CIDR 2015, Asilomar, California
- Large-scale Semantic Profile Extraction
[bib] [pdf]
Michael Gubanov, Michael Stonebraker EDBT 2014, Athens, Greece
- Text and Structured Data Fusion in DataTamer at Scale
[bib] [pdf]
Michael Gubanov, Michael Stonebraker, Daniel Bruckner, IEEE ICDE 2014, Chicago, Illinois
2013
- ReadFast: High-relevance Search-engine for Big Text [bib] [pdf]
Michael Gubanov, Anna Pyat. ACM CIKM 2013, San Francisco, California
|
|
ReadFast: Structural Information Retrieval from Biomedical Big Text by Natural Language Processing [bib][pdf]
Michael Gubanov, Linda Shapiro, Anna Payt.
Invited book chapter in "Information Reuse And Integration In Academia And Industry", Springer 2013
|
- ReadFast: Optimizing Structural Search Relevance for Big Medical Text [bib]
Michael Gubanov, Anna Pyayt. in IEEE Information Reuse and Integration (IRI) 2013, San Francisco, California
- BigDB: Automatic machine learning optimizer [pdf]
Michael Gubanov, Anna Pyayt. arXiv:1301.1575 [cs.DB], 2013
- A real-time classification algorithm for emotion detection using portable EEG
Surya Cheemalapati, Michael Gubanov, Michael Del Valle, Anna Pyayt. IEEE Information Reuse and Integration (IRI), 2013, San Francisco, CA
2006-2012
- MedReadFast: Structural Information Retrieval Engine for Big Clinical Text [bib] [pdf]
Michael Gubanov, Anna Pyayt. IEEE Information Reuse and Integration (IRI) 2012, Las Vegas, Nevada
- ReadFast: Browsing large documents through Unified Famous Objects (UFO). [bib] [pdf]
Michael Gubanov, Linda Shapiro, Anna Payt. IEEE Information Reuse and Integration (IRI) 2011, Las Vegas, Nevada;
- Learning Unified Famous Objects (UFO) to Bootstrap Information Integration. [bib] [pdf] [book]
Michael Gubanov, Linda Shapiro, Anna Payt. IEEE Information Reuse and Integration (IRI) 2011, Las Vegas, Nevada;
- Using Unified Famous Objects (UFO) to Automate Alzheimer's Disease Diagnostics. [bib] [pdf]
Michael Gubanov, Linda Shapiro. IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2010, Atlanta, Georgia
- IBM UFO Repository. [bib] [pdf]
Michael Gubanov, Lucian Popa, C. T. Howard Ho, Hamid Pirahesh, Jeng-Yih Chang, Shr-Chang Chen. VLDB 2009, Lyon, France
- Simplifying Information Integration: Object-Based Flow-of-Mappings Framework for Integration. [bib] [pdf]
Bogdan Alexe, Michael Gubanov, Mauricio A. Hernndez, C. T. Howard Ho, Jen-Wei Huang, Yannis Katsis, Lucian Popa, Barna Saha, Ioana Stanoi. BIRTE 2008
- Model Management Engine for Data Integration with Reverse-Engineering Support. [bib] [pdf]
Michael Gubanov, Philip A. Bernstein, Alexander Moshchuk. ICDE 2008
- Structural text search and comparison using automatically extracted schema. [bib] [pdf]
Michael Gubanov, Philip A. Bernstein. WebDB 2006
|
|
|