Publications

Discriminative Random Walks with Restart (DRaWR)

Blatti C and Sinha S (Project Lead) (March 2016). Characterizing Gene Sets using Discriminative Random Walks with Restart on Heterogeneous Biological Networks. Bioinformatics 2016: Link

False Discovery Rate Control for Simultaneously Significant Features

Zhao, S.D. (2015). False discovery rate control for identifying simultaneous signals. Link

Data Spread

Bendre M, Sun B, Zhang D, Zhou, Chang K.C, Parameswaran A. (August 2015). DATASPREAD: Unifying Databases and Spreadsheets. Proceedings of the VLDB Endowment, Volume 8, No. 12 (2000 – 2003). Link

Zhuang H, Parameswaran A, Roth D, Han J. (August 2015). Debiasing Crowdsourced Batches. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (1593-1602). Link

Bhardwaj A, Deshpande A, Elmore A.J, Karger D, Madden S, Parameswaran A, Subramanyam H, Wu E, Zhang R. (August 2015). Collaborative Data Analytics with DataHub. Proceedings of the VLDB Endowment, Volume 8, No. 12 (1916 – 1919). Link

Bhattacherjee S, Chavan A, Huang S, Deshpande A, Parameswaran A. (August 2015). Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff. Proceedings of the VLDB Endowment, Volume 8, No. 12 (1346 – 1357). Link

Chavan A, Huang S, Deshpande A, Elmore A, Madden S, Parameswaran A. (July 2015). Towards a Unified Query Language for Provenance and Versioning. 7th International Workshop on Theory and Practice of Provenance (TaPP), Edinburgh, Scotland. Link

Vartak M, Rahman S, Madden S, Parameswaran A, Polyzotis N. SeeDB: Efficient data-driven visualization recommendations to support visual analytics. Proceedings of the VLDB Endowment. 2015. Link

Kim A, Blais E, Parameswaran A, Indyk P, Madden S, Rubinfeld R. (January 2015). Rapid Sampling for Visualizations with Ordering Guarantees. Proceedings of the VLDB Endowment, Volume 8, No. 5 (521 – 532). Link

Bio-Text Mining

Zhang C, Jiang S, Chen Y, Sun Y, Han J. (September 2015). Fast inbound top-K query for random walk with restart. Proceedings of 2015 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD’15), Porto, Portugal (608-624). Link

Zhi S, Han J, Gu Q. (September 2015). Robust classification of information networks by consistent graph learning. In: Machine learning and knowledge discovery in databases. Proceedings of 2015 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD’15), Porto, Portugal (752-767). Link

Ren X, El-Kishky A, Wang C, Han J. (August 2015). Automatic entity recognition and typing from massive text corpora: A phrase and network mining approach. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2319-2320). Link

Ren X, El-Kishky A, Wang C, Tao F, Voss CR, Han J. (August 2015). ClusType: Effective entity recognition and typing by relation phrase-based clustering. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (995-1004). Link

Kim Y, Han J, Yuan C. (August 2015). TOPTRAC: Topical trajectory pattern mining. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (587-596). Link

Zhi S, Zhao B, Tong W, et al. (August 2015). Modeling truth existence in truth discovery. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (1543-1552). Link

Wang C, Song Y, El-Kishky A, Roth D, Zhang M, Han J. Incorporating world knowledge to document clustering via heterogeneous information networks. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (1215-1224). Link

Wang C, Liu X, Song Y, Han J. (August 2015). Towards interactive construction of topical hierarchy: A recursive tensor decomposition approach. Proc. of 2015 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD’15), Sydney, Australia (1225-1234). Link

Zhang C, Zheng Y, Ma X, Han J. (August 2015). Assembler: Efficient discovery of spatial co-evolving patterns in massive geo-sensory data. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (1415-1424). Link

Li Y, Li Q, Gao J, Su L, Zhao B, Fan W, Han J. (August 2015). On the discovery of evolving truth. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (675-684). Link

Liu J, Shang J, Wang C, Ren X, Han J. (May 2015). Mining quality phrases from massive text corpora. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Australia (1729-1744). Link

Wan M, Ouyang Y, Kaplan L, Han J. (April 2015). Graph regularized meta-path based transductive regression in heterogeneous information network. Proceedings of 2015 SIAM International Conference on Data Mining (SDM’15), Vancouver, Canada. Link

Kuck J, Zhuang H, Yan X, Cam H, Han J. (March 2015). Query-based outlier detection in heterogeneous information networks. Proceedings of the 18th International Conference on Extending Database Technology (EDBT), Brussels, Belgium (325-336). Link

Tao F, Zhao B, Fuxman A, Li Y, Han J. (2015). Leveraging pattern semantics for extracting entities in enterprises. Proceedings of the International Conference on World Wide Web, Florence, Italy (1078-1088). Link

Saul Language Programming

Kordjamshidi P, Massa W, Provoost T, Moens M. (2015). Machine Reading for Extraction of Bacteria and Habitat Taxonomies. Lecture Notes in Computer Science (LNCS), Communications in Computer and Information Science (CCIS) series. Link

Kordjamshidi P, Roth D, Wu H. (2015). Saul: Towards declarative learning based programming. Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (1844-1851). Link

Song Y, Peng H, Kordjamshidi P, Sammons M, Roth D. (2015). Improving a Pipeline Architecture for Shallow Discourse Parsing. Proceedings of the 19th Conference on Computational Natural Language Learning (78-83). Link

Kordjamshidi P, Roth D, Moens M. (2015). Structured learning for spatial information extraction from biomedical text: bacteria biotopes. BMC Bioinformatics 2015. Link

Transcriptional Regulators of Drug Response

Hanson C, Cairns J, Wang L, Sinha S. (2015). Computational discovery of transcription factors associated with drug response. Pharmacogenomics J. 2015 Oct 27. Link

Wang S, Cho H, Zhai C, Berger B, Peng J. Exploiting ontology graph for predicting sparsely annotated gene function. Bioinformatics. Link

KnowEnG

Sinha S, Song J, Weinshilboum R, Jongeneel V, Han J. KnowEnG: A knowledge engine for genomics. J Am Med Inform Assoc. 2015;22(6):1115-1119. Link

Stephens ZD, Lee SY, Faghri F, et al. Big data: Astronomical or genomical? PLoS Biol. 2015;13(7):e1002195. Link