Set your preference
Font Scaling
Default
Page Scaling
Default
Color Adjustment
Arnab Bhattacharya

Arnab Bhattacharya

PhD (University of California, Santa Barbara)

Professor, Department of Computer Science and Engineering

Research Interest

Databases, Data Mining, Information Retrieval, Natural Language Processing and Artificial Intelligence

Office

RM 409,
Department of Computer Science and Engineering
IIT Kanpur,
Kanpur 208016

Education

PhD, Computer Science, Department of Computer Science, University of California, Santa Barbara, CA 93106, USA. 2007.

M.S., Computer Science, Department of Computer Science, University of California, Santa Barbara, CA 93106, USA. 2007.

Bachelor of Computer Science and Engineering (B.C.S.E.), Jadavpur University, Kolkata - 700032, India. 2001.

Previous Work Experience

Professor, of Computer Science and Engineering, Indian Institute of Technology (IIT), Kanpur, India. December 2020 – present.
Associate Professor, of Computer Science and Engineering, Indian Institute of Technology (IIT), Kanpur, India. June 2014 – December 2020.
Assistant Professor, of Computer Science and Engineering, Indian Institute of Technology (IIT), Kanpur, India. December 2007 – June 2014.
Project Scientist, of Computer Science, University of California, Santa Barbara, USA. Septem- ber 2007 – November 2007.
Software Design Engineer, Texas Instruments (India) Ltd., Bangalore, India. July 2001 – July 2002.

Selected Publications

Books:“Fundamentals of Database Indexing and Searching”. Arnab Bhattacharya. CRC Press, 2014.
Publications:Indian Legal Documents Corpus (ILDC) for Court Judgment Prediction and Explanation. Vijit Malik, Rishabh Sanjay, Shubham Kumar Nigam, Kripabandhu Ghosh, Shouvik Kumar Guha, Arnab Bhat- tacharya, Ashutosh Modi. Joint Conference of Association for Computational Linguistics and Inter- national Joint Conference Natural Language Processing (ACL-IJCNLP), 2021, to appear, Bangkok,
GraphReach: Position-Aware Graph Neural Network using Reachability Estimations. Sunil Nishad, Shubhangi Agarwal, Sayan Ranu, Arnab Bhattacharya. International Joint Conference on Artificial Intelligence (IJCAI), 2021, to appear, Montreal,
TIPS: Mining Top-K Locations to Minimize User-Inconvenience for Trajectory-Aware Services. Shub- hadip Mitra, Priya Saraf, Arnab IEEE Transactions on Knowledge and Data Engineer- ing (TKDE), 2021, 33(3), pages 1238-1250.
How and Why is an Answer (Still) Correct? Maintaining Provenance in Dynamic Knowledge Graphs. Garima Gaur, Arnab Bhattacharya, Srikanta International Conference on Information and Knowledge Management (CIKM), 2020, pages 405-414, Virtual Event, Ireland.
ChiSeL: Graph Similarity Search using Chi-Squared Statistics in Large Probabilistic Graphs. Shub- hangi Agarwal, Sourav Dutta, Arnab Proceedings of the VLDB Endowment (PVLDB), 2020, 13(10), pages 1654-1668.
Framework for Question-Answering in Sanskrit through Automated Construction of Knowledge Hrishikesh Terdalkar, Arnab Bhattacharya. 6th International Sanskrit Computational Linguistics Symposium (ISCLS), 2019, pages 98-117, Kharagpur, India.
RAQ: Relationship-Aware Graph Querying in Large Networks. Jithin Vachery, Akhil Arora, Sayan Ranu, Arnab Bhattacharya. International World Wide Web Conference (WWW), 2019, pages 1886- 1896, San Francisco,
HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Akhil Arora, Sakshi Sinha, Piyush Kumar, Arnab Bhattacharya. Proceedings of the VLDB Endowment (PVLDB), 2018, 11(8), pages 906-919.
Finding Largest Rectangle inside a Digital Object and Rectangularization. Apurba Sarkar, Arindam Biswas, Mousumi Dutt, Arnab Bhattacharya. Journal of Computer and System Sciences, 2018, 95, pages 204-217.
Image Management for Biological Arnab Bhattacharya, Vebjorn Ljosa. Book chapter in Ency- clopedia of Database Systems (2nd Edition) edited by L. Liu and M. T. O¨zsu. Springer, 2018.
MineAr: Using Crowd Knowledge for Mining Association Rules in the Health Domain. Milan Someswar, Arnab Bhattacharya. ACM Joint International Conference on Data Science & Manage- ment of Data (CoDS-COMAD), 2018, pages 108-117, Goa,
Finding Shell Company Accounts using Anomaly Devendra K. Luna, Girish K. Palshikar, Manoj Apte, Arnab Bhattacharya. ACM Joint International Conference on Data Science & Manage- ment of Data (CoDS-COMAD), 2018, pages 167-174, Goa, India.
Tracking the Impact of Fact Deletions on Knowledge Graph Queries using Provenance Garima Gaur, Srikanta J. Bedathur, Arnab Bhattacharya. International Conference on Information and Knowledge Management (CIKM), 2017, pages 2079-2082, Singapore.
SkyGraph: Retrieving Regions of Interest using Skyline Subgraph Queries. Shiladitya Pande, Sayan Ranu, Arnab Bhattacharya. Proceedings of the VLDB Endowment (PVLDB), 2017, 10(11), pages 1382-1393.
NetClus: A Scalable Framework for Locating Top-K Sites for Placement of Trajectory-Aware Ser- Shubhadip Mitra, Priya Saraf, Richa Sharma, Arnab Bhattacharya, Sayan Ranu, Harsh Bhan- dari. International Conference on Data Engineering (ICDE), 2017, pages 87-90, San Diego, USA.
K-Dominant Skyline Join Queries: Extending the Join Paradigm to K-Dominant Anuradha Awasthi, Arnab Bhattacharya, Sanchit Gupta, Ujjwal K. Singh. International Conference on Data Engineering (ICDE), 2017, pages 99-102, San Diego, USA.
Neighbor-Aware Search for Approximate Labeled Graph Matching using the Chi-Square Statistics. Sourav Dutta, Pratik Nayek, Arnab Bhattacharya. International World Wide Web Conference (WWW), 2017, pages 1281-1290, Perth,
Automatic Grading and Feedback using Program Repair for Introductory Programming Courses. Sagar Parihar, Ziyaan Dadachanji, Praveen Kumar Singh, Rajdeep Das, Amey Karkare, Arnab Bhat- ACM Conference on Innovation and Technology in Computer Science Education (ITiCSE), 2017, pages 92-97, Bologna, Italy.
GARUDA: A System for Large-Scale Mining of Statistically Significant Connected Subgraphs. Satya- jit Bhadange, Akhil Arora, Arnab Bhattacharya. Proceedings of the VLDB Endowment (PVLDB), 2016, 9(13), pages 1449-1452. 
SMS: Stable Matching Algorithm using Skylines. Rohit Anurag, Arnab Bhattacharya. International Conference on Scientific and Statistical Database Management (SSDBM), 2016, pages 24:1-24:4, Budapest,
SkyCover: Finding Range-Constrained Approximate Skylines with Bounded Quality Guarantees. Shubhendu Aggarwal, Shubhadip Mitra, Arnab Bhattacharya. International Conference on Man- agement of Data (COMAD), 2016, pages 1-12, Pune,
Finding Largest Rectangle inside a Digital Object. Apurba Sarkar, Arindam Biswas, Mousumi Dutt, Arnab Computational Topology in Image Context (CTIC), 2016, pages 157-169, Mar- seille, France.
Probabilistic Aggregate Skyline Join Queries: Skylines with Aggregate Operations over Existential Uncertain Relations. Arnab Bhattacharya, Shrikant Awate. International Conference on Scientific and Statistical Database Management (SSDBM), 2015, pages 5:1-5:12, San Diego,
Trajectory Aware Macro-cell Planning for Mobile Users. Shubhadip Mitra, Sayan Ranu, Vinay Kolar, Arnab Bhattacharya, Ravi Kokku, Aditya Telang, Sriram IEEE International Conference on Computer Communications (INFOCOM), 2015, 792-800, Hong Kong, China.
Generation of Random Triangular Digital Curves using Combinatorial Techniques. Apurba Sarkar, Arindam Biswas, Mousumi Dutt, Arnab International Conference on Pattern Recogni- tion and Machine Intelligence (PReMI), 2015, pages 136-145, Warsaw, Poland.
Using Social Connections to Improve Collaborative Kanish Manuja, Arnab Bhattacharya.IKDD Conference on Data Sciences (CoDS), 2015, pages 140-141, Bengaluru, India.
Generation of Random Digital Curves using Combinatorial Techniques. Apurba Sarkar, Arindam Biswas, Mousumi Dutt, Arnab Conference on Algorithms and Discrete Applied Math- ematics (CALDAM), 2015, pages 286-297, Kanpur, India.
Mining Statistically Significant Connected Subgraphs in Vertex Labeled Graphs. Akhil Arora, Mayank Sachan, Arnab Bhattacharya. SIGMOD International Conference on Management of Data (SIG- MOD), 2014, pages 1003-1014, Snowbird,
Efficient and Effective Route Planning in Road Networks with Probabilistic Data using Skyline Paths. Arzoo Katiyar, Arnab Bhattacharya, Shubhadip Mitra. IKDD Conference on Data Sciences (CoDS), 2014, New Delhi,
Emotion Recognition from Audio and Visual Data using F-score based Fusion. Abhishek Gera, Arnab IKDD Conference on Data Sciences (CoDS), 2014, New Delhi, India.
RCached-tree: An Index Structure for Efficiently Answering Popular Queries. Manash Pal, Arnab Bhattacharya, Debjyoti International Conference on Information and Knowledge Management (CIKM), 2013, pages 1173-1176, San Francisco, USA.
Efficient Edit Distance based String Similarity Search using Deletion Neighborhoods. Shashwat Mishra, Tejas Gandhi, Akhil Arora, Arnab EDBT/ICDT Workshops, 2013, pages 375- 383, Genoa, Italy.
Hybrid HBase: Leveraging Flash SSDs to Improve Cost per Throughput of Anurag Awasthi, Avani Nandini, Arnab Bhattacharya, Priya Sehgal. International Conference on Management of Data (COMAD), 2012, pages 68-79, Pune, India.
A Plant Identification System using Shape and Morphological Features on Segmented Leaflets: Team IITK, CLEF Akhil Arora, Ankit Gupta, Nitesh Bagmar, Shashwat Mishra, Arnab Bhattacharya. CLEF (Online Notes/Labs/Workshop), 2012, Rome, Italy.
Mining Statistically Significant Substrings using the Chi-Square Statistic. Mayank Sachan, Arnab Proceedings of the VLDB Endowment (PVLDB), 2012, 5(10), pages 1052-1063.
Mining Statistically Significant Substrings Based on the Chi-Square Measure. Sourav Dutta, Arnab Book chapter in Pattern Discovery Using Sequence Data Mining: Applications and Studies edited by P. Kumar, P. R. Krishna and S. B. Raju. IGI Global, 2012.
Minimally Infrequent Itemset Mining using Pattern-Growth Paradigm and Residual Trees. Ashish Gupta, Akshay Mittal, Arnab International Conference on Management of Data (CO- MAD), 2011, pages 57-68, Bengaluru, India. (Best paper)
Caching Stars in the Sky: A Semantic Caching Approach to Accelerate Skyline Arnab Bhat- tacharya, B. Palvali Teja, Sourav Dutta. International Conference on Database and Expert Systems Applications (DEXA), 2011, pages 493-501, Toulouse, France.
A Continuous Query System for Dynamic Route Planning. Nirmesh Malviya, Samuel Madden, Arnab International Conference on Data Engineering (ICDE), 2011, pages 792-803, Han- nover, Germany.
Finding the Bias and Prestige of Nodes in Networks based on Trust Scores. Abhinav Mishra, Arnab International World Wide Web Conference (WWW), 2011, pages 567-576, Hyderabad, India.
Aggregate Skyline Join Queries: Skylines with Aggregate Operations over Multiple Arnab Bhattacharya, B. Palvali Teja. International Conference on Management of Data (COMAD), 2010, pages 15-26, Nagpur, India. (Best student paper)
INSTRUCT: Space-Efficient Structure for Indexing and Complete Query Management of String Sourav Dutta, Arnab Bhattacharya. International Conference on Management of Data (COMAD), 2010, pages 27-38, Nagpur, India.
Simulated Evolution and Proceedings of the 8th International Conference on Simulated Evolution and Learning (SEAL). Co-edited by K. Deb, A. Bhattacharya, N. Chakraborti, P. Chakroborty,
Das, J. Dutta, S. K. Gupta, A. Jain, V. Aggarwal, J. Branke, S. J. Louis, K. C. Tan, Springer, 2010.
Minimum Spanning Tree on Spatio-Temporal Viswanath Gunturi, Shashi Shekhar, Arnab Bhattacharya. International Conference on Database and Expert Systems Applications (DEXA), 2010, pages 149-158, Bilbao, Spain.
Finding Top-k Similar Pairs of Objects Annotated with Terms from an Ontology. Arnab Bhat- tacharya, Abhishek Bhowmick, Ambuj K. Singh. International Conference on Scientific and Sta- tistical Database Management (SSDBM), 2010, pages 214-232, Heidelberg,
Most Significant Substring Mining based on Chi-square Sourav Dutta, Arnab Bhattacharya. Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2010, pages 319-327, Hyderabad, India.
Querying Spatial Vishwakarma Singh, Arnab Bhattacharya, Ambuj K. Singh. International Conference on Extending Database Technology (EDBT), 2010, pages 418-429, Lausanne, Switzer- land.
Image Management for Biological Arnab Bhattacharya, Vebjorn Ljosa. Book chapter in Ency- clopedia of Database Systems edited by M. T. O¨zsu and L. Liu. Springer, 2009.
On Low Distortion Embeddings of Statistical Distance Measures into Low Dimensional Spaces. Arnab Bhattacharya, Purushottam Kar, Manjish Pal. International Conference on Database and Ex- pert Systems Applications (DEXA), 2009, pages 164-172, Linz,
FTDP-17 Mutations in Tau Alter the Regulation of Microtubule Dynamics: An “Alternative Core” Model for Normal and Pathological Tau Action. Adria LeBoeuf, Sasha F. Levy, Michelle Gaylord, Arnab Bhattacharya, Ambuj Singh, Mary Ann Jordan, Leslie Wilson, Stuart C. Feinstein. Journal of Biological Chemistry, 2008, 283(52), pages 36406-36415.
A General Modeling and Visualization Tool for Comparing Different Members of a Group: Appli- cation to Studying Tau-Mediated Regulation of Microtubule Dynamics. Arnab Bhattacharya, Sasha Levy, Adria LeBoeuf, Michelle Gaylord, Leslie Wilson, Ambuj K. Singh, Stuart C. Feinstein. BMC Bioinformatics, 2008, 9, page
Efficient Computation of Statistical Significance of Query Results in Databases. Vishwakarma Singh, Arnab Bhattacharya, Ambuj K. Singh. International Conference on Scientific and Statistical Database Management (SSDBM), 2008, pages 509-516, Hong Kong,
MIST: Distributed Indexing and Querying in Sensor Networks using Statistical Arnab Bhat- tacharya, Anand Meka, Ambuj K. Singh. International Conference on Very Large Data Bases (VLDB), 2007, pages 854-865, Vienna, Austria.
Indexing Spatially Sensitive Distance Measures Using Multi-Resolution Lower Bounds. Vebjorn Ljosa, Arnab Bhattacharya, Ambuj Singh. International Conference on Extending Database Tech- nology (EDBT), 2006, pages 865-883, Munich, Germany.
LB-Index: A Multi-Resolution Index Structure for Vebjorn Ljosa, Arnab Bhattacharya, Am- buj K. Singh. International Conference on Data Engineering (ICDE), 2006, pages 144-145, Atlanta, USA.
ViVo: Visual Vocabulary Construction for Mining Biomedical Arnab Bhattacharya, Vebjorn Ljosa, Jia-Yu Pan, Mark R. Verardo, Hyung-Jeong Yang, Christos Faloutsos, Ambuj K. Singh. Inter- national Conference on Data Mining (ICDM), 2005, pages 50-57, Houston, USA. (One of the top five student papers)
ProGreSS: Simultaneous Searching of Protein Databases by Sequence and Structure. Arnab Bhat- tacharya, Tolga Can, Tamer Kahveci, Ambuj K. Singh, Yuan-Fang Wang. Pacific Symposium on Biocomputing (PSB), 2004, pages 264-275, Hawaii, USA.

Awards & Fellowships

IBM Faculty Research Award
Award from Yahoo! Faculty Research and Engagement Program,
Best paper award at the International Conference on Management of Data (COMAD), 2011 for the paper “Minimally Infrequent Itemset Mining using Pattern-Growth Paradigm and Residual Trees”.
Best student paper award at the International Conference on Management of Data (COMAD), 2010 for the paper “Aggregate Skyline Join Queries: Skylines with Aggregate Operations over Multiple Relations”.
One of the top-five student paper awards at the International Conference on Data Mining (ICDM), 2005 for the paper “ViVo: Visual Vocabulary Construction for Mining Biomedical Images”.
ICDM Student Travel Award sponsored by IBM at the International Conference on Data Mining (ICDM), 2005 awarded to the top five student