Effective April 2020, I am on an extended leave of absence from IISc and away from my administrative duties. I continue to advise my existing students at IISc. However, until further notice, I will not be taking on new students (PhD/Masters/ERP), project staff, or interns.

News & Activities



Research


I am broadly interested in Natural Language Processing, Machine Learning, and Knowledge Graphs. My recent research has focused on graph-based learning algorithms for large-scale information extraction and data integration, temporal information processing, automatic knowledge harvesting from large data, and neuro-semantics.

My research group at IISc: Machine And Language Learning (MALL) Lab

To learn more about AI@IISc, visit here. I also continue to actively collaborate with the following research groups at CMU: Read the Web (CMU), CMU Brain Research Group

Past Research Groups: Search Labs (Microsoft Research), Structured Learning at Penn, Penn Research in Machine Learning (PRIML), Penn Natural Language Processing, Penn BioIE Group.

Tutorials


  • Graph Neural Networks for Natural Language Processing (EMNLP 2019 and CODS-COMAD 2020)

  • Never-Ending Learning [Videos: [1][2]] (Invited tutorial at ICML 2019, with Tom Mitchell (CMU))

  • Knowledge Extraction and Inference from Text (KDD 2018)

  • Knowledge Extraction and Inference from Text (CIKM 2017, with Soumen Chakrabarti (IIT Bombay))

  • Graph-based Semi-supervised Learning (ACL 2012 and ICASSP 2013, with Amarnag Subramanya (Google))

Publications

    [Google Scholar] [Arxiv]


    Book

    Graph-based Semi-Supervised Learning. Amarnag Subramanya, Partha Pratim Talukdar. Morgan & Claypool Publishers. [Amazon]


    2023

    Self-influence Guided Data Reweighting for Language Model Pre-training
    Megh Thakkar, Sriram Ganapathy, Shikhar Vashishth, Tolga Bolukbasi, Sarath Chandar, Partha Talukdar
    EMNLP 2023

    XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
    Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, Dmitry Panteleev, Partha Talukdar
    EMNLP 2023 Findings

    Label Aware Speech Representation Learning For Language Identification
    Shikhar Vashishth, Shikhar Bharadwaj, Sriram Ganapathy, Ankur Bapna, Min Ma, Wei Han, Vera Axelrod, Partha Talukdar
    Interspeech 2023

    Parameter-Efficient Finetuning for Robust Continual Multilingual Learning
    Kartikeya Badola, Shachi Dave, Partha Talukdar
    ACL 2023 Findings

    Bootstrapping Multilingual Semantic Parsers using Large Language Models
    Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi, Partha Talukdar
    EACL 2023

    TwiRGCN: Temporally Weighted Graph Convolution for Question Answering over Temporal Knowledge Graphs
    Aditya Sharma, Apoorv Saxena, Chitrank Gupta, Seyed Mehran Kazemi, Partha Talukdar, Soumen Chakrabarti
    EACL 2023

    Evaluating the Diversity, Equity, and Inclusion of NLP Technology: A Case Study for Indian Languages
    Simran Khanuja, Sebastian Ruder, Partha Talukdar
    Findings of EACL 2023

    Salient Span Masking for Temporal Understanding
    Jeremy Cole, Aditi Chaudhary, Bhuwan Dhingra and Partha Talukdar
    EACL 2023


    2022

    Re-contextualizing Fairness in NLP: The Case of India
    Shaily Bhatt, Sunipa Dev, Partha Talukdar, Shachi Dave, Vinodkumar Prabhakaran
    AACL-IJCNLP 2022

    When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer
    Ameet Deshpande, Partha Talukdar, Karthik Narasimhan
    NAACL 2022

    Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
    Vaidehi Patil, Partha Talukdar, Sunita Sarawagi
    ACL 2022

    Few-shot Controllable Style Transfer for Low-Resource Multilingual Settings
    Kalpesh Krishna, Deepak Nathani, Xavier Garcia, Bidisha Samanta, Partha Talukdar
    ACL 2022

    GPU-accelerated connectome discovery at scale
    Varsha Sreenivasan, Sawan Kumar, Franco Pestilli, Partha Talukdar, Devarajan Sridharan
    Nature Computational Science, volume 2, pages 298–306 (2022)

    Walking with PACE - Personalized and Automated Coaching Engine
    Madhurima Vardhan, Narayan Hegde, Srujana Merugu, Shantanu Prabhat, Deepak Nathani, Martin Seneviratne, Nur Muhammad, Pranay Reddy, Sriram Lakshminarasimhan, Rahul Singh, Karina Lorenzana, Eshan Motwani, Partha Talukdar, Aravindan Raghuveer
    30th ACM UMAP 2022 (Best Paper Award)


    2021

    Question Answering over Temporal Knowledge Graphs [Code]
    Apoorv Saxena, Soumen Chakrabarti and Partha Talukdar
    ACL 2021

    MergeDistill: Merging Language Models using Pre-trained Distillation
    Simran Khanuja, Melvin Johnson and Partha Talukdar
    Findings of ACL 2021

    Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study [Code]
    Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar and Sunita Sarawagi
    ACL 2021

    Reordering Examples Helps during Priming-based Few-shot Learning [Code]
    Sawan Kumar, Partha Talukdar
    Findings of ACL 2021

    OKGIT: Open Knowledge Graph Link Prediction with Implicit Types [Code]
    Chandrahas, Partha Talukdar
    Findings of ACL 2021

    Spatial Reasoning from Natural Language Instructions for Robot Manipulation
    Sagar Gubbi Venkatesh, Anirban Biswas, Raviteja Upadrashta, Vikram Srinivasan, Partha Talukdar, Bharadwaj Amrutur
    ICRA 2021

    Graph Neural Networks for Soft Semi-Supervised Learning on Hypergraphs [Code]
    Naganand Yadati, Tingran Gao, Shahab Asoodeh, Partha Talukdar, Anand Louis
    PAKDD 2021

    MuRIL: Multilingual Representations for Indian Languages
    Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, Shruti Gupta, Subhash Chandra Bose Gali, Vish Subramanian, Partha Talukdar
    Preprint: arXiv:2103.10730


    2020

    NHP: Neural Hypergraph Link Prediction
    Naganand Yadati, Vikram Nitin, Madhav Nimishakavi, Prateek Yadav, Anand Louis and Partha Talukdar
    CIKM 2020

    Natural Language Inference with Faithful Natural Language Explanations [Code]
    Sawan Kumar and Partha Talukdar
    ACL 2020

    Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings [Code]
    Apoorv Saxena, Aditay Tripathi and Partha Talukdar
    ACL 2020

    A Re-evaluation of Knowledge Graph Completion Methods [Code]
    Zhiqing Sun, Shikhar Vashishth, Soumya Sanyal, Partha Talukdar and Yiming Yang
    ACL 2020 [Short Paper]

    Syntax-guided Controlled Generation of Paraphrases [Code]
    Ashutosh Kumar, Kabir Ahuja, Raghuram Vadapalli, Partha Talukdar
    Transactions of the ACL (TACL) 2020 [To be presented at ACL 2020]

    Composition-based Multi-Relational Graph Convolutional Networks [Code]
    Shikhar Vashishth*, Soumya Sanyal*, Vikram Nitin, and Partha Talukdar
    ICLR 2020

    ASAP: Adaptive Structure Aware Pooling for Learning Hierarchical Graph Representations [Code]
    Ekagra Ranjan, Soumya Sanyal, Partha Talukdar
    AAAI 2020

    InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions [Code]
    Shikhar Vashishth*, Soumya Sanyal*, Vikram Nitin, Nilesh Agrawal, Partha Talukdar
    AAAI 2020

    P-SIF: Document Embeddings using Partition Averaging
    Vivek Gupta, Ankit Saw, Pegah Nokhiz, Praneeth Netrapalli, Piyush Rai, Partha Talukdar
    AAAI 2020

    Improving Document Classification with Multi-Sense Embeddings
    Vivek Gupta, Ankit Kumar Saw, Pegah Nokhiz, Harshit Gupta, and Partha Talukdar
    ECAI 2020


    2019

    HyperGCN: A New Method of Training Graph Convolutional Networks on Hypergraphs [Code]
    Naganand Yadati, Madhav Nimishakavi, Prateek Yadav, Vikram Nitin, Anand Louis, Partha Talukdar
    NeurIPS 2019, Canada

    CaRe: Open Knowledge Graph Embeddings [Code]
    Swapnil Gupta, Sreyash Kenkre and Partha Talukdar
    EMNLP 2019, Hong Kong

    Zero-shot Word Sense Disambiguation using Sense Definition Embeddings [Code]
    Sawan Kumar, Sharmistha Jat, Karan Saxena and Partha Talukdar
    ACL 2019, Italy [Recipient of Outstanding Paper Award]

    Relating Simple Sentence Representations in Deep Neural Networks and Brain
    Sharmistha Jat, Hao Tang, Partha Talukdar and Tom Mitchell
    ACL 2019, Italy [Press]

    Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks [Code]
    Shikhar Vashishth, Manik Bhandari, Prateek Yadav, Piyush Rai, Chiranjib Bhattacharyya and Partha Talukdar
    ACL 2019, Italy

    Submodular Optimization-based Diverse Paraphrasing and its Effectiveness in Data Augmentation [Code]
    Ashutosh Kumar*, Satwik Bhattamishra*, Manik Bhandari and Partha Talukdar
    NAACL 2019, USA.

    Confidence-based Graph Convolutional Networks for Semi-Supervised Learning [Code]
    Shikhar Vashishth*, Prateek Yadav*, Manik Bhandari*, Partha Talukdar
    AISTATS 2019, Japan

    Lovasz Convolutional Networks [Code]
    Prateek Yadav, Madhav Nimishakavi, Naganand Yadati, Shikhar Vashishth, Arun Rajkumar and Partha Talukdar
    AISTATS 2019, Japan

    KVQA: Knowledge-aware Visual Question Answering [Project and Data Page]
    Sanket Shah*, Anand Mishra*, Naganand Yadati and Partha Talukdar
    Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), USA

    ReAl-LiFE: Accelerating the Discovery of Individualized Brain Connectomes on GPUs
    Sawan Kumar*, Varsha Sreenivasan*, Partha Talukdar, Franco Pestilli, Devarajan Sridharan
    Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), USA

    (*: Equal contributions)


    2018

    HyTE: Hyperplane-based Temporally aware Knowledge Graph Embedding [Code]
    Shib Sankar Dasgupta, Swayambhu Nath Ray and Partha Talukdar
    International Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), Belgium

    AD3: Attentive Deep Document Dater [Code]
    Swayambhu Nath Ray, Shib Sankar Dasgupta and Partha Talukdar
    International Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), Belgium

    RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information [Code]
    Shikhar Vashishth, Rishabh Joshi, Sai Suman Prayaga, Chiranjib Bhattacharyya and Partha Talukdar
    International Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), Belgium

    MT-CGCNN: Integrating Crystal Graph Convolutional Neural Network with Multitask Learning for Material Property Prediction
    S Sanyal, J Balachandran, N Yadati, A Kumar, P Rajagopalan, S Sanyal, P Talukdar
    NIPS 2018 Workshop on Machine Learning for Molecules and Materials, Canada.

    Inductive Framework for Multi-Aspect Streaming Tensor Completion with Side Information [Code]
    Madhav Nimishakavi, Bamdev Mishra, Manish Gupta and Partha Talukdar
    27th International Conference on Information and Knowledge Management (CIKM 2018), Italy.
    [Acceptance rate: 17%]

    Dating Documents using Graph Convolution Networks [Code]
    Shikhar Vashishth, Swayambhu Nath Ray, Shib Sankar Dasgupta and Partha Talukdar
    56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Melbourne, Australia

    Towards Understanding the Geometry of Knowledge Graph Embeddings [Code]
    Chandrahas Dewangan, Aditya Sharma and Partha Talukdar
    56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Melbourne, Australia

    Higher-order Relation Schema Induction using Tensor Factorization with Back-off and Aggregation [Code]
    Madhav Nimishakavi, Manish Gupta and Partha Talukdar
    56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Melbourne, Australia

    CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information [Code]
    Shikhar Vashishth, Prince Jain and Partha Talukdar
    The Web Conference 2018 (WWW 2018), Lyon, France. [acceptance rate: 14.8%]

    ELDEN: Improved Entity Linking using Densified Knowledge Graphs [Code]
    Priya Radhakrishnan, Partha Talukdar and Vasudeva Varma
    NAACL 2018, New Orleans, USA

    Never-Ending Learning
    Tom M. Mitchell, W. Cohen, E. Hruschka, P. Talukdar, B. Yang, J. Betteridge, A. Carlson, B. Dalvi, M. Gardner, B. Kisiel, J. Krishnamurthy, N. Lao, K. Mazaitis, T. Mohamed, N. Nakashole, E. Platanios, A. Ritter, M. Samadi, B. Settles, R. Wang, D. Wijaya, A. Gupta, X. Chen, A. Saparov, M. Greaves, J. Welling
    Communications of the ACM, 61(5), pp. 103-115, May 2018.

    Efficient and Distributed Generalized Canonical Correlations Analysis for Big Multiview Data
    X. Fu, K. Huang, E.E. Papalexakis, H. Song, P. Talukdar, N. D. Sidiropoulos, C. Faloutsos, and T. Mitchell
    IEEE Transactions on Knowledge and Data Engineering, Sep. 2018.


    2017

    KGEval: Accuracy Estimation of Automatically Constructed Knowledge Graphs
    Ojha P, Partha Talukdar
    International Conference on Empirical Methods in NLP (EMNLP 2017), Copenhagen, Denmark

    Speeding up Reinforcement Learning-based Information Extraction Training using Asynchronous Methods
    Aditya Sharma, Zarana Parekh, Partha Talukdar
    International Conference on Empirical Methods in NLP (EMNLP 2017), Copenhagen, Denmark [Short Paper]

    Revisiting Simple Neural Networks for Learning Representations of Knowledge Graphs
    Srinivas Ravishankar, Chandrahas Dewangan and Partha Talukdar
    6th Workshop on Automated Knowledge Base Construction (AKBC) 2017

    Improving Distantly Supervised Relation Extraction using Word and Entity Based Attention
    Sharmistha Jat, Siddhesh Khandelwal and Partha Talukdar
    6th Workshop on Automated Knowledge Base Construction (AKBC) 2017

    BRAINZOOM: High Resolution Reconstruction from Multi-modal Brain Signals
    Xiao Fu, Kejun Huang, Otilia Stretcu, Hyun Ah Song, Evangelos Papalexakis, Partha Talukdar, Tom Mitchell, Nicholas Sidiropoulos, Christos Faloutsos, Barnabas Poczos
    SIAM International Conference on Data Mining (SDM 2017), Houston, USA

    Facets: Adaptive Local Exploration of Large Graphs
    Robert Pienta, Minsuk (Brian) Kahng, Zhiyuan Lin, Jilles Vreeken, Partha Talukdar, James Abello, Ganesh Parameswaran, Duen Horng (Polo) Chau
    SIAM International Conference on Data Mining (SDM 2017), Houston, USA


    2016

    Relation Schema Induction using Tensor Factorization with Side Information [Code]
    Madhav Nimishakavi, Uday Singh Saini, Partha Talukdar
    International Conference on Empirical Methods in NLP (EMNLP 2016), Austin, USA

    Quality Estimation of Workers in Collaborative Crowdsourcing using Group Testing
    Ojha P, Partha Talukdar
    AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2016), Austin, USA.

    Discovering Response-Eliciting Factors in Social Question-Answering: A Reddit Inspired Study
    Danish, Yogesh Dahiya and Partha Talukdar
    10th International AAAI Conference on Web and Social Media (ICWSM-16) [acceptance rate: 17%]

    ClaimEval: Integrated and Flexible Framework for Claim Evaluation Using Credibility of Sources
    Mehdi Samadi, Partha Talukdar, Manuela Veloso, Manuel Blum
    30th AAAI Conference on Artificial Intelligence (AAAI-16), Phoenix, USA

    Efficient and Distributed Algorithms for Large-Scale Generalized Canonical Correlations Analysis
    Xiao Fu, Kejun Huang, Evangelos Papalexakis, Hyun-Ah Song, Partha Talukdar, Nicholas Sidiropoulos, Christos Faloutsos, Tom Mitchell
    International Conference on Data Mining (ICDM 2016), Barcelona, Spain [Short Paper, acceptance: 11.1%]

    Turbo-SMT: Parallel Coupled sparse Matrix-Tensor Factorizations and applications
    Evangelos Papalexakis, Tom Mitchell, Nicholas Sidiropoulos, Christos Faloutsos, Partha Talukdar, Brian Murphy
    Statistical Analysis and Data Mining: The ASA Data Science Journal


    2015

    An Entity-centric Approach for Overcoming Knowledge Graph Sparsity
    Manjunath Hegde, Partha Talukdar
    Empirical Methods in NLP (EMNLP 2015), Portugal (Short Paper)

    Knowledge Base Inference using Bridging Entities
    Bhushan Kotnis, Pradeep Bansal, Partha Talukdar
    Empirical Methods in NLP (EMNLP 2015), Portugal (Short Paper)

    Translation Invariant Word Embeddings
    Matt Gardner, Kejun Huang, Evangelos Papalexakis, Xiao Fu, Partha Talukdar, Christos Faloutsos, Nicholas Sidiropoulos, Tom Mitchell
    Empirical Methods in NLP (EMNLP 2015), Portugal (Short Paper)

    AskWorld: Budget-Sensitive Query Evaluation for Knowledge-on-Demand
    Mehdi Samadi, Partha Talukdar, Manuela Veloso, Tom Mitchell
    International Joint Conference on Artificial Intelligence (IJCAI 2015), Buenos Aires, Argentina.

    Never-Ending Learning
    Tom Mitchell, William Cohen, Estevam Hruschka, Partha Talukdar, Justin Betteridge, Andrew Carlson, Bhavana Dalvi, Matt Gardner, Bryan Kisiel, Jayant Krishnamurthy, Ni Lao, Kathryn Mazaitis, Tahir Mohammad, Ndapa Nakashole, Emmanouil Antonios Platanios, Alan Ritter, Mehdi Samadi, Burr Settles, Richard Wang, Derry Wijaya, Abhinav Gupta, Xinlei Chen, Abulhair Saparov, Malcolm Greaves and Joel Welling
    Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2015), Austin, USA.

    Automatic Gloss Finding for a Knowledge Base using Ontological Constraints
    Bhavana Dalvi, Einat Minkov, Partha Talukdar and William Cohen
    International Conference on Web Search and Data Mining (WSDM 2015), Shanghai, China

    A Compositional and Interpretable Semantic Space
    Alona Fyshe, Leila Wehbe, Partha P. Talukdar, Brian Murphy and Tom M. Mitchell
    Conference of the North American Chapter of the ACL (NAACL 2015), Denver, USA

    Principled Neuro-Functional Connectivity Discovery
    K. Huang, N. Sidiropoulos, C. Faloutsos, E. Papalexakis, P. Talukdar, T. Mitchell
    SIAM International Conference on Data Mining (SDM 2015), Vancouver, Canada

    Active Learning in Keyword Search-based Data Integration
    Zhepeng Yan, Nan Zheng, Zachary Ives, Partha Pratim Talukdar, Cong Yu
    The VLDB Journal Special Issue on Best Papers of VLDB 2013

    Combining Vector Space Embeddings with Symbolic Logical Inference over Open-Domain Text
    Matt Gardner, Partha Talukdar and Tom Mitchell
    AAAI Spring Symposium on Knowledge Representation and Reasoning, Stanford, USA


    2014

    Scaling Graph-based Semi Supervised Learning to Large Number of Labels Using Count-Min Sketch
    Partha Pratim Talukdar, William Cohen
    17th International Conference on Artificial Intelligence and Statistics (AISTATS 2014), Reykjavik, Iceland.
    [pre-print presented at NIPS 2013 Workshop on Randomized Methods for Machine Learning]

    Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases
    Matt Gardner, Partha Talukdar, Jayant Krishnamurthy, and Tom Mitchell
    International Conference on Empirical Methods in NLP (EMNLP 2014), Doha, Qatar.

    Simultaneously Uncovering the Patterns of Brain Regions Involved in Different Story Reading Subprocesses
    Leila Wehbe, Brian Murphy, Partha Talukdar, Alona Fyshe, Aaditya Ramdas, Tom Mitchell
    PLoS ONE 9(11): e112575. doi:10.1371/journal.pone.0112575

    Press Coverage: CMU Press Release, Scientific American (blog), AP, TIME



    Interpretable Semantic Vectors from a Joint Model of Brain- and Text-based Meaning
    Alona Fyshe, Partha Talukdar, Brian Murphy, Tom Mitchell
    52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, USA.

    Good-Enough Brain Model: Challenges, Algorithms and Discoveries in Multi-Subject Experiments
    Evangelos Papalexakis, Alona Fyshe, Nicholas Sidiropoulos, Partha Talukdar, Tom Mitchell, Christos Faloutsos
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2014), New York City, USA.

    Turbo-SMT: Accelerating Coupled Sparse Matrix-Tensor Factorizations by 200x [Supplementary] [Code]
    E. Papalexakis, T. Mitchell, N. Sidiropoulos, C. Faloutsos, P. Talukdar, B. Murphy
    SIAM International Conference on Data Mining (SDM 2014), Philadelphia, USA.

    Invited to the Statistical Analysis and Data Mining (SAM) Special Issue of "Best of SDM 2014"


    FlexiFaCT: Scalable Flexible Factorization of Coupled Tensors on Hadoop
    Alex Beutel, Abhimanu Kumar, Evangelos Papalexakis, Partha Talukdar, Christos Faloutsos, Eric Xing
    SIAM International Conference on Data Mining (SDM 2014), Philadelphia, USA.


    2013

    Improving Learning and Inference in a Large Knowledge-base using Latent Syntactic Cues [Details]
    Matt Gardner, Partha Talukdar, Bryan Kisiel, Tom Mitchell
    International Conference on Empirical Methods in NLP (EMNLP 2013), Seattle, USA. [Short Paper]

    PIDGIN: Ontology Alignment using Web Text as Interlingua [Details] [Slides]
    Derry Wijaya, Partha Pratim Talukdar, Tom Mitchell
    International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, USA.

    Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition
    Alona Fyshe, Partha Talukdar, Brian Murphy, and Tom Mitchell
    International Conference on Computational Natural Language Learning (CoNLL 2013), Sofia, Bulgaria.

    Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration
    Zhepeng Yan, Nan Zheng, Zack Ives, Partha Talukdar, Cong Yu
    International Conference on Very Large Databases (VLDB 2013), Trento, Italy.

    Invited to the special issue of the VLDB Journal with the "Best Papers of VLDB 2013"


    Advances in Automated Knowledge Base Construction
    Fabian M. Suchanek, James Fan, Raphael Hoffmann, Sebastian Riedel, Partha Talukdar
    ACM SIGMOD Record [To Appear]


    2012

    Acquiring Temporal Constraints between Relations
    Partha Pratim Talukdar, Derry Wijaya, Tom Mitchell
    International Conference on Information and Knowledge Management (CIKM 2012), Hawaii, USA.

    Coupled Temporal Scoping of Relational Facts
    Partha Pratim Talukdar, Derry Wijaya, Tom Mitchell
    International Conference on Web Search and Data Mining (WSDM 2012), Seattle, USA.

    Learning Effective and Interpretable Semantic Models using Non-Negative Sparse Embedding
    Brian Murphy, Partha Talukdar, Tom Mitchell
    International Conference on Computational Linguistics (COLING 2012), Mumbai, India.
    [ Slides ] [ Data ]

    Selecting Corpus-Semantic Models for Neurolinguistic Decoding
    Brian Murphy, Partha Talukdar, Tom Mitchell
    Joint Conference on Lexical and Computational Semantics (StarSem) 2012, Montreal, Canada.

    Metric Learning for Graph-based Domain Adaptation
    Paramveer Dhillon, Partha Pratim Talukdar, Koby Crammer
    International Conference on Computational Linguistics (COLING 2012) [Short Paper], Mumbai, India.

    Associating Structured Records To Text Documents
    Rakesh Agrawal, Ariel Fuxman, Anitha Kannan, John Shafer, Partha Pratim Talukdar
    International World Wide Web Conference (WWW 2012) [Poster], Lyon, France.

    Crowdsourced Comprehension: Predicting Prerequisite Structure in Wikipedia
    Partha Pratim Talukdar, William Cohen
    HLT-NAACL 2012 Workshop on Innovative Use of NLP for Building Educational Applications (BEA7)

    Tracking Story Reading in the Brain
    Leila Wehbe, Partha Talukdar, Brian Murphy, Alona Fyshe, Gustavo Sudre, and Tom Mitchell
    NIPS 2012 Workshop on Machine Learning and Interpretation in NeuroImaging, Lake Tahoe, USA.

    2011

    SCAD: Collective Discovery of Attribute Values
    Anton Bakalov, Ariel Fuxman, Partha Pratim Talukdar, Soumen Chakrabarti
    International World Wide Web Conference (WWW 2011), Hyderabad, India.

    Improving Product Classification Using Images
    Anitha Kannan, Partha Pratim Talukdar, Nikhil Rasiwasia, Qifa Ke
    International Conference on Data Mining (ICDM 2011), Vancouver, Canada.

    2010

    Graph-Based Weakly-Supervised Methods for Information Extraction & Integration
    Partha Pratim Talukdar
    PhD Thesis, CIS Department, University of Pennsylvania, May 2010.

    Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition [ Slides ] [ Data ]
    Partha Pratim Talukdar, Fernando Pereira
    ACL 2010, Uppsala, Sweden.

    Learning Better Data Representation using Inference-Driven Metric Learning [ Poster ]
    Paramveer Dhillon, Partha Pratim Talukdar, Koby Crammer
    ACL 2010 (Short Paper), Uppsala, Sweden.

    Automatically Incorporating New Sources in Keyword Search-Based Data Integration [ Slides ]
    Partha Pratim Talukdar, Zack Ives, Fernando Pereira
    2010 ACM SIGMOD Conference, Indianapolis, USA.

    Inference-Driven Metric Learning (IDML) for Graph Construction
    Paramveer Dhillon, Partha Pratim Talukdar, Koby Crammer
    UPenn CIS Technical Report MS-CIS-10-18

    2009

    New Regularized Algorithms for Transductive Learning [ Slides ] [ Video ]
    Partha Pratim Talukdar, Koby Crammer
    European Conference on Machine Learning (ECML-PKDD) 2009, Bled, Slovenia.

    Sequence Learning from Data with Multiple Labels [ Slides ]
    Mark Dredze, Partha Pratim Talukdar, Koby Crammer
    ECML-PKDD 2009 workshop on Learning from Multi-Label Data (MLD 09), Bled, Slovenia.

    Interactive Data Integration through Smart Copy and Paste
    Zack Ives, Craig Knoblock, Steve Minton, Marie Jacob, Partha Talukdar, Rattapoom Tuchinda, Jose Luis Ambite, Maria Muslea, Cenk Gazen.
    Conference on Innovative Data Systems Research (CIDR) 2009, Asilomar, California.

    Regularized Learning with Networks of Features.
    Ted Sandler, John Blitzer, Partha Pratim Talukdar, Lyle H. Ungar.
    Advances in Neural Information Processing Systems (NIPS) 2009.

    Topics in Graph Construction for Semi-Supervised Learning
    Partha Pratim Talukdar
    UPenn CIS Technical Report MS-CIS-09-13

    2008

    Weakly Supervised Acquisition of Labeled Class Instances using Graph Random Walks [ Slides ]
    Partha Pratim Talukdar, Joseph Reisinger, Marius Pasca, Deepak Ravichandran, Rahul Bhagat, Fernando Pereira.
    EMNLP 2008, Honolulu, Hawaii.

    The Orchestra Collaborative Data Sharing System.
    Todd J. Green, Grigoris Karvounarakis, Nicholas E. Taylor, Val Tannen, Partha Pratim Talukdar, Marie Jacob, Fernando Pereira.
    ACM SIGMOD Record, September 2008.

    Learning to Create Data-Integrating Queries [ Slides ]
    Partha Pratim Talukdar, Marie Jacob, Mohammad Salman Mehmood, Koby Crammer, Zack Ives, Fernando Pereira, Sudipto Guha.
    34th International Conference on Very Large Databases (VLDB 2008), Auckland, New Zealand.

    A Rate-Distortion One-Class Model and its Applications to Clustering. [ Slides ] [ Video ]
    Koby Crammer, Partha Pratim Talukdar, Fernando Pereira.
    International Conference on Machine Learning (ICML) 2008, Helsinki, Finland.

    DRASO: Declaratively Regularized Alternating Structural Optimization. [ Slides ] [ Video ]
    Partha Pratim Talukdar, John Blitzer, Ted Sandler, Mark Dredze, Koby Crammer, Fernando Pereira.
    ICML 2008 Workshop on Prior Knowledge for Text and Language Processing, Helsinki, Finland.

    2007

    Lightly-Supervised Attribute Extraction.
    Kedar Bellare, Partha Pratim Talukdar, Giridhar Kumaran, Fernando Pereira, Mark Liberman, Andrew McCallum and Mark Dredze.
    NIPS 2007 Workshop on Machine Learning for Web Search.

    Frustratingly Hard Domain Adaptation for Dependency Parsing.
    Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, Joao Graca, and Fernando Pereira.
    CoNLL Shared Task Session of EMNLP-CoNLL 2007, Prague.

    Automatic Code Assignment to Medical Text.
    Koby Crammer, Mark Dredze, Kuzman Ganchev, Partha Pratim Talukdar and Steve Caroll.
    BioNLP 2007, Prague.

    2006

    A Context Pattern Induction Method for Named Entity Extraction [ Slides ]
    Partha Pratim Talukdar, Thorsten Brants, Mark Liberman and Fernando Pereira
    Tenth Conference on Computational Natural Language Learning (CoNLL-X), New York City, June 8-9, 2006.

    2004

    Hindi Text Normalization.
    K. Panchapagesan, Partha Pratim Talukdar, N. Sridhar Krishna, Kalika Bali, A.G. Ramakrishnan.
    Fifth International Conference on Knowledge Based Computer Systems (KBCS), 19-22 December 2004, Hyderabad, India.

    Phonetic Distance Based Cross-lingual Search.
    Sriram S., Partha Pratim Talukdar, Sameer Badaskar, Kalika Bali, A.G. Ramakrishnan.
    International Conference on Natural Language Processing, 19-22 December 2004, Hyderabad, India.

    Optimal Creation of Speech Databases for Indian Language Speech Technology
    Satinder Singh, Partha Talukdar, Sridhar Krishna, Sandeep Manocha, Kalika Bali, Sitaram R.N.V.
    International Conference on Speech and Language Technology/O-COCOSDA, 17-19 November 2004, New Delhi, India.

    Tools for the Development of a Hindi Speech Synthesis System
    Kalika Bali, A.G. Ramakrishnan, Partha Pratim Talukdar, N. Sridhar Krishna.
    5th ISCA Speech Synthesis Workshop, 14th-16th June 2004, Carnegie Mellon University, USA.

    Duration Modeling for Hindi Text-to-Speech Synthesis.
    N. Sridhar Krishna, Partha Pratim Talukdar, Kalika Bali, A.G. Ramakrishnan.
    8th International Conference on Spoken Language Processing (ICSLP), 4th-8th October 2004, Jeju Island, Korea.

    Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis.
    Deepa S.R., A.G. Ramakrishnan, Kalika Bali, Partha Pratim Talukdar.
    Language Resources and Evaluation Conference (LREC) 2004, Portugal, 26-28 May 2004.

Teaching


    Jan-May 2019: E1 246: Natural Language Understanding
    Aug-Dec 2018: DS 222: Machine Learning with Large Datasets
    Jan-May 2018: E1 246: Natural Language Understanding
    Aug-Dec 2017: DS 222: Machine Learning with Large Datasets
    Jan-May 2016: SE 256: Scalable Systems for Data Science
    Jan-May 2016: SE 294: Data Analysis and Visualization
    Aug-Dec 2015: E1 246: Natural Language Understanding
    Jan-May 2015: SE 294: Data Analysis and Visualization
    Aug-Dec 2014: SE 305: Web-scale Knowledge Harvesting

Software


    Junto Label Propagation Toolkit: This toolkit consists of implementations of various graph-based semi-supervised learning (SSL) algorithms.

    OCRD: One Class Algorithm based on Rate-Distortion theory (download)
    An algorithm to choose a coherent subset of points from a large set. Please see A Rate-Distortion One-Class Model and its Applications to Clustering for details.


About Me


    I was born and raised in Guwahati, Assam.