Soumyadeb Mitra

 
Data Domainc Inc.
2421 Mission College Blvd
Santa Clara, CA




About Me

I graduated with a PhD from the Department of Computer Science of University of Illinois, Urbana Champaign.  My advisor was Prof. Marianne Winslett.

After my PhD, I joined a company called Data Domain (which recently got acquired by Network Appliance).

Earlier, I had reveived my undergraduate degree from the Computer Science and Engineering Department of Indian Institute of Technology (IIT), Delhi.

PhD Thesis(pdf)

CV (txt/pdf)


PhD Research

My research was focused on  "Compliance Records" -  Records, such as business communications, financial statements and medical images, which are increasingly being stored in electronic form.  Ensuring that such records are not only readily accessible and accurate, but also credible and irrefutable, is particularly imperative given recent legal and regulatory trends (Sarbanes-Oxley Act, SEC Rule 17a-3/4, HIPPA, DOD 5015.2). In my PhD, I developed techniques for secure creation, maintenance, retrieval, migration and eventual shredding of such compliance records.

Apart from this, I have also worked on Maitri: A data-management system for  scientific data and LBIO: A user space parallel I/O routine for cluster computers.


Recent Publications

2008

  • An Architecture for Regulatory Compliant Database Management.
    Soumyadeb Mitra, Marianne Winslett, Richard Snodgrass, Shashank Yaduvanshi and Sumedh Ambokar.. ICDE 2009
  • Query-based Partitioning of Documents and Indexes for Information Lifecycle Management.
    Soumyadeb Mitra, Marianne Winslett and Windsor Hsu. SIGMOD 2008
  • Deleting Index Entries from Compliance Storage
    Soumyadeb Mitra, Marianne Winslett and Nikita Borisov. Extending Data Base Technology (EDBT) 2008

2007

  • Trustworthy Migration and Retrieval of Regulatory Compliant Records. Soumyadeb Mitra, Marianne Winslett, Windsor H Hsu, Xiaonan Ma. IEEE Conference on Mass Storage Systems and Technologies (MSST) 2007
  • Trustworthy Keyword Search for Compliance Storage. Soumyadeb Mitra, Marianne Winslett, Windsor H Hsu, Kevin C.-C. Chang. In The International Journal on Very Large Data Bases, 2007.

2006

  • Trustworthy Keyword Search for Regulatory Compliant Record Retention. Soumyadeb Mitra, Windsor H. Hsu and Marianne Winslett. VLDB 2006.   Best Paper Award
  • Secure Deletion from Inverted Indexes on Compliance Storage. Soumyadeb Mitra and Marianne Winslett. Storage Security Workshop 06, in conjunction with CCS06.   Best Paper Award
  • Bitmap Indexes for large Scientific Data Sets: A case study. Rishi Rakesh Sinha, Soumyadeb Mitra, Marianne Winslett. IPDPS, 2006

2005

  • An Efficient, Non Intrusive, Log Based I/O Mechanism for Scientific Simulations on Clusters. Soumyadeb Mitra, Rishi R Sinha, Marianne Winslett, Xiangmin Jiao, Cluster 2005 Boston.
  • Maitri: A Format independent Data Management System for Scientific Data. Rishi Rakesh Sinha, Soumyadeb Mitra, Marianne Winslett. SNAPI workshop at PACT, 2005.

Hobbies and Interests

I have keen interest in outdoor sports like soccer, running and biking. Recently, I completed the North Shore Century, a 100 mile biking event in Evanston, IL in 7 hrs 49 mins. Earlier, I had run the Chicago Marathon in 2005.

Travelling is my other big hobby. Although, I haven't travelled much in the US (thanks to PhD workload), I have toured a lot in India. Here is a list of Indian states I have visited, where visiting is defined as spending atleast a night not counting overnight train journeys.

Some Quotes