Charu Aggarwal

Biography

Charu Aggarwal is a Research Scientist at the IBM T. J. Watson Research Center in Yorktown Heights, New York. He completed his B.S. from IIT Kanpur in 1993 and his Ph.D. from Massachusetts Institute of Technology in 1996. His research interest during his Ph.D. years was in combinatorial optimization (network flow algorithms), and his thesis advisor was Professor James B. Orlin . He has since worked in the field of data mining, with particular interests in data streams, privacy, uncertain data and social network analysis. He has published over 200 papers in refereed venues, and has applied for or been granted over 80 patents. Because of the commercial value of the above-mentioned patents, he has received several invention achievement awards and has thrice been designated a Master Inventor at IBM. He is a recipient of an IBM Corporate Award (2003) for his work on bio-terrorist threat detection in data streams, a recipient of the IBM Outstanding Innovation Award (2008) for his scientific contributions to privacy technology, and a recipient of an IBM Research Division Award (2008) for his scientific contributions to data stream research. He has served on the program committees of most major database/data mining conferences, and served as program vice-chairs of the SIAM Conference on Data Mining , 2007, the IEEE ICDM Conference, 2007, the WWW Conference 2009, and the IEEE ICDM Conference, 2009. He served as an associate editor of the IEEE Transactions on Knowledge and Data Engineering Journal from 2004 to 2008. He is an associate editor of the ACM TKDD Journal , an action editor of the Data Mining and Knowledge Discovery Journal , an associate editor of the ACM SIGKDD Explorations, and an associate editor of the Knowledge and Information Systems Journal. He is a fellow of the ACM (2013) and the IEEE (2010) for "contributions to knowledge discovery and data mining techniques".
DBLP Publication Profile

Google Scholar Citation Profile

C.V.

Research Interests:

Graph mining and Social Networks, Data Stream Mining, Uncertain Data Mining, Text and Multimedia Data Mining, Privacy Preserving Data Mining, High Dimensional Data Mining, Data Mining for Electronic Commerce


You can download the postscript/PDF files of my frequently accessed papers from my publication page. A more comprehensive list of publications is available from the DBLP database maintained by Michael Ley.

My citations can be accessed from this link to google scholar search. My h-index is 58.

My list of granted patents with full text is available from the patent office . A searchable link (by patent number) from the US patent office can be used to access the full text of the patents.

Here is a list of my academic honors and professional activities.

A resume may be found here .

Contact Information: Charu Aggarwal

IBM T. J. Watson Research Center, 1101 Kitchawan Rd, Yorktown, NY 10598

Email: charu (at) us (dot) ibm (dot) com

In case you have sent me an email at my earlier address with domain name watson.ibm.com, it is likely that I have not received it.


BOOKS: Most of my books are published with Springer (some with CRC Press) in both hard copy and electronic form. Both Springer and CRC Press generally have an excellent electronic distribution (through Springer link or CRCNetbase) in addition to hard copies. The web pointers to Springer Link and CRCNetbase for each book are also provided below. Some institutions also have agreements or subscriptions with Springer or CRC Press which allow them access to Springer link or CRC Press electronic material. You may want to check with your library. Springer also has a unique MyCopy Program, whereby you might be able to order a very low-priced ($25) personal softcover copy of Springer published books under certain circumstances (depending on your institution's subscriptions with Springer). Check here for details.

NEW: DATA CLUSTERING BOOK

Data Clustering: Algorithms and Applications (CRC Press), Ed. Charu Aggarwal, Chandan Reddy, September 2013. -- Comprehensive survey driven book on data clustering with chapters contributed by prominent researchers in the field.

Table of Contents and Introductory Chapter

CRC Netbase Link for Electronic Book


NEW: (AUTHORED BOOK): OUTLIER ANALYSIS:

Outlier Analysis (Springer) Authored by Charu Aggarwal, January 2013. Comprehensive text book on outlier analysis, including examples and exercises for classroom teaching. Most of the previous books on outlier detection were written by statisticians for statisticians, with little or no coverage from the data mining and computer science perspective. This book is intended to fill that gap. Each chapter contains key research content on the topic, case studies, extensive bibliographic notes and the future direction of research in this field. Includes exercises as well.

Covers applications for credit card fraud, network intrusion detection, law enforcement etc.

Content is simplified so students and practitioners can also benefit from this book.

Chapters will typically cover one of three areas: methods and techniques commonly used in outlier analysis, such as linear methods, proximity-based methods, subspace methods, and supervised methods; data domains, such as, text, categorical, mixed-attribute, time-series, streaming, discrete sequence, spatial and network data; and key applications of these methods as applied to diverse domains such as credit card fraud detection, intrusion detection, medical diagnosis, earth science, web log analytics, and social network analysis.

The book has been selected among the Best publications of 2013 by ACM Computing Reviews.

Table of Contents and Introductory Chapter

Springer Link


NEW: SENSOR DATA MANAGEMENT AND MINING BOOK:

Managing and Mining Sensor Data (Springer) Ed. Charu Aggarwal, March 2013. -- Comprehensive survey driven book on sensor data management and mining with chapters contributed by prominent researchers in the field.

Table of Contents and Introductory Chapter

Springer Link


TEXT MINING BOOK:

Mining Text Data (Springer) Ed. Charu Aggarwal, ChengXiang Zhai, March 2012. -- Comprehensive survey driven book on text mining with chapters contributed by prominent researchers in the field.

Table of Contents and Sample Survey Chapters on Clustering and Classification

Springer Link


SOCIAL NETWORK DATA ANALYTICS BOOK:

Social Network Data Analytics (Springer) Ed. Charu Aggarwal, March 2011. -- Comprehensive survey driven book on social networks with chapters contributed by prominent researchers in the field.

Table of Contents

Introductory Chapter

Springer Link


GRAPH MANAGEMENT AND MINING BOOK:

Managing and Mining Graph Data (Springer) Ed. Charu Aggarwal, Haixun Wang; February 2010. -- Comprehensive survey driven book on graph data with chapters contributed by prominent researchers in the field.

Table of Contents and Introductory Survey Chapters

ACM Computing Reviews for the Book

Springer Link


UNCERTAIN DATA BOOK :

Managing and Mining Uncertain Data (Springer) Ed. Charu Aggarwal, February 2009. -- Comprehensive survey driven book on Uncertain Data with chapters contributed by prominent researchers in the field.

Table of Contents and introductory survey chapters

ACM Computing Reviews for the Book

Springer Link


PRIVACY-PRESERVING DATA MINING BOOK:

Privacy-Preserving Data Mining: Models and Algorithms (Springer) Ed. Charu Aggarwal, Philip S. Yu, July 2008. -- Comprehensive survey driven book on Privacy-Preserving Data Mining Research with chapters contributed by prominent researchers in the field.

Table of Contents

Introductory Survey

ACM Computing Reviews for the book

Book Cover

Springer Link


DATA STREAM BOOK:

Data Streams: Models and Algorithms (Springer) Ed. Charu Aggarwal, January 2007. -- Comprehensive survey driven book on Data Stream Research with chapters contributed by prominent researchers in the field. Table of Contents

ACM Computing Reviews for the Book

Survey Chapter on Synopsis Construction in Data Streams

Springer Link


My podcast on data streams from IBM Research
Download Link for PS/PDF files of frequently accessed papers


KDNuggets, Analytics and Data Mining Resources