Talks and Tutorials
Students & Postdocs
Photograph of me lecturing at the blackboard (credit: R. Benjamin Shapiro, 2002).
Upcoming events that I'm involved with:
(May 19–23, 2019)
CSET 2019 (August 2019)
NSPW 2019 (Aug/Sept/Oct 2019)
A Mathematical and Data-Driven Approach to Intrusion Detection for High-Performance Computing
This project is looking at a mathematical and data-based approach to protecting the major HPC resources of DOE. It seeks to extend the state of the art in intrusion detection by developing new mathematical-statistical techniques for the problem of intrusion detection, and to handle the difficult practical problem of safely "sanitizing" HPC system activity data for use by other researchers and other HPC sites without compromising user privacy and security.
Sponsor: Department of Energy Office of Science Advanced Scientific Computing Research Program [Press Release]
Selected publications resulting from this project:
- "Principles of Authentication"
- Sean Peisert, Ed Talbot, and Tom Kroeger,
- Proceedings of the 2013 New Security Paradigms Workshop (NSPW), pp. 47–56,
Banff, Canada, September 9–12, 2013. [BibTeX] [DOI] [OA] [CDL]
- "Identifying HPC Codes via Performance Logs and Machine Learning"
- Orianna DeMasi, Taghrid Samak, and David H. Bailey,
- Proceedings of the Workshop on Changing Landscapes in HPC Security (CLHS), New York City, NY, June 18, 2013.
- "Energy Consumption Models and Predictions for Large-scale Systems"
- Taghrid Samak, Christine Morin, and David H. Bailey
- Proceedings of the Ninth Workshop on High-Performance, Power-Aware Computing (HPPAC), Boston, MA, May 20, 2013.
- "Here's My Cert, So Trust Me, Maybe? Understanding TLS Errors on the Web"
- Devdatta Akhawe, Bernhard Amann, Matthias Vallentin, and Robin Sommer
- Proceedings of the 22nd International World Wide Web Conference (WWW),
Rio de Janeiro, Brazil, May 13–17, 2013
- "Multiclass Classification of Distributed Memory Parallel Computations"
- Sean Whalen, Sean Peisert, and Matt Bishop,
- Pattern Recognition Letters (PRL), 34(3), pp. 322–329, February 2013. [BibTeX] [DOI] [CDL]
- "Visualizing Distributed Memory Computations with Hive Plots"
- Sophie Engle and Sean Whalen,
- Proceedings of the 9th ACM International Symposium on Visualization for Cyber Security (VizSec), pp. 56–63, Seattle, WA, October 15, 2012. [DOI]
- "Feature Selection and Multi-Class Classification Using a Rule Ensemble Method"
- David H. Bailey, Orianna DeMasi, and Juan Meza,
- Lawrence Berkeley National Laboratory Technical Report, May 25, 2012.
- "Network-Theoretic Classification of Parallel Computation Patterns" (expanded version of CACHES paper)
- Sean Whalen, Sophie Engle, Sean Peisert, and Matt Bishop,
- International Journal of High Performance Computing Applications (IJHPCA), 26(2), pp. 159–169, May 2012. [BibTeX] [DOI] [CDL]
- "Suspended Accounts in Retrospect: An Analysis of Twitter Spam"
- Kurt Thomas, Chris Grier, Vern Paxson and Dawn Song
- Proceedings of the 2011 ACM/USENIX Internet Measurement Conference, Berlin, Germany, November 2–4, 2011.
- "Dimension Reduction Using Rule Ensemble Machine Learning Methods: A Numerical Study of Three Ensemble Methods"
- Orianna DeMasi, Juan Meza, and David H. Bailey,
- Lawrence Berkeley National Laboratory Technical Report, August 30, 2011.
- "Network-Theoretic Classification of Parallel Computation Patterns"
- Sean Whalen, Sean Peisert, and Matt Bishop,
- Proceedings of the First International Workshop on Characterizing Applications for Heterogeneous Exascale Systems (CACHES),
Tucson, AZ, June 4, 2011. [BibTeX]
- "Detecting and Analyzing Automated Activity on Twitter"
- Chao Michael Zhang and Vern Paxson
- Proceedings of the 12th Conference on Passive and Active Measurement (PAM),
Atlanta, GA, March 20–22, 2011.
- "@spam: The Underground on 140 Characters or Less"
- Chris Grier, Kurt Thomas, Vern Paxson, and Michael Zhang
- Proceedings of the 17th ACM Conference on Computer and Communications Security (CCS), Chicago, IL, October 4–8, 2010.
- "Relationships and Data Sanitization: A Study in Scarlet"
- Matt Bishop, Justin Cummins, Sean Peisert, Bhume Bhumitarana, Anhad Singh, Deborah Agarwal, Deborah Frincke, and Michael Hogarth,
- Proceedings of the 2010 New Security Paradigms Workshop (NSPW), pp. 151–164,
Concord, MA, September 21–23, 2010. [BibTeX] [DOI] [OA] [CDL]
- Security Applications of the ε-Machine
- Sean H. Whalen,
- Ph.D. Dissertation, Dept. of Computer Science, University of California, Davis, September 2010.
- "Hidden Markov Models for Automated Protocol Learning"
- Sean Whalen, Matt Bishop and James Crutchfield,
- Proceedings of the 6th International ICST Conference on Security and Privacy in Communications Networks (SecureComm), Singapore, September 7–9, 2010.
- "Fingerprinting Communication and Computation on HPC Machines"
- Sean Peisert,
- Lawrence Berkeley National Laboratory Technical Report LBNL-3483E,
- June 2010. [BibTeX] [Authoritative/CDL]
- "Outside the Closed World: On Using Machine Learning For Network Intrusion Detection"
- Robin Sommer and Vern Paxson,
- Proceedings of the 31st IEEE Symposium on Security and Privacy, pp. 305–316, Oakland, CA, May 16–19, 2010. [DOI]
The definitive versions of the papers posted on this page were first published in the venues indicated. In accordance with publisher copyright policies, these papers are pre-prints or post-prints, and are not the pubilsher's version.
Personal use of the material posted on this page is permitted. However,
permission to reprint/republish this material for advertising or promotional
purposes or for creating new collective works for resale or redistribution to
servers or lists, or to reuse any copyrighted component of this work in other
works must be obtained from the original publishers.
This material is presented to ensure timely dissemination of scholarly and
technical work. Copyright and all rights therein are retained by authors or by
other copyright holders. All persons copying this information are expected to
adhere to the terms and constraints invoked by each author's copyright. In
most cases, these works may not be reposted without the explicit permission of
the copyright holder.