Pengfei Li Headshot

Pengfei Li

Assistant Professor, School of Information

School of Information
Golisano College of Computing and Information Sciences

Pengfei Li

Assistant Professor, School of Information

School of Information
Golisano College of Computing and Information Sciences

Currently Teaching

DSCI-633
3 Credits
A foundations course in data science, emphasizing both concepts and techniques. The course provides an overview of data analysis tasks and the associated challenges, spanning data preprocessing, model building, model evaluation, and visualization. The major areas of machine learning, such as unsupervised, semi-supervised and supervised learning are covered by data analysis techniques including classification, clustering, association analysis, anomaly detection, and statistical testing. The course includes a series of assignments utilizing practical datasets from diverse application domains, which are designed to reinforce the concepts and techniques covered in lectures. A substantial project related to one or more data sets culminates the course.
ISTE-470
3 Credits
Rapidly expanding volumes of data from all areas of society are becoming available in digital form. High value information and knowledge is embedded in many of these data volumes. Unlocking this information can provide many benefits, and may also raise ethical questions in certain circumstances. This course provides students with a hands-on introduction to how interactive data exploration and data mining software can be used for data-driven knowledge discovery, including domains such as business, environmental management, healthcare, finance, and transportation. Data mining techniques and their application to large data sets will be discussed in detail, including classification, clustering, association rule mining, and anomaly detection. In addition, students will learn the importance of applying data visualization practices to facilitate exploratory data analysis.
ISTE-780
3 Credits
Rapidly expanding collections of data from all areas of society are becoming available in digital form. Computer-based methods are available to facilitate discovering new information and knowledge that is embedded in these collections of data. This course provides students with an introduction to the use of these data analytic methods, with a focus on statistical learning models, within the context of the data-driven knowledge discovery process. Topics include motivations for data-driven discovery, sources of discoverable knowledge (e.g., data, text, the web, maps), data selection and retrieval, data transformation, computer-based methods for data-driven discovery, and interpretation of results. Emphasis is placed on the application of knowledge discovery methods to specific domains.