Imperial College Union Student Choice Award
Published:
Outstanding Academic Representation Network Team Award.
Published:
Outstanding Academic Representation Network Team Award.
Published:
Studentship offered by the EPSRC for the Centre of Doctoral Training in Statistics and Machine Learning (StatML CDT)
Published:
Prize awarded for best MSc Statistics research project at Imperial College London.
Published:
Runner-up & People’s Choice Award at poster competition at Imperial College London’s annual Maths PhD Symposium (2022).
Published:
Award for the best postgraduate/doctoral paper presented at the 6th Annual Conference of the Cyprus Statistical Society.
Published:
People’s Choice Award at poster competition at Imperial College London’s annual Maths PhD Symposium (2025).
Published:
Highly commended nomination for the Faculty of Natural Sciences Prize for excellence in teaching and learning.
Published:
1st Place (Jury Award) & People’s Choice Award Winner at the annual poster competition at the Imperial College London Faculty of Natural Sciences Research Showcase (2025).
Published in Advances in Data Analysis and Classification, 2023
Recommended citation: @article{costa2023benchmarking, title={Benchmarking distance-based partitioning methods for mixed-type data}, author={Costa, Efthymios and Papatsouma, Ioanna and Markos, Angelos}, journal={Advances in Data Analysis and Classification}, volume={17}, number={3}, pages={701--724}, year={2023}, publisher={Springer} }
Download Paper
Published in Under review, 2024
Recommended citation: @misc{costa2024dibmix, title={A Deterministic Information Bottleneck Method for Clustering Mixed-Type Data}, author={Costa, Efthymios and Papatsouma, Ioanna and Markos, Angelos}, year={2024}, eprint={2407.03389}, archivePrefix={arXiv}, primaryClass={stat.ME}, howpublished = {arXiv preprint}, url = {https://arxiv.org/abs/2407.03389}
Download Paper
Published in Data Science, Classification, and Artificial Intelligence for Modeling Decision Making (IFCS 2024), 2025
Recommended citation: @inproceedings{costa2024deterministic, title={A Deterministic Information Bottleneck Method for Clustering Mixed-Type Data}, author={Costa, Efthymios and Papatsouma, Ioanna and Markos, Angelos}, booktitle={Conference of the International Federation of Classification Societies}, pages={81--88}, year={2024}, organization={Springer} }
Download Paper
Published in Statistica, 2025
Recommended citation: @article{CostaPapatsouma2024, title={Discussion of the Paper "Connecting Model-Based and Model-Free Approaches to Linear Least Squares Regression" by Lutz Dümbgen and Laurie Davies (2024)}, volume={84}, DOI={10.60923/issn.1973-2201/20656}, number={2}, journal={Statistica}, author={Costa, Efthymios and Papatsouma, Ioanna}, year={2025}, pages={107–108} }
Download Paper
Published in Statistics and Computing, 2025
Recommended citation: @article{costa2025nominal, title={A novel framework for quantifying nominal outlyingness}, author={Costa, Efthymios and Papatsouma, Ioanna}, journal={Statistics and Computing}, volume={36}, number={41}, pages={41--58}, year={2025}, publisher={Springer} }
Download Paper
Published in Submitted to IFCS 2026, 2026
Recommended citation: @misc{costa2026sparsedib, title={Sparse clustering via the Deterministic Information Bottleneck algorithm}, author={Costa, Efthymios and Papatsouma, Ioanna and Markos, Angelos}, year={2026}, eprint={2601.20628}, archivePrefix={arXiv}, primaryClass={stat.ML}, howpublished = {arXiv preprint}, url = {https://arxiv.org/abs/2601.20628}
Download Paper
Published:
IBclust is an R package for clustering datasets using the Information Bottleneck method and its variants. This package supports datasets with mixed-type variables (nominal, ordinal, and continuous), as well as datasets that are purely continuous or categorical. The IB approach preserves the most relevant information while forming concise and interpretable clusters, guided by principles from information theory.
Published:
SONO is an R package for computing scores of outlyingness for data sets consisting of nominal variables. It further includes various evaluation metrics for assessing performance of outlier identification algorithms producing scores of outlyingness.
Published:
Poster Title: “Benchmarking distance-based partitioning methods for mixed-type data”. (poster)
Published:
Presentation Title: “Clustering mixed-type data: Which method to choose?”. (slides)
Published:
Poster Title: “Benchmarking distance-based partitioning methods for mixed-type data”. (poster)
Published:
Poster Title: “A novel approach to outlier detection for mixed-type data”. (poster)
Published:
Presentation Title: “Outlier detection for mixed-type data: A novel approach”. (slides)
Published:
Presentation Title: “A Deterministic Information Bottleneck Method for Clustering Mixed-Type Data”. (slides)
Published:
Presentation Title: “A Deterministic Information Bottleneck Method for Clustering Mixed-Type Data”. (slides)
Published:
Presentation Title: “A Deterministic Information Bottleneck Method for Clustering Mixed-Type Data”. (slides)
Published:
Presentation Title: “A novel framework for quantifying nominal outlyingness”. (slides)
Published:
Presentation Title: “Utilising the Information Bottleneck algorithm for clustering mixed-type data”. (slides)
Published:
Poster Title: “DIBmix: Information-based clustering for mixed-type data”. (poster)
Published:
Presentation Title: “Cluster Analysis From an Information-Theoretic Viewpoint”. (slides)
Published:
Poster Title: “From Entropy to Insight: Discovering Groups in Mixed-Type Data”. (poster)
Published:
Session Title: “Clustering of heterogeneous data”
Imperial College London, Department of Mathematics
Imperial College London, Department of Mechanical Engineering
Imperial College London, Department of Mathematics
Imperial College London, Department of Mathematics