Hi! I’m Marco Basaldella, CTO and co-founder at Trismik. We are an LLM evaluation company that aims to build science-grade, easy-to-use evaluation tools for AI-based systems. As a CTO, I have to wear many, many hats - I oversee the technical roadmap of the engineering team, I lead the science team to develop our agents and evaluations, and I work with the product team to design our amazing products. I am involved in every stage of the products and I love every part of them - from designing prototypes, to making them real-world products, and to working with our users to make our tools better every day.
Previously…
Ciao! Sono Marco Basaldella, CTO e co-fondatore di Trismik, un’azienda che si occupa di sviluppare strumenti di valutazione di IA che siano allo stesso tempo affidabili, scientifici, e facili da usare. Come CTO di una startup, nella mia giornata occupo molti, molti ruolo - supervisiono la roadmap tecnica del team di sviluppo, guido il team scientifico nella progettazione nostri agenti e delle nostre techniche di evaluation, e lavoro con il team di prodotto per progettare i nostri fantastici prodotti. Sono coinvolto in ogni fase dei prodotti e ne amo ogni parte - dalla progettazione dei prototipi, alla loro trasformazione in prodotti reali, fino al lavoro con i nostri utenti per migliorare i nostri strumenti ogni giorno.
In precedenza…
Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores
arXiv preprint arXiv:2601.13885, 2026
[bib] [pdf]Bridging language models and knowledge graphs with controlled natural languages
Knowledge-Based Systems, 2026, 337:115405
[bib] [pdf]Multi-Trigger Poisoning Amplifies Backdoor Vulnerabilities in LLMs
arXiv preprint arXiv:2507.11112, 2025
[bib] [pdf]LUQ: Long-text Uncertainty Quantification for LLMs
2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
[bib] [pdf]Handling Ontology Gaps in Semantic Parsing
13th Joint Conference on Lexical and Computational Semantics (*SEM 2024)
[bib] [pdf]Self-Alignment Pretraining for Biomedical Entity Representations
2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2021)
[bib] [pdf]Adversarial Training for News Stance Detection: Leveraging Signals from a Multi-Genre Corpus
2021 EACL Hackashop on News Media Content Analysis and Automated Report Generation
[bib] [pdf]COMETA: A Corpus for Medical Entity Linking in the Social Media
2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)
[bib] [pdf] [github]Natural Language Processing for Achieving Sustainable Development: the Case of Neural Labelling to Enhance Community Profiling
2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)
[bib] [pdf]BioReddit: Word Embeddings for User-Generated Biomedical NLP
Proceedings of Tenth International Workshop on Health Text Mining and Information Analysis, co-located with the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), November 3-7 2019, Hong Kong, China
[bib] [pdf] [github]Shut Up and Run: the Never-ending Quest for Social Fitness
#RCBlackMirror2018: Re-Coding Black Mirror Workshop, co-located with the 27th International Conference on World Wide Web, April 24, Lyon, France
[bib] [pdf and online version]Bidirectional LSTM Recurrent Neural Network for Keyphrase Extraction
Proceedings of the 14th Italian Research Conference on Digital Libraries (IRCDL 2018), Udine, Italy, January 25-26, 2018
[bib]The Distiller Framework: current state and future challenges
Proceedings of the 14th Italian Research Conference on Digital Libraries (IRCDL 2018), Udine, Italy, January 25-26, 2018
[bib]Entity recognition in the biomedical domain using a hybrid approach
Journal of Biomedical Semantics, 2017, 8:51
[bib] [pdf]Exploiting and Evaluating a Supervised, Multilanguage Keyphrase Extraction pipeline for under-resourced languages
Proceedings of Recent Advances In Natural Language Processing 2017 (RANLP 2017), Varna, Bulgaria, September 4-6, 2017
[bib] [pdf]Evaluating anaphora and coreference resolution to improve automatic keyphrase extraction
In Proceedings of COLING 2016, 26th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, December 11-16, 2016, Osaka, Japan, pages 804--814, 2016.
[bib] [pdf]Towards building a standard dataset for Arabic keyphrase extraction evaluation
Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016, Tainan, Taiwan, November 21-23, 2016, pages 26--29, 2016.
[bib] [dataset repo] [dataset info]Crowdsourcing Relevance Assessments: The Unexpected Benefits of Limiting the Time to Judge
Proceedings of the 4th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2016), Austin, Texas.
[bib] [pdf]Using a Hybrid Approach for Entity Recognition in the Biomedical Domain
Proceedings of the 7th International Symposium on Semantic Mining in Biomedicine, SMBM 2016, Potsdam, Germany, August 4-5, 2016., pages 11--19, 2016.
[bib] [pdf]Introducing Distiller: A Unifying Framework for Knowledge Extraction
Proceedings of 1st AI*IA Workshop on Intelligent Techniques At LIbraries and Archives co-located with XIV Conference of the Italian Association for Artificial Intelligence, IT@LIA@AI*IA 2015, Ferrara, Italy, September 22, 2015., 2015.
[bib] [pdf]A Content-Based Approach to Social Network Analysis: A Case Study on Research Communities
Proceedings of Digital Libraries on the Move - 11th Italian Research Conference on Digital Libraries, IRCDL 2015, Bolzano, Italy, January 29-30, 2015, Revised Selected Papers, pages 142--154, 2015.
[bib] [pdf]Modelling the User Modelling Community (and Other Communities as Well)
Proceedings of User Modeling, Adaptation and Personalization - 23rd International Conference, UMAP 2015, Dublin, Ireland, June 29 - July 3, 2015. Proceedings, pages 357--363, 2015.
[bib]