Summary

deblasio_dan_cropped My research interest are broadly in algorithm design and analysis, and I take inspiration from biological problems. Many times this not only leads to an interesting algorithmic result, but a useful biological tool (see Software).

Starting Fall 2023, I am an Assistant Teaching Professor in the Ray and Stephanie Lane Computational Biology Department of the School of Computer Science at Carnegie Mellon University. Previously I was a Lane Fellow in the same department working with Carl Kingsford.

From Fall 2019 to Spring 2023 I was a member of the Computer Science Department at the University of Texas at El Paso (UTEP) as an Assistant Professor.

Until 2016 I was a PhD student in the Computer Science Department at the University of Arizona working with John Kececioglu and before that a student in the CS Department Department at the University of Central Florida working with Shaojie Zhang.

In the past my work has focused mainly on multiple sequence alignment problems. Most recently I worked on improving accuracy of protein multiple sequence alignments. Multiple sequence alignment is a fundamental step in bioinformatics, but the problem is NP-complete. Because of the importance of the result and complexity of the multiple sequence alignment problem many algorithms exist to find high quality alignments in practice. Each of these algorithms has a large number of tunable parameters that can greatly affect the quality of the computed alignment. Most users rely on the default parameter choices, which produce the best alignments on average, but produce poor alignments for some inputs. We developed a process called parameter advising which selects parameter choices that produces a high quality alignment for the input. To accomplish this candidate alignments are produced using each of the parameter choices in an advising set, the accuracy of these candidate alignments is then estimated using an advising estimator, the candidate alignment with the highest estimated accuracy is then selected for the user. To estimate the alignment accuracy we developed Facet (Feature-based accuracy estimator) which is a linear combination of efficiently-computable feature functions. We have found that learning an optimal advisor (selecting both the estimator coefficients and the set of parameter choices) is NP-complete. We expanded this result to show that finding the estimator coefficients or the estimator set independently is also NP-complete. In practice, we have methods to find close-to optimal advisors. We are working on ways to improve the accuracy of these parameter advisors.

I have also worked on improving the memory consumption of secondary structure conscious RNA multiple sequence alignment (see PMFastR) and high throughput phylogeny filtering (see SiClE).

Named JCB Social Media Editor

I was recently named the Social Media Editor for the Journal of Computational Biology. My duties will include maintaining (and creating) the journal’s twitter presence and interacting with authors in this arena.

Paper accepted to STOC

Our paper titled “How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design” (link on the publication page), co-authored with Ellen Vitercik, Travis Dick, Toumas Sandholm, Nina Balcan, and Carl Kingsford was accepted to STOC 2021. The acceptance rate for this year’s (virtual) STOC meeting was only 28%.

Transcript Assembly Advising paper published

Our paper More Accurate Transcript Assembly via Parameter Advising co-authored with Kwanho Kim and Carl Kingsford, was featured in this months issues of the Journal for Computational Biology. This is the special issue of JCB with work from WCB@ICML which I mentioned previously. Link to the paper on the publications page.

Fiyin’s Work Wins Best Student Paper at ACM-BCB

The work that was done while Fiyinfoluwa Gbosibo was at CMU during the summer of 2017 which was published at ACM-BCB this week won the best student paper award. The paper titled “Practical Universal k-mer Sets for Minimizer Schemes” also has Carl and Guillaume as authors. Congrats Fiyin!

The full paper is available in the BCB proceedings and are open access.

Speaking at the TTIC Workshop on Automated Algorithm Design

I have been asked to give a talk at the Workshop on Automated Algorithm Design being held at the Toyota Technical Institute at Chicago from August 7th to the 9th. I will be speaking about our work related to applying parameter advising to reference-based transcript assembly (see Publications).

While in Chicago, I will also attend the Workshop on Learning-Based Algorithms also being held at TTIC from August 12th to the 14th.

Slides | Paper | Code