Interpreting Potts and Transformer Protein Models Through the Lens of Simplified Attention
Original paper: bhattacharya.pdf (stanford.edu)
Highlights of the Paper
- Argues that attention captures real properties of protein family data, leading to a principled model of protein interactions.
- Introduces an energy-based attention layer, factored attention, which recovers a Potts model (a minimal sketch follows this list).
- Contrasts Potts models and Transformers.
- Shows that the Transformer leverages hierarchical signals in protein family databases that are not captured by single-layer models.
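The factored-attention layer is simple enough to sketch. Below is a minimal PyTorch version, assuming one-hot input over a 21-letter alphabet (20 amino acids plus gap); the parameter names and default sizes here are illustrative, not the paper's exact configuration. The key idea is that queries and keys depend only on position while values depend only on amino-acid identity, which is what lets the layer be read as a Potts model.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FactoredAttention(nn.Module):
    """Single-layer factored attention (sketch).

    Queries/keys are position-only, values are amino-acid-only, so the
    layer factors into an L x L attention map per head and an A x A
    value matrix per head -- analogous to Potts couplings
    J_ij ~ sum_h softmax(Q K^T)_ij W_V^h.
    """
    def __init__(self, seq_len: int, n_heads: int = 32,
                 d_head: int = 64, vocab: int = 21):
        super().__init__()
        # Position embeddings for queries/keys: (heads, L, d)
        self.query = nn.Parameter(0.01 * torch.randn(n_heads, seq_len, d_head))
        self.key = nn.Parameter(0.01 * torch.randn(n_heads, seq_len, d_head))
        # Amino-acid value matrices: (heads, A, A)
        self.value = nn.Parameter(0.01 * torch.randn(n_heads, vocab, vocab))
        self.d_head = d_head

    def forward(self, x_onehot: torch.Tensor) -> torch.Tensor:
        # x_onehot: (batch, L, A) one-hot encoded aligned sequences.
        # Position-only attention map, shared across the batch: (heads, L, L)
        scores = torch.einsum("hid,hjd->hij", self.query, self.key)
        attn = F.softmax(scores / self.d_head ** 0.5, dim=-1)
        # Value lookup per position: (batch, heads, L, A)
        vals = torch.einsum("bja,hav->bhjv", x_onehot, self.value)
        # Sum over positions j and heads -> logits at each position i.
        return torch.einsum("hij,bhjv->biv", attn, vals)  # (batch, L, A)
```

Training would mask positions and minimize cross-entropy at the masked sites, analogous to pseudolikelihood training of a Potts model.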
Introduction/Background
Potts Model
- A Potts model is a kind of Markov random field (MRF), a popular class of models for unsupervised protein contact prediction (the energy is written out below this list).
- MRF-based methods can capture statistical information about co-evolving positions in a protein family's multiple sequence alignment.
- There is a great illustrated example of a Potts model in Tianyu's Blog.
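For concreteness, here is the Potts model in the standard contact-prediction (DCA-style) notation, which is not taken verbatim from this paper: an aligned sequence $x \in \{1, \dots, A\}^L$ over an alphabet of size $A$ is assigned the energy

$$
E(x) = -\sum_{i=1}^{L} h_i(x_i) \;-\; \sum_{1 \le i < j \le L} J_{ij}(x_i, x_j),
\qquad
p(x) \propto e^{-E(x)},
$$

where the $h_i$ are single-site fields and each $J_{ij}$ is an $A \times A$ coupling matrix. Because the partition function is intractable, these models are typically fit by pseudolikelihood maximization, and contacts are scored from the norms of the coupling blocks $J_{ij}$.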
