Keyword

deep learning

11 papers tagged “deep learning”

BiologyNature Methods · Oct 2021 Open access

Effective gene expression prediction from sequence by integrating long-range interactions

Žiga Avsec, Vikram Agarwal and David R. Kelley

This paper introduces Enformer, a transformer-based deep learning model that predicts gene expression and chromatin states directly from DNA sequence by integrating regulatory information from up to ~100 kb away. By using self-attention to capture long-range interactions, it substantially improves prediction accuracy over prior convolutional models. The approach also improves prediction of the effects of non-coding genetic variants on expression.

deep learning gene expression genomics transformer

AINature · Jul 2021 Open access

Highly accurate protein structure prediction with AlphaFold

John Jumper, Richard Evans, Alexander Pritzel, David Silver, Oriol Vinyals and Demis Hassabis

The paper introduces AlphaFold2, a deep-learning system that predicts three-dimensional protein structures directly from amino-acid sequence with near-experimental accuracy. It combines a novel attention-based Evoformer over multiple sequence alignments and pairwise representations with an end-to-end structure module that produces atomic coordinates. AlphaFold won the CASP14 assessment by a wide margin, delivering atomic-level accuracy for the majority of targets.

alphafold protein structure prediction deep learning structural biology

BiologyScience · Jul 2021 Open access

Accurate prediction of protein structures and interactions using a three-track neural network

Minkyung Baek, Frank DiMaio and David Baker

This paper presented RoseTTAFold, a three-track neural network that simultaneously processes one-dimensional sequence, two-dimensional residue-pair distances, and three-dimensional atomic coordinate information, with information flowing between the tracks. The method achieved protein structure prediction accuracy approaching that of AlphaFold2 while being more computationally efficient. It also demonstrated rapid generation of accurate models for protein-protein complexes.

protein structure prediction rosettafold deep learning protein interactions

AIICLR 2021 (9th International Conference on Learning Representations) · May 2021 Open access

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov and Neil Houlsby

This paper introduced the Vision Transformer (ViT), applying a standard Transformer encoder directly to sequences of image patches treated as tokens, with minimal vision-specific inductive biases. When pre-trained on large datasets and transferred to downstream tasks, ViT matched or exceeded state-of-the-art convolutional networks while requiring fewer computational resources to train. It demonstrated that convolutions are not necessary for strong image recognition at scale.

vision transformer image classification transformers computer vision

AIAdvances in Neural Information Processing Systems 33 (NeurIPS 2020) · Jun 2020 Open access

Denoising Diffusion Probabilistic Models

Jonathan Ho, Ajay Jain and Pieter Abbeel

The paper introduces denoising diffusion probabilistic models (DDPMs), a class of latent-variable generative models trained to reverse a fixed Gaussian noising process. It establishes a connection between diffusion models and denoising score matching with Langevin dynamics, and proposes a simplified, reweighted training objective. The resulting models produce high-quality image samples, achieving competitive log-likelihoods and a strong FID on CIFAR-10.

diffusion models generative models image synthesis deep learning

AIAdvances in Neural Information Processing Systems 30 (NeurIPS 2017) · Jun 2017 Open access

Attention Is All You Need

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, et al.

The paper introduced the Transformer, a sequence-transduction architecture based entirely on attention mechanisms, dispensing with the recurrence and convolutions used by prior state-of-the-art models. By relying on multi-head self-attention, the model is more parallelizable and trains substantially faster, while achieving new state-of-the-art results on machine translation. The architecture became the foundation for subsequent large language models and much of modern deep learning.

deep learning transformers attention natural language processing

AI2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) · Jun 2016 Open access

Deep Residual Learning for Image Recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren and Jian Sun

The authors introduced a residual learning framework that reformulates network layers to learn residual functions with reference to their inputs (via identity 'shortcut' connections), making very deep networks substantially easier to optimize. They showed that such residual networks gain accuracy from greatly increased depth, evaluating models up to 152 layers deep on ImageNet at lower complexity than VGG networks. The approach won first place in the ILSVRC 2015 classification task and yielded large improvements on detection and localization benchmarks.

deep learning computer vision convolutional neural networks image classification

AINature · Jan 2016

Mastering the game of Go with deep neural networks and tree search

David Silver, Aja Huang, Chris J. Maddison and Demis Hassabis

This paper introduced AlphaGo, a system combining deep convolutional neural networks (policy and value networks) trained by supervised learning from human games and reinforcement learning by self-play, integrated with Monte Carlo tree search. The networks reduce the breadth and depth of the search needed to evaluate Go positions. AlphaGo defeated other Go programs and became the first program to beat a professional human Go player (Fan Hui) on a full-size board.

deep learning reinforcement learning monte carlo tree search game of go

AIMedical Image Computing and Computer-Assisted Intervention (MICCAI 2015), LNCS vol. 9351, pp. 234-241 · Oct 2015 Open access

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, Philipp Fischer and Thomas Brox

The paper introduces U-Net, an encoder-decoder convolutional network with a contracting path to capture context and a symmetric expanding path with skip connections for precise localization. Combined with heavy data augmentation, the architecture trains end-to-end from very few annotated images. It won the ISBI cell-tracking and neuronal-structure segmentation challenges and segments a 512x512 image in under a second on a GPU.

image segmentation convolutional neural networks biomedical imaging deep learning

AIICML 2015 (32nd International Conference on Machine Learning) · Jul 2015 Open access

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe and Christian Szegedy

This paper introduced batch normalization, a technique that normalizes layer inputs using mini-batch statistics to reduce internal covariate shift during training. It allows higher learning rates and less careful initialization, accelerates convergence, and acts as a regularizer. Applied to image classification networks, it dramatically reduced training steps and improved accuracy.

batch normalization deep learning neural network training regularization

AIICLR 2015 (3rd International Conference on Learning Representations) · May 2015 Open access

Adam: A Method for Stochastic Optimization

Diederik P. Kingma and Jimmy Ba

This paper introduced Adam, a first-order gradient-based optimization algorithm for stochastic objective functions that computes adaptive per-parameter learning rates from estimates of the first and second moments of the gradients. The method is computationally efficient, has low memory requirements, and is well suited to large-scale and noisy/sparse-gradient problems. It became one of the most widely used optimizers in deep learning.

optimization stochastic gradient descent deep learning adaptive learning rate