Welcome to Next.js!

Get started by editing pages/index.js

Andrew Gibiansky :: Math → [Code]
- Bringing HPC Techniques to Deep Learning 21/2/2017
- NRAM: Theano Implementation 5/6/2016
- Cool Linear Algebra: Singular Value Decomposition 29/5/2013
Jeremy Jordan - Thoughts, ideas, and new things I've learned.
- Understanding the Transformer architecture for neural networks 6/5/2023
Eugene Yan - I design, build, and operate machine learning systems that serve customers at scale. Currently, I'm a Senior Applied Scientist at Amazon. Outside of work, I also...
- Don't Mock Machine Learning Models In Unit Tests 25/2/2024
- Text-to-Image: Diffusion, Text Conditioning, Guidance, Latent Space 27/11/2022
David Stutz - I am a research scientist at Google DeepMind interested in making generative AI safe for everyone to use through research on uncertainty estimation, watermarking, adversarial robustness, or out-of-distribution detection, among other topics. Previously, I focused on computer vision, including 3D reconstruction and image segmentation.
- ArXiv Pre-Print - Adversarial Training against Location-Optimized Adversarial Patches 25/5/2020
- Implementing Custom PyTorch Tensor Operations in C and CUDA 20/04/2020

Understanding the Transformer architecture for neural networks

The attention mechanism allows us to merge a variable-length sequence of vectors into a fixed-size context vector. What if we could use this mechanism to entirely replace recurrence for sequential modeling? This blog post covers the Transformer architecture which explores such an approach.

Hi, I'm Eugene Yan, an ML engineer.

Don't Mock Machine Learning Models In Unit Tests

How unit testing machine learning code differs from typical software practices

25 Feb 2024 · 6 min · machinelearning engineering

Andrew Gibiansky :: Math → [Code]

Bringing HPC Techniques to Deep Learning

Feb 21

NRAM: Theano Implementation

Jun 16

DAVIDSTUTZ

I am a research scientist at Google DeepMind interested in making generative AI safe for everyone to use through research on uncertainty estimation, watermarking, adversarial robustness, or out-of-distribution detection, among other topics. Previously, I focused on computer vision, including 3D reconstruction and image segmentation.

APRIL2020

ARTICLE

Implementing Custom PyTorch Tensor Operations in C and CUDA

C CUDA DEEP LEARNING PYTORCH

PyTorch, alongside TensorFlow, has become standard among deep learning researchers and practitioners. While PyTorch provides a large variety in terms of tensor operations or deep learning layers, some specialized operations still need to be implemented manually. In cases where runtime is crucial, this should be done in C or CUDA for supporting both CPU and GPU computation. In this article, I want to provide a simple example and framework for extending PyTorch with custom C and CUDA operations using CFFI for Python and CuPy.

More ...

MAY2020

ARTICLE

ArXiv Pre-Print “Adversarial Training against Location-Optimized Adversarial Patches”

ADVERSARIAL MACHINE LEARNING COMPUTER VISION DEEP LEARNING PUBLICATION

While robustness against imperceptible adversarial examples is well-studied, robustness against visible adversarial perturbations such as adversarial patches is poorly understood. In this pre-print, we present a practical approach to obtain adversarial patches while actively optimizing their location within the image. On Cifar10 and GTSRB, we show that adversarial training on these location-optimized adversarial patches improves robustness significantly while not reducing accuracy.

More ...

Cool Linear Algebra: Singular Value Decomposition

May 29, 2013

https://github.com/gibiansky/experiments

Projects:

lbm/ - Simple implementation of the Lattice Boltzmann method; described a little bit on one of my blog posts.
fluids/ - Experiments with a fluid simulation method known as Smoothed Particle Hydrodynamics. Includes a complete description of the simulation method, starting from fundamental fluid mechanics to derive the Navier-Stokes equations, followed by a derivation of the simulation steps from the Navier-Stokes equations. Also includes a very quick and dirty Java implementation of the simulation method. Also available as a pair of blog posts for more detail.
morph/ - Experiments with simple image interpolation / morphing. Take a look at the blog post for more detail.
neural-network/ - An implementation of a feed-forward neural network in MATLAB for handwritten digit recognition. For information on neural networks, take a look at the following explanation.
cpu/ - A toy CPU created in Verilog for use with a Cyclone II Altera Starter Kit FPGA board. The CPU executes (compiled versions of) a custom MIPS-like assembly language; the compiler for this language and the description are in simple/.
simple/ - A toy C-like language called "Simple", which compiles to a custom MIPS-like assembly, which can execute on a custom CPU for a Cyclone II FPGA board.
quadcopter/ - A detailed description of quadcopter motion dynamics and the control used for quadcopter stabilization. Includes a simulation and movie.

lbm/ - Simple implementation of the Lattice Boltzmann method; described a little bit on one of my blog posts.
fluids/ - Experiments with a fluid simulation method known as Smoothed Particle Hydrodynamics. Includes a complete description of the simulation method, starting from fundamental fluid mechanics to derive the Navier-Stokes equations, followed by a derivation of the simulation steps from the Navier-Stokes equations. Also includes a very quick and dirty Java implementation of the simulation method. Also available as a pair of blog posts for more detail.
morph/ - Experiments with simple image interpolation / morphing. Take a look at the blog post for more detail.
neural-network/ - An implementation of a feed-forward neural network in MATLAB for handwritten digit recognition. For information on neural networks, take a look at the following explanation.
cpu/ - A toy CPU created in Verilog for use with a Cyclone II Altera Starter Kit FPGA board. The CPU executes (compiled versions of) a custom MIPS-like assembly language; the compiler for this language and the description are in simple/.
simple/ - A toy C-like language called "Simple", which compiles to a custom MIPS-like assembly, which can execute on a custom CPU for a Cyclone II FPGA board.
quadcopter/ - A detailed description of quadcopter motion dynamics and the control used for quadcopter stabilization. Includes a simulation and movie.

Welcome to Next.js!

In this article

Understanding the Transformer architecture for neural networks

Hi, I'm Eugene Yan, an ML engineer.

Don't Mock Machine Learning Models In Unit Tests

Further reading

Text-to-Image: Diffusion, Text Conditioning, Guidance, Latent Space

Andrew Gibiansky :: Math → [Code]

Bringing HPC Techniques to Deep Learning

NRAM: Theano Implementation

DAVIDSTUTZ

ARTICLE

Implementing Custom PyTorch Tensor Operations in C and CUDA

ARTICLE

ArXiv Pre-Print “Adversarial Training against Location-Optimized Adversarial Patches”

Cool Linear Algebra: Singular Value Decomposition

Documentation →

Learn →

Examples →

Deploy →