John Andrew Gunnels from 10 Londonderry Ln, Somers Town, NY 10589, age 59, Phone: (914) 244-6233

Resumes

Principal Engineer At Aws

Location:

Yorktown Heights, NY

Industry:

Research

Work:

Amazon Web Services
Principal Engineer at Aws
Ibm
Distinguished Research Staff Member, Manager, Senior Manager, and Program Director at Ibm Research

Education:

Texas State University
Bachelors The University of Texas at Austin
Doctorates, Doctor of Philosophy, Computer Science

Skills:

High Performance Computing
Algorithms
Distributed Systems
Parallel Computing
Scalability
Computer Science
Parallel Programming
Software Engineering
Simulations
Software Development
Big Data
Optimization
Linux
Data Mining
Machine Learning
C
Python
Cloud Computing
C++
Unix
Analytics
Hadoop
Shell Scripting
Scientific Computing
High Performance Computing
Mpi
Computer Architecture
Management
Mathematical Programming
Fortran
Java
Mathematica
Research
Ibm Power
Multithreaded Application Development
Molecular Dynamics
Assembly Language
Latex
System Architecture
Matlab
Large Scale Optimization
Programming
Architecture
Scheme
Lisp

Interests:

Children
Environment
Education
Science and Technology
Disaster and Humanitarian Relief
Animal Welfare
Health

Certifications:

Introduction To Quantum Computing
Quantum Algorithms For Cybersecurity, Chemistry, and Optimization
Practical Realities of Quantum Computation and Quantum Communication
Requirements For Large-Scale Universal Quantum Computation
Applications of Quantum Computing
Patent Issuance

John Gunnels

John Ed Gunnels

John Gunnels

Publications

Us Patents

Method And Structure For An Improved Data Reformatting Procedure

US Patent:

7555604, Jun 30, 2009

Filed:

Jan 9, 2006

Appl. No.:

11/328344

Inventors:

Siddhartha Chatterjee - Yorktown Heights NY, US
John A. Gunnels - Brewster NY, US
Fred Gehrung Gustavson - Briarcliff Manor NY, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 12/12

US Classification:

711118

Abstract:

A method (and structure) of managing memory in which a low-level mechanism is executed to signal, in a sequence of instructions generated at a higher level, that at least a portion of a contiguous area of memory is permitted to be overwritten.

Performance Evaluation Of Algorithmic Tasks And Dynamic Parameterization On Multi-Core Processing Systems

US Patent:

7793011, Sep 7, 2010

Filed:

May 29, 2008

Appl. No.:

12/129245

Inventors:

John A. Gunnels - Yorktown Heights NY, US
Shakti Kapoor - Austin TX, US
Ravi Kothari - New Delhi, IN
Yogish Sabharwal - New Delhi, IN
James C. Sexton - Danbury CT, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 13/28
G06F 11/30
G06F 9/455
G06G 7/62

US Classification:

710 22, 710 23, 710 28, 702182, 702186, 703 13, 703 23

Abstract:

A method for evaluating performance of DMA-based algorithmic tasks on a target multi-core processing system includes the steps of: inputting a template for a specified task, the template including DMA-related parameters specifying DMA operations and computational operations to be performed; evaluating performance for the specified task by running a benchmark on the target multi-core processing system, the benchmark being operative to generate data access patterns using DMA operations and invoking prescribed computation routines as specified by the input template; and providing results of the benchmark indicative of a measure of performance of the specified task corresponding to the target multi-core processing system.

Method And Structure For Fast In-Place Transformation Of Standard Full And Packed Matrix Data Formats

US Patent:

7844630, Nov 30, 2010

Filed:

Feb 19, 2008

Appl. No.:

12/033581

Inventors:

Fred Gehrung Gustavson - Briarcliff Manor NY, US
John A. Gunnels - Yorktown Heights NY, US
James C. Sexton - Danbury CT, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 17/00

US Classification:

707791, 707802, 707822, 707828

Abstract:

A computerized method provides for an in-place transformation of matrix A data including a New Data Structure (NDS) format and a transformation T having a compact representation. The NDS represents data of the matrix A in a format other than a row major format or a column major format, such that the data for the matrix A is stored as contiguous sub matrices of size MB by NB in an order predetermined to provide the data for a matrix processing. The transformation T is applied to the MB by NB blocks, using an in-place transformation processing, thereby replacing data of the block A with the contents of T(A).

System And Method For Detecting A Faulty Object In A System

US Patent:

7853820, Dec 14, 2010

Filed:

Oct 22, 2008

Appl. No.:

12/256355

Inventors:

John A. Gunnels - Brewster NY, US
Fred Gehrung Gustavson - Briarcliff Manor NY, US
Robert Daniel Engle - St. Louis MO, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 11/00

US Classification:

714 4, 709223, 709224

Abstract:

A method (and system) for detecting at least one faulty object in a system including a plurality of objects in communication with each other in an n-dimensional architecture, includes probing a first plane of objects in the n-dimensional architecture and probing at least one other plane of objects in the n-dimensional architecture which would result in identifying a faulty object in the system.

Performance Evaluation Of Algorithmic Tasks And Dynamic Parameterization On Multi-Core Processing Systems

US Patent:

8037215, Oct 11, 2011

Filed:

May 30, 2008

Appl. No.:

12/130167

Inventors:

John A. Gunnels - Yorktown Heights NY, US
Shakti Kapoor - Austin TX, US
Ravi Kothari - New Delhi, IN
Yogish Sabharwal - New Delhi, IN
James C. Sexton - Danbury CT, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 13/28
G06F 17/50

US Classification:

710 22, 710 23, 710 24, 710 25, 710 26, 710 27, 710 28, 703 13

Abstract:

Apparatus for evaluating the performance of DMA-based algorithmic tasks on a target multi-core processing system includes a memory and at least one processor coupled to the memory. The processor is operative: to input a template for a specified task, the template including DMA-related parameters specifying DMA operations and computational operations to be performed; to evaluate performance for the specified task by running a benchmark on the target multi-core processing system, the benchmark being operative to generate data access patterns using DMA operations and invoking prescribed computation routines as specified by the input template; and to provide results of the benchmark indicative of a measure of performance of the specified task corresponding to the target multi-core processing system.

Method And Structure For Skewed Block-Cyclic Distribution Of Lower-Dimensional Data Arrays In Higher-Dimensional Processor Grids

US Patent:

8055878, Nov 8, 2011

Filed:

Feb 8, 2005

Appl. No.:

11/052216

Inventors:

Siddhartha Chatterjee - Yorktown Heights NY, US
John A. Gunnels - Brewster NY, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 15/00

US Classification:

712 10

Abstract:

A method and structure of distributing elements of an array of data in a computer memory to a specific processor of a multi-dimensional mesh of parallel processors includes designating a distribution of elements of at least a portion of the array to be executed by specific processors in the multi-dimensional mesh of parallel processors. The pattern of the designating includes a cyclical repetitive pattern of the parallel processor mesh, as modified to have a skew in at least one dimension so that both a row of data in the array and a column of data in the array map to respective contiguous groupings of the processors such that a dimension of the contiguous groupings is greater than one.

Reducing Bandwidth Requirements For Matrix Multiplication

US Patent:

8250130, Aug 21, 2012

Filed:

May 30, 2008

Appl. No.:

12/129789

Inventors:

Daniel A. Brokenshire - Round Rock TX, US
John A. Gunnels - Yorktown Heights NY, US
Michael D. Kistler - Pflugerville TX, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 17/16

US Classification:

708607

Abstract:

A block matrix multiplication mechanism is provided for reversing the visitation order of blocks at corner turns when performing a block matrix multiplication operation in a data processing system. The mechanism increases block size and divides each block into sub-blocks. By reversing the visitation order, the mechanism eliminates a sub-block load at the corner turns. The mechanism performs sub-block matrix multiplication for each sub-block in a given block, and then repeats operation for a next block until all blocks are computed. The mechanism may determine block size and sub-block size to optimize load balancing and memory bandwidth. Therefore, the mechanism reduces maximum throughput and increases performance. In addition, the mechanism also reduces the number of multi-buffered local store buffers.

Optimized Scalar Promotion With Load And Splat Simd Instructions

US Patent:

8255884, Aug 28, 2012

Filed:

Jun 6, 2008

Appl. No.:

12/134495

Inventors:

Alexandre E. Eichenberger - Chappaqua NY, US
Michael K. Gschwind - Chappaqua NY, US
John A. Gunnels - Yorktown Heights NY, US

Assignee:

International Business Machines Corporation - Armonk NY

International Classification:

G06F 9/45
G06F 9/44

US Classification:

717136, 717106, 717141, 717144, 717146, 717149, 712 4

Abstract:

Mechanisms for optimizing scalar code executed on a single instruction multiple data (SIMD) engine are provided. Placement of vector operation-splat operations may be determined based on an identification of scalar and SIMD operations in an original code representation. The original code representation may be modified to insert the vector operation-splat operations based on the determined placement of vector operation-splat operations to generate a first modified code representation. Placement of separate splat operations may be determined based on identification of scalar and SIMD operations in the first modified code representation. The first modified code representation may be modified to insert or delete separate splat operations based on the determined placement of the separate splat operations to generate a second modified code representation. SIMD code may be output based on the second modified code representation for execution by the SIMD engine.

Isbn (Books And Publications)

Using Plapack: Parallel Linear Algebra Package

Author

John Gunnels

ISBN #

0262720264

Videos & Images

Flickr

Youtube

2005 Grady White 33 Express For Sale

www.marinemax.co... Grady White is built to a higher standard of quai...

Category:

Autos & Vehicles

Uploaded:

08 Nov, 2010

Duration:

49s

Imagine - John Lenon cover

This is just a chill little version of the song...I hope you like it! ...

Category:

Music

Uploaded:

09 May, 2009

Duration:

3m 5s

My top 10 Death Metal singer *New*

10:D. Randal Blythe/Ruin 9:Jonny Davy/The Rising Tide 8:Steve Marois/ ...

Category:

Music

Uploaded:

17 Jul, 2008

Duration:

5m 15s

John Gunnels Blue Day

John Gunnels Blue Day.

Duration:

2m 26s

John Gunnels Crazy Heart

Crazy Heart b y; John Gunnels.

Duration:

2m 39s

John Gunnels- REALTOR with JPAR REAL ESTATE

Your Austin and Central Texas Local Real Estate Agent. John Gunnels JP...

Duration:

16s

Coachlining

We're back with a short video on the techniques we use for applying a ...

Duration:

15m 3s

George Gunnels | Arizona Biltmore Dentistry Patien...

htttp://arizonab... Welcome to Arizona Biltmore Dentistry At Arizona ...

Duration:

1m 9s

John Andrew Gunnels

John Gunnels Phones & Addresses

Resumes

Resumes

Principal Engineer At Aws

John Gunnels

John Gunnels

John Ed Gunnels

John Gunnels

John Gunnels

Publications

Us Patents

Method And Structure For An Improved Data Reformatting Procedure

Performance Evaluation Of Algorithmic Tasks And Dynamic Parameterization On Multi-Core Processing Systems

Method And Structure For Fast In-Place Transformation Of Standard Full And Packed Matrix Data Formats

System And Method For Detecting A Faulty Object In A System

Performance Evaluation Of Algorithmic Tasks And Dynamic Parameterization On Multi-Core Processing Systems

Method And Structure For Skewed Block-Cyclic Distribution Of Lower-Dimensional Data Arrays In Higher-Dimensional Processor Grids

Reducing Bandwidth Requirements For Matrix Multiplication

Optimized Scalar Promotion With Load And Splat Simd Instructions

Isbn (Books And Publications)

Using Plapack: Parallel Linear Algebra Package

John Gunnels

John Gunnels

John Gunnels

john gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Riley Gunnels

John Gunnels (John)

John Gunnels, Redmond Hig...

John Gunnels, Humble High...

Berry Elementary School, ...

C.E. Barrick Elementary S...

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels

John Gunnels