SUPERCOMPUTING FRONTIERS 2017
MARCH 13 – 16, 2017
Matrix@Biopolis, Singapore
Time: 9.00am – 5.00pm
Venue: Level 4, Matrix Building, Biopolis
Breaks:
Morning Tea Break – 10.30am – 11.00am
Lunch Break – 1.00pm – 2.00pm
Afternoon Tea Break – 3.30pm – 4.00pm
Presenter(s):
Nicolas Walker, Senior Solutions Architect, NVIDIA SEA
Jeff Adie, HPC / Deep Learning Solutions Architect, NVIDIA APJ
Aik Beng Ng, Deep Learning Solutions Architect, NVIDIA APJ
You can verify that your system is supported by going to this test site and confirming that the “WebSockets (Port 80)” section shows all green check marks.
Abstract:
Learn the latest techniques on how to design, train, and deploy neural network-powered machine learning in your applications. You’ll explore widely used open-source frameworks and NVIDIA’s latest GPU-accelerated deep learning platforms. You will learn:
Lab 1:
Getting Started with Deep Learning
Learn how to leverage deep neural networks (DNN) within the deep learning workflow to solve a real-world image classification problem using NVIDIA DIGITS. You’ll walk through the process of data preparation, model definition, model training and troubleshooting, validation testing and strategies for improving model performance using GPUs. On completion of this lab, you will be able to use NVIDIA DIGITS to train a DNN on your own image classification application.
Pre-Requisite:
Lab 3:
Neural Network Deployment
Abstract: Once a deep neural network (DNN) has been trained using GPU acceleration, it needs to be deployed into production. This next step is called inference, as it uses the trained DNN to make predictions from new data.
In this lab we will show different approaches to deploying a trained DNN for inference. The first is to use the inference functionality within a deep learning framework, in this case DIGITS and Caffe. The second is to integrate inference into a custom application through a deep learning framework API, again using Caffe but this time through its Python API. The final approach is to use NVIDIA TensorRT™, which automatically creates an optimized inference runtime from a trained Caffe model and network description file. You will learn about the role of batch size in inference performance, as well as various optimizations that can be made in the inference process. You’ll also explore inference for a variety of DNN architectures trained in other DLI labs.
Pre-Requisite:
Time: 9.00am – 5.00pm
Venue: Level 4, Matrix Building, Biopolis
Breaks:
Morning Tea Break – 10.30am – 11.00am
Lunch Break – 1.00pm – 2.00pm
Afternoon Tea Break – 3.30pm – 4.00pm
Presenter(s):
Rama Kishan Malladi, Application Engineer, Intel Software Service Group
Jyotsna Khemka, Technical Consulting Engineer, Intel Software Service Group
Abstract:
Slot 1: 9:00 – 10:30
Intel® Architecture for Software Developers
Learn about the parallel architecture, technical advances, and features of the latest and upcoming Intel processors, in particular Xeon and Xeon Phi (the newest being Knights Landing). Includes a brief introduction to the latest Intel® architecture and an overview of the Intel® Xeon Phi™ coprocessor architecture.
Slot 2: 11:00 – 1:00
Vectorization for Intel® C++ & Fortran Compiler
• Introduction to SIMD for Intel® Architecture
• Vector Code Generation
• Compiler & Vectorization
• Validating Vectorization Success
• Reasons for Vectorization Failures
• Vectorization of Special Program Constructs & Loops
Slot 3: 12:00 – 1:00
Understanding vectorization and how it impacts performance: Vectorization Advisor
• Features
• Workflow
• Understanding the results
• Demo
Slot 4: 2:00 – 3:30
Intel® Advisor Roofline
A Roofline chart is a visual representation of application performance in relation to hardware limitations, including memory bandwidth and computational peaks.
The Roofline provides insight into:
• Where your performance bottlenecks are
• How much performance is left on the table because of them
• Which bottlenecks are possible to address, and which ones are worth addressing
• Why these bottlenecks are most likely occurring
• What your next steps should be
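The roofline model underlying the chart can be stated compactly (this is the standard formulation, not specific to Intel® Advisor):

```latex
P_{\text{attainable}} = \min\bigl(P_{\text{peak}},\; B_{\text{mem}} \cdot I\bigr),
\qquad
I = \frac{\text{floating-point operations}}{\text{bytes moved to/from memory}}
```

A kernel whose arithmetic intensity $I$ puts it under the sloped bandwidth roof is memory-bound; one under the flat roof is compute-bound, which tells you which bottleneck is worth addressing.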
New Era for OpenMP*: Beyond Traditional Shared Memory Parallel Programming
OpenMP* has been the de facto standard for parallel programming on shared memory systems. OpenMP 4.5 added major enhancements to the OpenMP API to keep pace with the latest technological trends in HPC and beyond. The new features in OpenMP 4.5 significantly improve the expressiveness of OpenMP on modern architectures and increase its applicability to complex application codes. This presentation describes the evolution of OpenMP and provides an overview of the major new elements, including support for accelerators/coprocessors/GPUs, SIMD extensions, doacross loops, taskloop, task priority, and thread affinity.
Slot 5: 4:00 – 5:00
Intel’s Machine Learning
Intel is committed to AI and is making major investments across technology, training, resources and R&D to advance AI for business and society. In this session, we cover the Intel portfolio – software and hardware solutions for machine learning in the data center, from general purpose (Xeon, Xeon Phi) to targeted silicon (FPGA and Nervana technology). At the edge, Intel also offers a portfolio of processors (Core, Atom, Joule, etc.) that utilize common intelligent APIs for distributed and collaborative intelligence.
Time: 9.00am – 5.00pm
Venue: Level 4, Matrix Building, Biopolis
Breaks:
Morning Tea Break – 10.30am – 11.00am
Lunch Break – 1.00pm – 2.00pm
Afternoon Tea Break – 3.30pm – 4.00pm
Presenter(s):
Patrick Wohlschlegel
Time: 9.00am – 1.00pm
Venue: Level 4, Matrix Building, Biopolis
Breaks:
Morning Tea Break – 10.30am – 11.00am
Lunch Break – 1.00pm – 2.00pm
Presenter(s):
Jeremy Kerr, Open POWER Architect, IBM Australia
Abhisek Chatterjee, Program Director – Open Source Technology Development & OpenPOWER Enablement, IBM Australia
Abstract:
The OpenPOWER project delivers the high-performance and highly parallel POWER processor architecture on an entirely open source software stack – including workload applications, the operating system, and even low-level system firmware.
This technically focused tutorial describes the evolution of the OpenPOWER hardware and software architecture, and our leading design principles.
We will explain the initialisation and management processes of OpenPOWER platforms, particularly in the context of high performance computing environments.
As OpenPOWER is specifically designed for Linux, we are able to make targeted optimisations for the Linux operating system and infrastructure.
We’ll cover the key details of the operating system implementation, and how it integrates with the platform firmware. In addition to implementing the base platform, we have been porting and optimising common HPC workloads and infrastructure to OpenPOWER.
The tutorial will provide detail on those efforts, and explain some of the deployments of those so far.
For those working with their own code, we’ll discuss some key guidelines and hints for running workloads efficiently on OpenPOWER, and the compilers and development tools that are available.
Time: 9.00am – 1.00pm
Venue: Matrix Building, Biopolis
Breaks:
Morning Tea Break – 10.30am – 11.00am
Lunch Break – 1.00pm – 2.00pm
Presenter(s):
Dr. Hui Liang, Lead Data Scientist, APJ Innovation Center, Hewlett Packard Enterprise, Singapore
Mr. Wei Ann Lim, Data Scientist, APJ Innovation Center, Hewlett Packard Enterprise, Singapore
Track 1: Machine Learning
Introduction to Machine Learning
Linear Regression
Logistic Regression
Clustering
Track 2: Text Analytics
Introduction to Text Analysis
Applications
Specific Techniques for Text Analysis
Track 3: HPC + Data Analytics
Introduction to Deep Learning
Application of Deep Learning in Natural Language Processing
Updated as of March 7, 2017
Day 1 | Tuesday, March 14, 2017

08:00 | Registration
09:00 | Welcome Address – Tan Tin Wee, Organising Chairman, National Supercomputing Centre Singapore
09:05 | Opening Speech – Peter Ho, Steering Committee Chairman, National Supercomputing Centre Singapore
09:20 | Inaugural National Supercomputing Centre Singapore (NSCC) Awards
09:40 | Keynote: Gordon Bell Prize, Three Decades: Motivating and Measuring HPC Progress – Gordon Bell, Microsoft Research, USA
10:40 | Tea Break
11:00 | Keynote: The Opportunities and Challenges of Exascale Computing – Thom H. Dunning, Jr., Northwest Institute for Advanced Computing, USA
12:00 | Industry Sponsor: Directions Towards Common Technologies for HPC and Big Data Analytics – Joseph Curley, Intel, USA
12:30 | Lunch
13:30 | Keynote: Sunway TaihuLight: Designing and Tuning Scientific Applications at the Scale of 10-Million Cores – Haohuan Fu, National Supercomputing Center in Wuxi, China
14:30 | Industry Sponsor: Compute the Convergence of HPC, Big Data and Machine Intelligence – Cheng Jang Thye, Fujitsu, Singapore
15:00 | Tea Break
15:30 – 18:00 | Parallel Tracks
Scientific Track (Chair: Lou Jing) – Applications: Engineering and CFD
Industry Track (Chair: Leong Wai Meng) – New Technologies
END OF DAY 1
Day 2 | Wednesday, March 15, 2017

08:00 | Registration
09:00 | Keynote: Cognitive Discovery: The Next Frontier of R&D – Alessandro Curioni, IBM Research Laboratory, Switzerland
10:00 | Industry Sponsor: GPU Deep Learning Powering a New Computing Model – Marc Hamilton, NVIDIA Corporation
10:40 | Tea Break
11:00 – 12:50 | Parallel Tracks
Scientific Track (Chair: Marek Michalewicz) – Applications: Bio & Health
Industry Track (Chair: Terence Chan) – Artificial Intelligence (AI) and Deep Learning (DL)
12:50 | Lunch
13:50 – 15:40 | Parallel Tracks
Scientific Track (Chair: Bingsheng He) – Applications: Bio & Health
14:40 – 15:00 | Data Science in High Resolution Imaging of Sample Heterogeneity
15:00 – 15:20 | Multi-GPU Deep Learning Techniques for Tendon Healing in Regenerative Stem Cells Based Medicine – Norbert Kapinski, Jakub Zielinski, Bartosz Borucki & Krzysztof Nowinski, ICM, University of Warsaw, Poland
15:20 – 15:40 | Franciszek Rakowski, Piotr Bala, Łukasz Górski & Jan Karbowski, ICM, University of Warsaw, Poland
Industry Track (Chair: Gabriel Noaje) – Artificial Intelligence (AI) and Deep Learning (DL)
15:40 | Tea Break
16:00 – 18:00 | Parallel Tracks
Scientific Track (Chair: Francis Lee) – Country Reports
Industry Track (Chair: Chianne Chong) – Artificial Intelligence (AI) and Deep Learning (DL); Data Centre Solutions (17:30 – 17:50)
18:00 | Travel to Restaurant
18:30 – 20:30 | Networking Reception @ The China Club
END OF DAY 2