MatsuLab. Lecture Note
High Performance Computing †
- Date and Time
- Mondays, 10:45–12:15 (periods 3–4)
- Location
- Room 832, West Building 8 (西8号館)
- Contact
| Prof. S. Matsuoka (松岡教授) | matsu at is. |
| TA: H. Kanezashi (金刺) | kanezashi at matsulab.is. |
To be added to the mailing list, please email the TA (Kanezashi) as soon as possible.
Cancelled Lectures †
10/19, 11/16, 02/08 (no supplementary lectures will be held)
Course Overview and References †
Presentation Schedule †
The tentative assignments are as follows; if your assigned date is inconvenient, please email the TA with your preferred date.
Prohibited List †
- Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core Coprocessor
- Memory fast-forward: A low cost special function unit to enhance energy efficiency in GPU for big data processing
- Optimized Deep Learning Architectures with Fast Matrix Operation Kernels on Parallel Platform
- Real-time anomaly detection in hyperspectral images using multivariate normal mixture models and GPU processing
- Hyperspectral Unmixing on GPUs and Multi-Core Processors: A Comparison
- Performance versus energy consumption of hyperspectral unmixing algorithms on multi-core platforms
- Optimizing communication and cooling costs in HPC data centers via intelligent job allocation
- Cost Minimization for Big Data Processing in Geo-Distributed Data Centers
- On Characterization of Performance and Energy Efficiency in Heterogeneous HPC Cloud Data Centers
- DaDianNao: A Machine-Learning Supercomputer
- Mariana: tencent deep learning platform and its applications
- Performance Modeling and Scalability Optimization of Distributed Deep Learning Systems
- Asynchronous parallel stochastic gradient descent: a numeric core for scalable distributed machine learning algorithms
- FireCaffe: near-linear acceleration of deep neural network training on compute clusters
- CA-SVM: Communication-Avoiding Support Vector Machines on Distributed Systems
- Large Scale Distributed Deep Networks
- Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks
- 24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs
- Massively Parallel Models of the Human Circulatory System
- Moving to memoryland: in-memory computation for existing applications
- Intelligent SSD: A Turbo for Big Data Mining
- Scalable Multi-Access Flash Store for Big Data Analytics
- An FPGA-Based Tightly Coupled Accelerator for Data-Intensive Applications
- A reconfigurable fabric for accelerating large-scale datacenter services
- An FPGA-based In-Line Accelerator for Memcached
Final Report †
- Due date: 02/08
- Summarize the general topic, covering and including all three papers, with respect to the state of the art in HPC and Big Data convergence.
- The report should be 10 pages in IEEE conference paper format.
- Please submit it to the TA by email.
Links †