A comprehensive list of training resources from the HPC University. HPCU is a virtual organization whose primary goal is to provide a cohesive,… more
Learning
debugging hpc-operations professional-development
Beginner, Intermediate, Advanced
Cornell Virtual Workshop is a comprehensive training resource for high performance computing topics. The Cornell University Center for Advanced… more
Learning
jetstream stampede2 cloud-computing
Beginner, Intermediate, Advanced
We teach foundational coding and data science skills to researchers worldwide.
Website
Beginner, Intermediate, Advanced
The documentation provides an overview of using Pegasus, a workflow management system, on ACCESS resources for high throughput computing (HTC)… more
Docs
Beginner, Intermediate, Advanced
This workshop focuses on developing an understanding of the fundamentals of attention and the transformer architecture so that you can understand how… more
Learning
ai deep-learning machine-learning
Intermediate
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. It makes analyzing and presenting your… more
Docs
Beginner, Intermediate
DeapSECURE is a training program to infuse high-performance computational techniques into cybersecurity research and education. It is an NSF-funded… more
Learning
ai deep-learning machine-learning
Beginner
This course from MIT OpenCourseWare (OCW) covers very basic information on how to get started with programming using Python. Lectures are available,… more
Learning
Beginner
Geocoding is the process of taking a street address and converting it into coordinates that can be plotted on a map. This conversion typically… more
Docs
Beginner, Intermediate
An HPC focused Carpentry community. Trainings include: HPC fundamentals, python, chapel, LAMMPS, parallelization with python, scaling studies, etc.
Website
Beginner, Intermediate, Advanced
This workshop series introduces the essential concepts in deep learning and walks through the common steps in a deep learning workflow from data… more
Learning
ai deep-learning image-processing
Beginner, Intermediate
This documentation contains introductory material on Python Programming for Digital Humanities and Computational Research. This can be a go-to… more
Docs
Beginner
This workshop will go into the different ways python packages can be managed in a cluster environment using conda and python virtual environments… more
Docs
documentation pytorch data-science
Intermediate
Open OnDemand is an easy-to-use web portal that lets students, researchers, and industry professionals use supercomputers from anywhere. It is… more
Website
ondemand administering-hpc cluster-management
Beginner, Intermediate, Advanced
PyTorch is a Python library that supports accelerated GPU processing for Machine Learning and Deep Learning. In this tutorial, I will teach the… more
Docs
Beginner
This Udacity article listed the most frequently used R packages for data science and statistics. For each package, the article provided the link to… more
Docs
plotting visualization data-analysis
Beginner, Intermediate, Advanced
Learn how to use Linux commands in a python script. Specifically, learn how to use the subprocess and os modules in python to run shell commands (… more
Learning
cluster-management programming python
Beginner, Intermediate
Understand the benefits of an automated version control system and the basics of how automated version control systems work. Configure git the first… more
Learning
Beginner
A question and answer forum for neuroscience researchers, infrastructure providers and software developers.
Website
documentation image-processing data-sharing
Beginner, Intermediate, Advanced
These links take you to visualization resources supported by the University of Arizona's HPC visualization consultant ([rtdatavis.github.io](… more
Docs
Intermediate, Advanced
pip stands for "pip installs packages". It's the go-to package manager for Python, allowing developers to install, update, and manage… more
Learning
Beginner, Intermediate
This website is an interactive introduction to Gaussian Belief Propagation (GBP). A probabilistic inference algorithm that operates by passing… more
Website
Beginner, Intermediate
This is a short video on how to exchange ACCESS credits and connect to Jetstream 2 (please note this was created for Duke users but applies to all) .
Video
access-account ACCESS-credits exchange-request
Beginner
ACCESS requests proposals to be written following NSF proposal guidelines. The link provides an example of an ACCESS proposal using an NSF LaTeX… more
Learning
allocations-proposal proposal-request research-facilitation
Beginner
A guide for Duke OIT on how to advise users on using ACCESS and allocation credits to jetstream 2 for Duke University members. This can be used for… more
Docs
ACCESS-credits adding-users allocation-management
Intermediate, Advanced
Monthly workshops sponsored by ACCESS on a variety of HPC topics organized by Pittsburgh Supercomputing Center (PSC). Each workshop will be telecast… more
Learning
deep-learning machine-learning neural-networks
Beginner, Intermediate
Purdue University is the home of Anvil, a powerful supercomputer that provides advanced computing capabilities to support a wide range of… more
Docs
Beginner, Intermediate, Advanced
NCSA is the home of Delta, a computing and data resource that balances cutting-edge graphics processor and CPU architectures with a non-POSIX file… more
Docs
Beginner, Intermediate, Advanced
Expanse at SDSC is a cluster designed by Dell and SDSC delivering 5.16 peak petaflops, and offers Composable Systems and Cloud Bursting.
Docs
expanse composable-systems gpu
Beginner, Intermediate, Advanced
Website
affinity-group pegasus ACCESS-website
Beginner, Intermediate, Advanced
This tutorial introduces the use of Containers using the Charliecloud software suite. This tutorial will provide participants with background and… more
Learning
Beginner
This textbook is the first comprehensive treatment of active inference, an integrative perspective on brain, cognition, and behavior used across… more
Learning
ai machine-learning neural-networks
Beginner, Intermediate, Advanced
This is a self guided online course on compilers. The topics covered throughout the course include universal compilers topics like intermediate… more
Learning
optimization parallelization training
Advanced
Mathematical optimization deals with the problem of finding numerically minimums or maximums of a functions. This tutorial provides the Python… more
Learning
Beginner, Intermediate, Advanced
These slides provide an introduction on how Termius and Cursor, two new and freemium apps that use AI to perform more efficient work, can be used for… more
Slides
documentation ai machine-learning
Beginner, Intermediate
Materials from the SAIL meeting (https://aiinstitutes.org/2023/06/21/sail-2023-summit-for-ai-leadership/). A space where AI researchers can learn… more
Learning
access-account ai data-analysis
Beginner, Intermediate, Advanced
**Cursor: The AI-Powered Code Editor**
Cursor is a cutting-edge, AI-first code editor designed to revolutionize the way developers… more
Tool
Beginner, Intermediate
This technology lab contains a set of sessions to help a new user start an AI project on the ACES cluster, a composable accelerator testbed at Texas… more
Docs
Intermediate
The Julia Programming Language is one of the fastest growing software languages for AI/ML development. It writes in manner that's similar to… more
Learning
ai data-analysis machine-learning
Beginner
Purdue University is the home of Anvil, a powerful supercomputer that provides advanced computing capabilities to support a wide range of… more
Website
Beginner, Intermediate, Advanced
The provided text discusses various aspects of Android app development fundamentals. It covers key concepts related to app components, the… more
Website
Beginner, Intermediate
Slides for a tutorial on Machine Learning applications in Engineering and parameter tuning given at the RMACC conference 2019.
Learning
data-analysis machine-learning python
Beginner, Intermediate
Website
resources programming-best-practices
Beginner, Intermediate, Advanced
Astropy is a community-driven package that offers core functionalities needed for astrophysical computations and data analysis. From coordinate… more
Learning
visualization image-processing astrophysics
Intermediate, Advanced
The authoritative book on automated machine learning, which allows practitioners without ML expertise to develop and deploy state-of-the-art machine… more
Learning
ai data-analysis deep-learning
Intermediate, Advanced
A curated list of awesome Jupyter widget packages and projects for building interactive visualizations for Python code
Learning
Beginner, Intermediate, Advanced
An AWS Tutorial for Beginners is a course that teaches the basics of Amazon Web Services (AWS), a cloud computing platform that offers a wide range… more
Video
Beginner, Intermediate
This package lets you easily scrape websites and extract information based on html tags and various other metadata found in the page. It can be… more
Tool
Beginner, Intermediate
What is PyFR and how does it solve fluid flow problems?
PyFR is an open-source Computational Fluid Dynamics (CFD) solver that is based on… more
Tool
finite-element-analysis benchmarking parallelization
Intermediate
The Better Scientific Software (BSSw) project provides a community to collaborate and learn about best practices in scientific software development.… more
Website
community-outreach project-management research-facilitation
Beginner, Intermediate, Advanced
Nextflow is an open-source, domain-specific language and workflow manager designed for the execution and coordination of scientific and data-… more
Docs
cloud-computing parallelization data-management
Beginner, Intermediate
The Biopython Tutorial and Cookbook website is a dedicated online resource for users in the field of computational biology and bioinformatics. It… more
Learning
bioinformatics genomics python
Beginner, Intermediate, Advanced
This tutorial explains how to create an Anaconda Navigator Application (app) for JupyterLab. It is intended for users of Windows, macOS, and Linux… more
Tool
Intermediate
This article provides instructions for building AirSim, an open-source simulator for autonomous vehicles, on Linux. It outlines the steps to build… more
Docs
profiling data-transfer github
Beginner
"These notes are part of the UW Experimental College course on Introductory C Programming. They are based on notes prepared (beginning in Spring… more
Learning
Beginner
Campus Champions foster a dynamic environment for a diverse community of research computing and data professionals sharing knowledge and experience… more
Website
community-outreach professional-development
Beginner, Intermediate, Advanced
The Data-Facing Track of the People Network brings together people from research computing groups, libraries, research institutes, and other… more
Website
data-analysis data-access-protocols data-lifecycle
Beginner, Intermediate, Advanced
Chameleon is an NSF-funded testbed system for Computer Science experimentation. It is designed to be deeply reconfigurable, with a wide variety of… more
Docs
data-sharing data-reproducibility
Beginner, Intermediate, Advanced
Announcements for for users and developers of Charliecloud, which provides lightweight user-defined software stacks for high-performance computing.
Mailing List
Beginner
CHARMM (Chemistry at HARvard Macromolecular Mechanics) is a widely distributed molecular simulation program with a broad array of applications.… more
Learning
charmm molecular-dynamics namd
Beginner, Intermediate
CMake is an open-source tool used to manage the build process in operating systems. This tutorial takes you through how to use CMake from the very… more
Learning
Beginner, Intermediate, Advanced
Containerization is a software development method in which applications are packaged into standard units for development, shipment, and deployment.
Video
Beginner
Goes through in detail on how to build an application that can run on Android and IOS devices, using Qt Creator to develop Qt Quick applications.… more
Website
Intermediate
NVIDIA CUDA Toolkit Documentation: If you are working with GPUs in HPC, the NVIDIA CUDA Toolkit is essential. You can access the CUDA Toolkit… more
Docs
Intermediate, Advanced
learning cybersecurity is crucial for personal protection, safeguarding digital assets, financial security, and national security. It is important… more
Learning
training data-security cybersecurity
Beginner
DAGMan (Directed Acyclic Graph Manager) is a meta-scheduler for HTCondor. It manages dependencies between jobs at a higher level than the HTCondor… more
Tool
Intermediate, Advanced
DARWIN (Delaware Advanced Research Workforce and Innovation Network) is a big data and high performance computing system designed to catalyze… more
Docs
Beginner, Intermediate, Advanced
This webinar series is an orientation to R. We start with an overview of R’s history and place in the larger data science ecosystem. Next, we… more
Video
data-analysis data-science psychology
Beginner
This slices and videos introduced how to use K-Nearest-Neighbors method to impute climate data and how to use Bayesian Spatio-Temporal models in R-… more
Video
allocation-value documentation ai
Intermediate, Advanced
Plots.jl is the most widely used plotting library for the Julia programming language. It's known for being especially powerful in its… more
Tool
Beginner, Intermediate
Data visualization is a critical aspect of data analysis. It allows for a clear and concise representation of data, making it easier for users to… more
Website
Beginner
DeepChem is an open-source library built on TensorFlow and PyTorch. It is helpful in applying machine learning algorithms to molecular data.
Tool
pytorch tensorflow computational-chemistry
Beginner, Intermediate, Advanced
Introductory video about DELTA. Speaker Tim Boerner, Senior Assistant Director, NCSA
video
Beginner, Intermediate, Advanced
As developers, we get excited to think about challenging problems. When you ask us what we are working on, our eyes light up like children in a candy… more
Website
community-outreach professional-development training
Beginner, Intermediate, Advanced
Discover Data Science is all about making connections between prospective students and educational opportunities in an exciting new, hot, and growing… more
Website
data-analysis workforce-development
Beginner
Tableau is a popular and capable software product for creating charts that present data and dashboards that allow you to explore data. It is… more
Video
big-data data-analysis training
Intermediate
Docker allows for containerization of any task - basically a smaller, scalable version of a virtual machine. This is very useful when transferring… more
Tool
documentation cloud-computing deep-learning
Intermediate, Advanced
The Docker container library, commonly known as Docker Hub, is a vast repository that hosts a multitude of pre-configured container images,… more
Tool
A Docker tutorial for beginners is a course that teaches the basics of Docker, a containerization platform that allows you to package your… more
Video
Beginner, Intermediate, Advanced
EasyBuild is a software installation framework that allows administrators to easily build and install software on high-performance computing (HPC)… more
Docs
Intermediate
The purpose of this group is to provide a forum to discuss NIST 800-171 compliance. Participants are encouraged to collaborate and share effective… more
Website
Beginner, Intermediate, Advanced
This code showcases how to work with the header-only nlohmann JSON library for C++. In order to compile, change the extensions from json_test.txt to… more
Learning
Advanced
Some examples for writing Thrust code. To compile, download the CUDA compiler from NVIDIA. This code was tested with CUDA 9.2 but is likely… more
Learning
Intermediate, Advanced
Expanse at SDSC is a cluster designed by Dell and SDSC delivering 5.16 peak petaflops, and offers Composable Systems and Cloud Bursting.
Website
Beginner, Intermediate, Advanced
A tutorial paper that presents a generic message-passing algorithm, the sum-product algorithm, that operates in a factor graph. Following a single,… more
Docs
access-account ai machine-learning
Intermediate
The "Fairness and Machine Learning" book offers a rigorous exploration of fairness in ML and is suitable for researchers, practitioners,… more
Docs
ai data-analysis deep-learning
Intermediate, Advanced
Feed-forward neural networks are a simple type of network that simply rely on data to be "fed-forward" through a series of layers that… more
Website
deep-learning machine-learning neural-networks
Intermediate
Visual Studio Code, commonly known as VSCode, is a popular tool used by programmers worldwide. It serves as a text editor and an Integrated… more
Learning
Intermediate
As LLMs get larger fine-tuning to the full extent can become difficult to train on consumer hardware. Storing and deploying these tuned models can… more
Video
faster optimization performance-tuning
Intermediate, Advanced
This framework will help in scaling Machine Learning/Deep Learning/Artificial Intelligence/Natural Language Processing Models to Web Application… more
Learning
ai deep-learning machine-learning
Intermediate
The official MGH / Harvard tutorial page for FreeSurfer. The FreeSurfer group has provided and designed a series of tutorials for using FreeSurfer… more
Learning
data-analysis image-processing psychology
Beginner, Intermediate
This is the official University of Oxford FSL group lecture page. This includes information on upcoming and past courses (online and in-person), as… more
Learning
data-analysis image-processing psychology
Beginner, Intermediate, Advanced
This course is an introduction to the R programming language and covers the fundamental concepts needed to operate in the R environment. This course… more
Learning
Beginner
Gaussian 16 is a computational chemistry package that is used in predicting molecular properties and understanding molecular behavior at a quantum… more
Tool
gaussian computational-chemistry
Intermediate, Advanced
Below is a link for a book that focuses on how to use "sf" and "terra" packages for GIS computations. As of 5/1/2023, this book… more
Learning
Beginner, Intermediate
In GIS, projections are helpful to take something plotted on a globe and convert it to a flat map that we can print or show on a screen.… more
Learning
Beginner, Intermediate
Often when working with GIS, or spatial data, one encounters the word "datum" and it may require that you choose a "datum" when… more
Learning
Beginner
A couple of resources that:
1.) Presents and defends a git branching workflow for stable collaborative git based projects. ("A… more
Learning
Beginner, Intermediate, Advanced
Globus is a data transfer, sharing, automation, and discovery service used by hundreds of thousands of researchers to manage "big data" at… more
Docs
cloud-storage data-sharing data-management
Beginner, Intermediate, Advanced
GPU training series for scientists, software engineers, and students, with emphasis on Earth science applications.
The content of this… more
Learning
optimization performance-tuning profiling
Beginner
This article provides step-by-step instructions on how to build AirSim, a simulator for autonomous vehicles, on Linux. It includes both Docker and… more
Docs
documentation github github-pages
Beginner, Intermediate
This tutorial is essentially the "hello world" of image recognition and feed-forward neural network (using PyTorch). Using the MNIST… more
Website
ai visualization deep-learning
Intermediate
Documentation and presentation on how to use machine learning and deep learning framework using TensorFlow, Keras and sci-kit learn for Climate and… more
Learning
Intermediate
JSON is a lightweight format for storing and transporting data, for example in a config file. This library is header-only, and has easy-to-read… more
Learning
Intermediate, Advanced
High Performance Computing (HPC) Cluster
Video
Beginner, Intermediate
An introductory guide to High Performance Computing.
Website
Beginner
Horovod is a distributed deep learning training framework. Using horovod, a single-GPU training script can be scaled to train across many GPUs in… more
Tool
deep-learning distributed-computing gpu
Intermediate, Advanced
Hour of Cyberinfrastructure (Hour of CI) is a nationwide campaign to introduce undergraduate and graduate students to cyberinfrastructure and… more
Learning
Beginner
A tutorial entitled "How the Little Jupyter Notebook Became a Web App: Managing Increasing Complexity with nbdev" presented at SciPy 2023… more
Learning
data-sharing data-management-software data-reproducibility
Beginner, Intermediate, Advanced
Learn how to use Rclone to transfer data, specifically from your local drive to the Open Storage Network, vice versa.
Learning
Beginner
HPCwire is a prominent news and information source for the HPC community. Their website offers articles, analysis, and reports on HPC technologies,… more
Website
documentation pytorch data-science
Beginner, Intermediate, Advanced
The following link provides an easy method of implementing Markov Decision Processes (MDP) in the Julia computing language. MDPs are a class of… more
Tool
Intermediate, Advanced
R GIS packages "rgdal", "rgeos", and "maptools" are package set to be archived and no longer supported by end of 2023… more
Docs
Beginner, Intermediate, Advanced
InsideHPC is an informational site offers videos, research papers, articles, and other resources focused on machine learning and quantum computing… more
Website
ai machine-learning community-outreach
Beginner, Intermediate, Advanced
Rocky Linux is an open-source enterprise operating system. It is compatible with Red Hat Enterprise Linux (RHEL). It is a community-driven project… more
Learning
unix-environment software-installation
Beginner
The Stan language is used to specify a (Bayesian) statistical model with an imperative program calculating the log probability density function. Here… more
Docs
data-analysis machine-learning monte-carlo
Beginner, Intermediate
Introduction to the basics of OpenACC.
Slides
Beginner
Open Multi-Processing, is an API designed to simplify the integration of parallelism in software development, particularly for applications running… more
Slides
Intermediate
The tutorial is intended to provide a brief overview of the extensive and broad topic of Parallel Computing. It covers the basics of parallel… more
Learning
Beginner
This website summarizes the notes of Stanford's introductory course on probabilistic graphical models.
It starts from the very basics and… more
Learning
Beginner, Intermediate
This workshop has an introduction to the concepts of visualization followed by hands on exercises. The concepts section has Speaker Notes, and the… more
Learning
visualization documentation training
Beginner
In this tutorial, I present an overview with many examples of the use of Numpy and Pandas for data analysis. Beginners in the field of data analysis… more
Docs
Beginner
Jetstream2 makes cutting-edge high-performance computing and software easy to use for your research regardless of your project’s scale—even if you… more
Website
Beginner, Intermediate, Advanced
Jetstream2 makes cutting-edge high-performance computing and software easy to use for your research regardless of your project’s scale—even if you… more
Docs
Beginner, Intermediate, Advanced
Jetstream2 makes cutting-edge high-performance computing and software easy to use for your research regardless of your project’s scale—even if you… more
Website
Beginner, Intermediate, Advanced
Documentation and research based on the latest NLP text generation detection methods for 2023.
Learning
Intermediate
The following pages are intended to give you a solid foundation in how to use the terminal, to get the computer to do useful work for you. You won… more
Learning
file-systems bash unix-environment
Beginner
A series of interviews with women in the HPC community
Website
science-gateway community-outreach professional-development
Beginner, Intermediate, Advanced
Machine learning is becoming increasingly important in field with large data such as astrophysics. AstroML is a Python module for machine learning… more
Docs
plotting big-data image-processing
Intermediate
The free online book for the mlr3 machine learning framework for R. Gives a comprehensive overview of the package and ecosystem, suitable from… more
Learning
data-analysis machine-learning r
Beginner, Intermediate, Advanced
In the realm of Python-based machine learning, Scikit-Learn stands out as one of the most powerful and versatile tools available. This introductory… more
Learning
Beginner
Bioinformatics Toolbox provides algorithms and apps for Next Generation Sequencing (NGS), microarray analysis, mass spectrometry, and gene ontology.… more
Tool
visualization data-analysis bioinformatics
Beginner, Intermediate, Advanced
MATLAB is a really useful tool for data analysis among other computational work. This tutorial takes you through using MATLAB with other programming… more
Tool
Beginner, Intermediate, Advanced
MDAnalysis is a python based library of tools for the analysis of molecular dynamics simulations. It is able to read and write many popular… more
Tool
computational-chemistry materials-science python
Beginner, Intermediate, Advanced
There is a detailed explanation about communication routines and managing methods of different MPI libraries, as well as several exercises designed… more
Website
Beginner
Metadata is a vital topic in libraries and librarianship, encompassing structured information used for accessing digital resources. The definition of… more
Learning
Intermediate
Links to MD tutorials for beginner's across various simulation platforms.
Learning
Beginner
The listed repository contains code written in C++ to model the flow inside a cavity with a lid moving above from left to right by discretizing… more
Docs
Intermediate
Workshop for beginners and intermediate students in MPI which includes helpful exercises. Open MPI documentation.
Learning
Beginner, Intermediate
CS244N is a renowned natural language processing course offered by Stanford University and taught by Christopher Manning. It covers a wide range of… more
Video
natural-language-processing training workforce-development
Beginner, Intermediate
Self-paced tutorials on high-end computing topics such as parallel computing, multi-core performance, and performance tools. Other related topics… more
Learning
performance-tuning profiling parallelization
Beginner, Intermediate
Self-paced tutorials on high-end computing topics such as parallel computing, multi-core performance, and performance tools. Some of the tutorials… more
Learning
training workforce-development
Beginner, Intermediate, Advanced
Neocortex is a new supercomputing cluster at the Pittsburgh Supercomputing Center (PSC) that features groundbreaking AI hardware from Cerebras… more
Docs
documentation ai deep-learning
Beginner
A comprehensive collection of NERSC developed training and tutorial events, offered on regular schedules. All sessions are archived, including slide… more
Learning
Beginner, Intermediate, Advanced
Making a neural network has never been easier! The following link directs users to the Flux.jl package, the easiest way of programming a neural… more
Tool
ai deep-learning machine-learning
Intermediate, Advanced
Neurodesk provides a containerised data analysis environment to facilitate reproducible analysis of neuroimaging data. Analysis pipelines for… more
Website
psychology containers software-installation
Beginner, Intermediate, Advanced
The Neuroimaging Tools and Resources Collaboratory (NITRC) is a neuroimaging informatics knowledge environment for MR, PET/SPECT, CT, EEG/MEG,… more
Website
data-analysis image-processing data-sharing
Beginner, Intermediate, Advanced
Numba is a Python compiler designed for accelerating numerical and array operations, enabling users to enhance their application's performance… more
Docs
vectorization optimization performance-tuning
Intermediate, Advanced
Numpy is a python package that leverages types and compiled C code to make many math operations in Python efficient. It is especially useful for… more
Tool
documentation big-data data-analysis
Beginner, Intermediate
Upcoming training events and archives of training materials detailing general HPC best practices as well as how to use OLCF resources and services.
Learning
Beginner, Intermediate, Advanced
The official documentation for PyTorch, a machine learning tensor-based framework, and NumPy, which allows for support for ndarrays which is useful… more
Docs
deep-learning neural-networks pytorch
Beginner
VisIt is a prominent open-source, interactive parallel visualization and graphical analysis tool predominantly used for viewing scientific data. Its… more
Docs
visIt novel-accelerators particle-physics
Intermediate, Advanced
The official documentation for Python 3.11.5. Python comes with a lot of features built into the language, so it is worth taking a look as you code.
Docs
This contains documentation for getting started with using OnShape for CAD. OnShape cloud-hosted CAD software that lets you work with others like on… more
Tool
Beginner
OnShape FeatureScripts allow users to create their own features via OnShape's programming language. The user can make these as simple or complex… more
Tool
documentation materials-science particle-physics
Intermediate, Advanced
The Open Storage Network, a national resource available through the XSEDE resource allocation system, is high quality, sustainable, distributed… more
Website
data-management data-retention open-storage-network
Beginner, Intermediate, Advanced
Proxmox Virtual Environment is a hyper-converged infrastructure open-source software. It is a hosted hypervisor that can run operating systems… more
Learning
Beginner
Techniques and support for multithreaded geospatial data processing in GRASS.
Tool
Intermediate
Snakemake is a powerful and versatile workflow management system that simplifies the creation, execution, and management of data analysis pipelines.… more
Docs
documentation data-analysis data-reproducibility
Intermediate, Advanced
pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language… more
Docs
Beginner, Intermediate
These links take you to visualization resources supported by the University of Arizona's HPC visualization consultant (rtdatavis.github.io). The… more
Docs
Beginner
A class from MITOpenCourseware that gives a hands on approach to building scalable and high-performance software systems. Topics include performance… more
Learning
optimization parallelization training
Intermediate, Advanced
This video series provides a holistic understanding of machine learning, covering theory, application, and inner workings of supervised, unsupervised… more
Video
machine-learning programming python
Advanced
Humans cannot always be treated as oracles for collaborative sensing. Robots thus need to maintain beliefs over unknown world states when receiving… more
Docs
Advanced
5 Days of recordings of Python data analysis and visualization training.
Learning
Beginner, Intermediate
Python has become a very popular programming language and software ecosystem for work in Data Science, integrating support for data access, data… more
Video
Intermediate
This is a very barebones introduction to the PyTorch framework used to implement machine learning. This tutorial implements a feed-forward neural… more
Website
deep-learning machine-learning neural-networks
Intermediate
Data augmentation is a crucial step in the pipeline for image classification with deep learning. Albumentations is an extremely versatile Python… more
Tool
Advanced
R for Data Science is a comprehensive resource for individuals looking to harness the power of the R programming language for data analysis,… more
Learning
visualization data-analysis data-science
Beginner, Intermediate, Advanced
A book for researchers who contribute code to R projects: This booklet is the result of my work with the Social Cognition for Social Justice lab. It… more
Learning
software-carpentry workforce-development r
Beginner, Intermediate
Raftlib is an open-source C++ Library that provides a framework for implementing parallel and concurrent data processing pipelines. It is designed… more
Tool
parallelization pthreads openmp
Intermediate, Advanced
This repository contains information about Jupyter Widgets and how they can be used to develop interactive workflows, data dashboards, and web… more
Website
Beginner, Intermediate, Advanced
The daily news clearly shows the increasing threat to safety and privacy of data, personal as well as intellectual property. While the requirements… more
Website
community-outreach cybersecurity
Beginner, Intermediate, Advanced
This course takes through the fundamentals required to get started with reinforcement learning with Python, OpenAI Gym and Stable Baselines. You… more
Video
deep-learning machine-learning tensorflow
Beginner
Representation learning is a fundamental concept in machine learning and artificial intelligence, particularly in the field of deep learning. At its… more
Docs
deep-learning image-processing machine-learning
Intermediate
The NSF-funded ResearchSOC helps make scientific computing resilient to cyberattacks and capable of supporting trustworthy, productive research… more
Website
Beginner, Intermediate, Advanced
Iterative Programming takes place when you can explore your code and play with your objects and functions without needing to save, recompile, or… more
Learning
Beginner, Intermediate
An ongoing collection of RSE training material, workshops, and resources. We are compiling this list as a starting point for future activities. We… more
Website
astrophysics data-science novel-accelerators
Beginner, Intermediate, Advanced
Active inference is an emerging study field in machine learning and computational neuroscience. This website in particular introduces "active… more
Website
Beginner, Intermediate, Advanced
Rocky Mountain Advanced Computing Consortium Website
Website
Beginner, Intermediate, Advanced
Resources and User Guide available at Rockfish
Docs
Intermediate
WarpX is an advanced particle-in-cell code used to model particle accelerators, which needs to be run on HPC. This website contains the tutorial on… more
Docs
github github-pages novel-accelerators
Intermediate
Samtools is a suite of programs for interacting with high-throughput sequencing data, especially in the SAM/BAM format. It offers various utilities… more
Docs
documentation data-analysis bioinformatics
Beginner, Intermediate, Advanced
Use this template to turn any science gateway workflow into a web application!
Learning
data-analysis github astrophysics
Beginner
Scikit-learn is free software machine learning library for Python. It has a variety of features you can use on data, from linear regression… more
Tool
Beginner, Intermediate
Comprehensive tutorials and lecture notes covering various aspects of scientific computing using Python and Scipy.
Learning
visualization data-analysis machine-learning
Beginner, Intermediate
VSCode is a popular IDE that runs on Windows, MacOS, and Linux. This tutorial will explain how to get set up with VSCode to code in Python. It will… more
Learning
These instructions were executed on the FASTER and Grace cluster computing facilities at Texas A&M University. However, the process can be… more
Learning
Advanced
Singularity/Apptainer is a free and open-source container platform that allows users to build and run containers on high performance computing… more
Docs
Intermediate
Slurm is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters. Slurm… more
Website
cluster-management cluster-support slurm
Intermediate, Advanced
Introduction to the Slurm Workload Manager for users and system administrators, plus some material for Slurm programmers.
Learning
administering-hpc cluster-management hpc-cluster-architecture
Beginner
Differential equations, the backbone of countless physical phenomena, have traditionally been solved using numerical methods or analytical techniques… more
Learning
Beginner, Intermediate
Spack is a package manager for supercomputers that can help administrators install scientific software and libraries for multiple complex software… more
Docs
Intermediate
TensorFlow is a powerful framework for Deep Learning, developed by google. This specifically is their python package, which is easy to use and can be… more
Tool
documentation faster tensorflow
Intermediate, Advanced
**Termius: The Modern SSH Client for 2023**
Termius is the future-facing SSH client that's redefining remote server access in… more
Website
cloud-computing data-sharing data-transfer
Beginner, Intermediate
Training Resources and Courses offered by Texas A&M's Research Computing Group
Learning
Beginner, Intermediate, Advanced
Pandas is one of the most essential Python libraries for data analysis and manipulation. It provides high-performance, easy-to-use data structures,… more
Docs
Beginner, Intermediate
This video by the YouTube channel 3Blue1Brown provides a very simplified introduction to the theory behind neural networks. This tutorial is perfect… more
Video
Beginner
Thrust is a CUDA library that optimizes parallelization on the GPU for you. The Thrust tutorial is great for beginners. The documentation is helpful… more
Learning
Intermediate, Advanced
A walkthrough (with a Google Colab link) on how to implement your own LSTM to observe time-dependent behavior.
Website
ai deep-learning machine-learning
Advanced
Trinity is one of the most popular tool to assemble transcripts from RNA-Seq short reads. In this tutorial, we will cover the basic usage of Trinity… more
Learning
Beginner
The mission of Trusted CI is to lead in the development of an NSF Cybersecurity Ecosystem with the workforce, knowledge, processes, and… more
Website
Beginner, Intermediate, Advanced
Very helpful list of external resources from Trusted CI
Website
Beginner, Intermediate, Advanced
The following link elaborates the usage of OpenMP API and its related syntax. There are also several exercises available for learners to help them… more
Website
Beginner
Unix is incredibly common and useful. This website provides all the common commands and explanations for one to get started with a unix system.
Website
Beginner
With the recent uprising of LLM's many business are looking at way to adopt these LLMs and fine-tuning these models on specfic data sets to… more
Learning
Beginner, Intermediate
Introductory training materials for working on the UNIX command line.
Learning
Beginner
Windows Subsystem for Linux (WSL) provides a Linux environment for Windows users to access HPC resources fast and efficiently.
Tool
Beginner
A tutorial on the effective use of Dask on HPC resources. The four-hour tutorial will be split into two sections, with early topics focused on novice… more
Learning
Beginner, Intermediate
It's not uncommon to see beautiful visualizations in HPC center galleries, but the majority of these are either rendered off the HPC or created… more
Docs
Beginner, Intermediate
Warewulf is an operating system provisioning platform for Linux that is designed to produce secure, scalable, turnkey cluster deployments that… more
Website
documentation administering-hpc distributed-computing
Beginner, Intermediate
Weka is a collection of machine learning algorithms for data mining tasks. It contains tools for data preparation, classification, regression,… more
Tool
big-data data-analysis machine-learning
Intermediate, Advanced
This reading will explain what a long short-term memory neural network is. LSTMs are a type of neural networks that rely on both past and present… more
Learning
ai deep-learning machine-learning
Intermediate, Advanced
This article discusses the importance of fairness in machine learning and provides insights into how Google approaches fairness in their ML models.… more
Docs
ai visualization data-analysis
Intermediate
A VPN, or Virtual Private Network, is a technology that creates a secure tunnel between your device and a VPN server. This tunnel encrypts all of… more
Website
Beginner
The Why & How seminar series is designed to introduce research assistants, graduate students, and postdoctoral and clinical fellows – really,… more
Learning
Beginner, Intermediate, Advanced
This is a resource for researchers and students looking to on-board onto the c3ddb cluster at MGHPCC. In the code section, there are example job… more
Learning
Beginner
Through collaboration and networking, WHPC strives to bring together women in HPC and technical computing while encouraging women to engage in… more
Website
Beginner
CAC summer student employee Jeff Lantz describes his experiences in running the WRF weather forecasting application in the public cloud. He compares… more
Video
Advanced