Skip to main content
openondemand.org
Run OOD
Administer OOD
Get Involved
Support
Log in
OOD Primary Menu
Affinity Groups
Knowledge Base
People
Events
Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences
Submission navigation links for Project
‹
Previous submission
Next submission
›
Submission information
Submission Number:
134
Submission ID:
237
Submission UUID:
56e75a52-dd01-49dd-bb82-8616ea97d9f3
Submission URI:
/form/project
Created:
Thu, 01/13/2022 - 11:07
Completed:
Thu, 01/13/2022 - 11:07
Changed:
Wed, 07/06/2022 - 15:09
Remote IP address:
192.112.102.251
Submitted by:
Gerald Kruse
Language:
English
Is draft:
No
Webform:
Project
Received Sent
0
Accept and Publish Sent
0
Project Title
Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences
Program
CAREERS
Project Image
{Empty}
Tags
cluster-management (495), hadoop (12), software-installation (211), unix-environment (60)
Status
Halted
Project Leader
Project Leader
Gerald Kruse
Email
kruse@juniata.edu
Project Personnel
Mentor(s)
{Empty}
Student-facilitator(s)
{Empty}
Mentee(s)
{Empty}
Project Information
Project Description
Our Data Science high-performance cluster was delivered in Jan 2020. It is a Cloudseek 1000 from PSSCLabs.
Unfortunately, Covid impacted our efforts to configure it for our Data Science courses (https://www.juniata.edu/academics/departments/data-science/curriculum.php). At Juniata, we offer a Major (our "Program of Emphasis"), a minor (our "Secondary Emphasis"), and an online graduate degree in Data Science. We've been able to get by, but with a Big Data course coming available, we need to configure this system. We would like funding for one of our students to work on this project. We have the name of a possible technical mentor, or at least someone who will need to be consulted.
It's been a challenge to get this cluster operational, and we would really appreciate any assistance.
Project Information Subsection
Project Deliverables
{Empty}
Project Deliverables
{Empty}
Student Research Computing Facilitator Profile
{Empty}
Mentee Research Computing Profile
{Empty}
Student Facilitator Programming Skill Level
{Empty}
Mentee Programming Skill Level
{Empty}
Project Institution
{Empty}
Project Address
{Empty}
Anchor Institution
CR-Penn State
Preferred Start Date
{Empty}
Start as soon as possible.
No
Project Urgency
Already behind3Start date is flexible
Expected Project Duration (in months)
{Empty}
Launch Presentation
{Empty}
Launch Presentation Date
{Empty}
Wrap Presentation
{Empty}
Wrap Presentation Date
{Empty}
Project Milestones
{Empty}
Github Contributions
{Empty}
Planned Portal Contributions (if any)
{Empty}
Planned Publications (if any)
{Empty}
What will the student learn?
{Empty}
What will the mentee learn?
{Empty}
What will the Cyberteam program learn from this project?
{Empty}
HPC resources needed to complete this project?
{Empty}
Notes
{Empty}
Final Report
What is the impact on the development of the principal discipline(s) of the project?
{Empty}
What is the impact on other disciplines?
{Empty}
Is there an impact physical resources that form infrastructure?
{Empty}
Is there an impact on the development of human resources for research computing?
{Empty}
Is there an impact on institutional resources that form infrastructure?
{Empty}
Is there an impact on information resources that form infrastructure?
{Empty}
Is there an impact on technology transfer?
{Empty}
Is there an impact on society beyond science and technology?
{Empty}
Lessons Learned
{Empty}
Overall results
{Empty}