Sign In
Not register? Register Now!
Pages:
5 pages/≈1375 words
Sources:
No Sources
Style:
MLA
Subject:
IT & Computer Science
Type:
Coursework
Language:
English (U.S.)
Document:
MS Word
Date:
Total cost:
$ 36.45
Topic:

Industrial Data Science and Machine Learning Research

Coursework Instructions:

At the end of each lecture, students will prepare a short (~2-3 pages, 12 point single space) report that addresses as many of the following questions as are relevant:
 Describe the market sector or sub-space covered in this lecture.
 What data science related skills and technologies are commonly used in this sector?
 How are data and computing related methods used in typical workflows in this sector? Illustrate with an example.
 What are the data science related challenges one might encounter in this domain?
 What do you find interesting about the nature of data science opportunities in this domain?
 Please discuss some of the characteristics of industrial data science problems alluded to in the talk and how they differ qualitatively from data science problems in other domains. (20 points of the 80 C+R points in the rubric)
Also, answer the following multiple-choice questions: You can list the quetsion number and the letter corresponding to the correct choice as Answer in your report, (2x5 = 10 pts of the 80 C+R points in the rubric)
Q1: Based on the lecture, what are the basic components of Industrial Data Science:
1. Physics/Engineering-based Models
2. Empirical, Heuristic rules & insights
3. Data-Driven techniques – machine learning, statistics, optimization, etc.
4. Cross-Dependencies and model validation
A. 1,2,3 B 1,3,4 C. 2,3,4 D. All of the choices
Q2: Based on the lecture, select all the correct statements:
1. Industrial Data Science is defined as the outcome-oriented application of mathematical & physics-based analysis & models to real-world problems in industrial operations
2. Business optimization by linking to financial & customer data is a part of Industrial Operations
3. There are many events to train data-driven models to detect anomalies and predict future events
4. User Acceptance Testing is a component of iDS framework
5. Data is often well collected as by-product of industrial systems, and readily fits the goal of enabling data-driven insights
A. 1,2,3 B. 1,2,4 C. 1,2,3,4 D. 1,3,4,5 E. All of the choices
Q3: Based on the lecture, what sort of data would be available from the asset lifecycles:
1. Design Data
2. Bill of Materials (BOM) Data
3. OEM Specifications
4. Operations Data
5. Alarms, Failures Data
A. 1,3,4,5 B. 1,2,3,4 C. 2,3,4,5 D. All of the choices
Q4. Based on the lecture, select all the correct statements about Time Series Data:
1. Time Series data is a sequence of data points, measured typically at successive points in time spaced at uniform time intervals
2. Event data is a type of Time Series data
3. The relationship to timestamp has to be maintained at all times in Time Series data
4. You CANNOT shuffle time series observations and simply pick a random sample
5. Each time series observation is NOT independent of its neighbor
A. 1,2,3,4 B. 1,3,4,5 C. 1,3,4 D. 1,3,5 E. All of the choices
Q5. Based on the lecture, there are certain aspects that can determine whether a DS project can succeed or fail, select all the correct statements from the following:
1. DS initiatives often fail if the stakeholders are ambiguous about their problems
2. DS initiatives fail if the benefits are not significant to the bottom line
3. DS initiatives fail if critical domain knowledge is not available in DS team
4. DS projects succeed if the problem has been clearly defined, and solution techniques are determined after defining the problem and exploring the data
A. 1,2,3 B. 1,2,4 C. 1,3,4 D. 2,3,4 E. All of the choices

Coursework Sample Content Preview:
[Author’s Name]
[Professor’s Name]
EAS504
19 June 2022
Lecture Report
Market Sector
Ram Narasimhan, a UB graduate and a principal data scientist at GE Digital Services, explores the emerging but fast-growing sector of Industrial Data Science (iDS) in lecture 3. The primary difference between “traditional” Data Science and Industrial Data Science is the nature of problems addressed through machine learning and other analytics approaches. iDS involve coming up with solutions to real-world problems in industrial operations or improving the performance of industrial operations through leveraging mathematical and physics-based analysis & models. Moreover, this sector encompasses developing tools and processes that others can continually use at scale. Examples of industrial operations problems that iDS addresses include how companies can achieve higher equipment uptime, how organizations can lower their maintenance costs, and contract management issues, among others.
Key Skills and Technologies
Statistical Analysis
Theoretical and applied statistical analysis is a fundamental skill in Industrial Data Science because it contributes to understanding the significance of industry-related experiments and whether industrial trials were designed and conducted correctly.
Machine Learning and Artificial Intelligence
ML and AI technologies are essential in routine industrial tasks for diagnostic, predictive, and prescriptive analytics. These technologies help make industrial tasks more automated, consistent, accurate, and reliable.
Sensor and Signal Processing
Sensor technologies today generate precise and predictive data. Moreover, sensor machines are also becoming self-aware and can utilize preconfigured optimal data to self-compare with their environment. Therefore, such technologies can self-diagnose.
Image Processing
Image processing is also a significant skill in IDS because much data on the industrial internet is formatted as images. Excellent examples in healthcare include X-Rays and MRI scans. Other types of images include inspection images of equipment which can help professionals predict when a piece of equipment will fail, which helps in scheduling maintenance, repairs, and replacements, thus reducing costs.
Use of Data and Computing-related methods in iDS Workflows
Data and computing-related methods are essential in both the development and deployment stages. However, in the development phase, the data scientists use large volumes of data, including historical and context data, while utilizing several computing methods to develop the model. Once the model is complete, it is deployed, but less volumes of data and computing are still essential. However, to effectively achieve the business results, the organization must adjust to accommodate the model’s changes. However, the deployment of the model and the adjustment of typical workflows do not necessarily mean the end of the process. A new model version can still be developed, deployed, and improved upon through various forms of learning such as descriptive, predictive, and prescriptive and using data and computing-related methods. Workflows are further readjusted to ensure that business results are achieved.
Updated on
Get the Whole Paper!
Not exactly what you need?
Do you need a custom essay? Order right now:

You Might Also Like Other Topics Related to scientific essays:

HIRE A WRITER FROM $11.95 / PAGE
ORDER WITH 15% DISCOUNT!