NPTEL Data Science for Engineers Assignment 8 Answers 2023 ! In this article we will discuss about the answers for Week 7 assignment of Data science for Engineers. Consider these answers as reference only. I am confident in providing these answers. Then Come with us until the last of page to know more about week 8 Assignment.
Also Read: Nptel Data Science Week 7 Answers
About Nptel
NPTEL (National Programme on Technology Enhanced Learning) is an online learning platform created by the Indian Institutes of Technology (IITs) and the Indian Institute of Science (IISc). It provides free online courses and certification in various fields of engineering, science, humanities, and management.
NPTEL offers courses in various formats such as video lectures, assignments, quizzes, and discussion forums. The courses are developed by faculty members of IITs and IISc, and the platform has collaborations with other institutions and organizations around the world.
NPTEL certification is recognized by various industries and organizations in India and abroad, and the platform has over 1,600 courses with more than 250 million total enrollments. NPTEL also provides online certification exams and has collaborated with edX to offer joint certificates.
NPTEL Data Science For Engineers Assignment 8 Answers 2023
Consider the dataset “USArrests.csv”. Answer questions 1 to 4 based on the information given below:
This data set contains statistics, in arrests per 100,000 residents for assault, murder, and rape in each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.
Set the column “States” as index of the data frame while reading the data
• Set the random number generator to set.seed(123)
• Normalize the data using scale function and build the K-means algorithm with the given conditions:
– number of clusters = 4
– nstart=20
Q1. According to the built model, the within cluster sum of squares for each cluster is ______________ (the order of values in each option could be different):-
a. 8.316061 11.952463 16.212213 19.922437
b. 7.453059 12.158682 13.212213 21.158766
c. 8.316061 13.952463 15.212213 19.922437
d. None of the above
Answer: [ a ]
Q2. According to the built model, the size of each cluster is (the order of values _______________ in each option could be different):-
a. 13 13 7 14
b. 11 18 25 24
c. 8 13 16 13
d. None of the above
Answer: [ c ]
Q3. The Between Cluster Sum-of-Squares (BCSS) value of the built K-means model is _______________ (Choose the appropriate range)
a. 100 - 200
b. 200 - 300
c. 300 – 350
d. None of the above
Answer: [ a ]
Q4. The Total Sum-of-Squares value of the built k-means model is _________________
(Choose the appropriate range)
a. 100 - 200
b. 200 - 300
c. 300 – 350
d. None of the above
Answer: [ a ]
Q5. Which of the statement is INCORRECT about KNN algorithm?
a. KNN works ONLY for binary classification problems
b. If k=1, then the algorithm is simply called the nearest neighbour algorithm
c. Number of neighbours (K) will influence classification output
d. None of the above
Answer: [ a ] KNN works ONLY for binary classification problems
Q6. K means clustering algorithm clusters the data points based on:-
a. dependent and independent variables
b. the eigen values
c. distance between the points and a cluster centre
d. None of the above
Answer: [ c ] distance between the points and a cluster centre
Q7. The method / metric which is NOT useful to determine the optimal number of clusters
in unsupervised clustering algorithms is
a. Scatter plot
b. Elbow method
c. Dendrogram
d. None of the above
Answer: [ a ]
Q8. The unsupervised learning algorithm which aims to partition n observations into k clusters in
which each observation belongs to the cluster with the nearest centroid is
a. Hierarchical clustering
b. K-means clustering
c. KNN
d. None of the above
Answer: [ b ]
Also Read: Nptel Data Science Week 7 Answers