Hey Folks, this article featured the NPTEL Data Science for Engineers Week 8 Assignment 2025. The Previous assignments are given with 100% correct answers. So, this week's answers are 100% correct. If you didn't complete the Week 7 assignment, the link is provided below.

Also Read: NPTEL Data Science for Engineers Week 7 Assignment Answers 2025

For Faster Updates , Join us on Telegram 👉 CLICK HERE

NPTEL Data Science for Engineers Week 8 Assignment 2025

In this Week 8, the following topics are discussed:

K- Nearest Neighbours (kNN)
kNN implementation through R
Understanding Datasets

Last Date: 19-03-2025

Consider the dataset “USArrests.csv”. Answer questions 1 to 4 based on the information given below:
This data set contains statistics, in arrests per 100,000 residents for assault, murder, and rape in each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.

Set the column “States” as index of the data frame while reading the data
Set the random number generator to set.seed(123)
Normalize the data using scale function and build the K-means algorithm with the given conditions:
– number of clusters = 4
– nstart=20

Q1. According to the built model, the within cluster sum of squares for each cluster is __________ (the order of values in each option could be different):-

A. 8.316061 11.952463 16.212213 19.922437
B. 7.453059 12.158682 13.212213 21.158766
C. 8.316061 13.952463 15.212213 19.922437
D. None of the above

Answer: [ a ] 8.316061 11.952463 16.212213 19.922437

Q2. According to the built model, the size of each cluster is _______ (the order of values in each option could be different):-

A. 13 13 7 14
B. 11 18 25 24
C. 8 13 16 13
D. None of the above

Answer: [ C ] 8 13 16 13

Q3. The Between Cluster Sum-of-Squares (BCSS) value of the built K-means model is _______ (Choose the appropriate range)

A. 100 - 200
B. 200 - 300
C. 300 – 350
D. None of the above

Answer: [ A ] 100-200

Q4. The Total Sum-of-Squares value of the built k-means model is ______ (Choose the appropriate range)

A. 100 - 200
B. 200 - 300
C. 300 – 350
D. None of the above

Answer: [ A ] 100-200

Q5. Which of the statement is INCORRECT about KNN algorithm?

A. KNN works ONLY for binary classification problems
B. If k=1, then the algorithm is simply called the nearest neighbour algorithm
C. Number of neighbours (K) will influence classification output
D. None of the above

Answer:[ A ] KNN works ONLY for binary classification problems

Q6. K means clustering algorithm clusters the data points based on:-

A. dependent and independent variables
B. the eigen values
C. distance between the points and a cluster centre
D. None of the above

Answer: [ C ] distance between the points and a cluster centre

Q7. The method / metric which is NOT useful to determine the optimal number of clusters in unsupervised clustering algorithms is

A. Scatter plot
B. Elbow method
C. Dendrogram
D. None of the above

Answer: [ A ] Scatter Plot

Q8. The unsupervised learning algorithm, which aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest centroid is

A. Hierarchical clustering
B. K-means clustering
C. KNN
D. None of the above

Answer: [ B ] K-means Clustering

Conclusion

Join us on Telegram 👉 CLICK HERE

NPTEL Data Science for Engineers Week 8 Assignment 2025

NPTEL Data Science for Engineers Week 8 Assignment 2025

Conclusion

Contact Form