Mark, the Student Segmentation Analyst at MDFT Pro, is building a K-means clustering model to group students based on their learning characteristics and course performance metrics. MDFT Pro, a well-known training agency, has collected diverse data about students including numerical scores (assessment grades, completion rates), categorical information (course levels, learning preferences), and text feedback (written evaluations). Before training the clustering model, Mark needs to ensure all the input features are properly formatted and compatible with the K-means algorithm’s mathematical requirements for calculating distances between student profiles.
What data type requirement must all features meet when building a K-means clustering model for student segmentation?
Choose the correct answer from the options below.
Explanations for each answer: