Explore the fundamental concepts of Non-negative Matrix Factorization (NMF) with this easy quiz. Designed for learners seeking to understand NMF basics, key properties, applications, and core terminology in data analysis and machine learning.
What is the main goal of Non-negative Matrix Factorization when applied to a dataset in matrix form?
Explanation: NMF aims to break down a matrix into two or more smaller matrices with only non-negative entries, revealing useful features or patterns. Increasing matrix size is not its purpose, and creating negative matrices goes against the non-negative requirement. Randomly shuffling values is unrelated to the factorization process.
Which type of matrix can be factorized using Non-negative Matrix Factorization?
Explanation: NMF is specifically designed for matrices that contain only non-negative (zero or positive) values, which is essential for its mathematical foundation. While large or small values do not matter as long as they are non-negative, matrices with negatives or all-zero entries are not typical for NMF. Matrices with just zeros would not reveal meaningful patterns.
After applying NMF to an m×n data matrix V, what are the typical names of the resulting two factor matrices?
Explanation: The standard notation for the factor matrices in NMF is W and H, where V is approximately equal to W multiplied by H. A and B, Q and R, and C and D are not conventional for NMF, though such symbols could be used in other contexts or factorizations.
How does Non-negative Matrix Factorization help in feature extraction for text data analysis?
Explanation: NMF is valued for generating parts-based and easily interpretable representations, which is particularly useful for understanding topics in text analysis. It does not encrypt data or remove duplicates directly, and sorting rows alphabetically is unrelated to its function.
What key constraint is enforced in Non-negative Matrix Factorization on all the entries of the factorized matrices?
Explanation: NMF strictly requires that every entry in both factorized matrices is zero or a positive number. Negatives are not allowed, so 'all entries must be negative' is incorrect. The entries do not have to sum to one nor be integers, making those options wrong as well.
Why is Non-negative Matrix Factorization commonly used in image processing tasks?
Explanation: NMF helps in revealing meaningful image components, such as separating faces or objects, due to its non-negative constraint. It does not inherently blur backgrounds, remove zero pixels, or invert image colors, which are separate image processing tasks.
In NMF, selecting the rank (number of components) primarily affects which aspect of the resulting factorization?
Explanation: Choosing the rank in NMF determines how many features or components will be extracted from the data. It does not affect entry signs, as all must remain non-negative. Running time might increase with higher rank, but the main influence is on feature quantity, not the original data size.
Why is the non-negativity constraint considered important in many NMF applications?
Explanation: Non-negativity tends to yield features that correspond to additive, understandable parts in real-world scenarios, such as objects or topics. While it can simplify some mathematics, that's not the main reason for its importance. NMF does not require square matrices, nor does non-negativity guarantee minimal computation time.
What is a key difference between Non-negative Matrix Factorization (NMF) and Principal Component Analysis (PCA)?
Explanation: A fundamental difference is that NMF restricts all matrix entries to zero or positive values, unlike PCA where negative values are possible. Both can be applied to various data types besides images, and performance in clustering depends on the problem, making the other options inaccurate.
Which approach is commonly used to initialize the factor matrices in NMF algorithms?
Explanation: Random non-negative initialization is frequently used to begin the iterative process of NMF, helping to avoid local minima. Negative or zero-only initializations are not suitable due to the non-negative constraint and computational limitations. Prime number initialization is not a standard practice.