QUIZ
1. A 2D rotation transform requires two parameters to be fully specified
True
False
2. SIFT is a rotation invariant descriptor
True
False
If we initialize the k-means clustering algorithm with the same number of clusters but different starting positions for the centers, the algorithm will always converge to the same solution
True
False
4. Corners are the only type of keypoints that can be used in images
True
False
3. SIFT uses histogram of image intensities as a descriptor
True
False
Given a set of 3 images as shown in next figure, finding and labeling the image in the center as ”containing cat” is considered a detection task in recognition
True
False
We need labels when performing clustering
True
False
9. When clustering an image using color values, regions belonging to same cluster needs to be adjacent
True
False
We need class labels when performing regression
True
False
11. Viola Jones can only be used for training a frontal face detector
True
False
10. K means will always result in the clusters independent of initialization of cluster centers
True
False
Being a template based method, Viola Jones can be also very suitable for deformable objects
True
False
In the K-means clustering algorithm, after convergence the distortion is minimized
True
False
13. Features extracted from images for the purpose of image matching need not to be distinctive
True
False
Given an image containing a cat, automatically finding the contour of the cat in the image is considered an object detection task.
True
False
One can make a visual descriptor rotationally invariant by assigning orientations to the key points and then rotating the patch to a canonical orientation. In SIFT this is done by constructing Histograms of Gradients in a neighborhood around each pixel in the image and assigning the largest bin as the corresponding direction of the keypoint
True
False
A Gaussian pyramid is a hierarchy of images. The bottom layer is an input image. The next layer is obtained by blurring the image in the previous layer and upsampling it, and so on. They are used in many computer vision algorithms such as SIFT
True
False
18. Estimating the price of a house is an example of a classification problem
True
False
In general we need more manual work when building a labelled dataset for semantic image segmentation compared to the case of image classification
True
False
Assume that while training a machine learning the training loss at an iteration was L = 90 and the weight vector was w=[■(1&3&-10)]^T. At a later iteration the loss became L = 87 and w=[■(1&-5&10)]^T. Consider we are using a validation set for which the validation losses at the two iterations are 150 and 160 respectively, the second weight vector would be preferred from a validation point of view
True
False
Assume that while training a machine learning the training loss at an iteration was L = 90 and the weight vector was w=[■(1&3&-10)]^T. At a later iteration the loss became L = 87 and w=[■(1&-5&10)]^T. From a training point of view the first weight vector is better
True
False
22. When we have less data and the model complexity stays the same, overfitting is more likely.
True
False
26. It is always better to use a linear classifier as opposed to a more complex classifier
True
False
27. In building an ensemble classifier it is always good to have classifiers that are dependent in terms of their classification outputs or scores
True
False
30. Assume the validation loss at successive iterations of the gradient descent method is as follows: 100, 90, 82, 80, 65, 70, 50
True
False
29. In a well-trained convolutional neural networks for classifying images the first few layers correspond to filters capturing bigger parts of the objects trained on while later layers represents simpler patterns such as edges and corners
True
False
31. If we use a simple stopping criterion to stop the training then the algorithm should stop when it sees a validation of 70
True
False
32. A good strategy in initializing weights in a neural network is to initialize them all as zeros
True
False
Bag of words feature matching has a main advantage of taking into account spatial feature arrangement when matching images
True
False
36. Given an image containing a cat, automatically labelling the image as ”containing cat” (without specifying where the cat exists in the image) is considered an object detection task
True
False
35. Bag of words feature matching does not take into account spatial feature arrangement when matching images
True
False
37. One can make a visual descriptor rotationally invariant by assigning orientations to the key points and then rotating the patch to a canonical orientation. In SIFT this is done by constructing Histograms of Gradients in a neighborhood around the feature point and assigning the largest bin as the corresponding direction of the keypoint. Later, all detected features are rotated so that the corresponding orientations are vertically aligned
True
False
In regular shallow neural networks one needs to use a non-linear activation function between layers while in deep neural networks this is not needed
True
False
40. In regression, the predicted output is discrete, this is in contrast to classification
True
False
38. One can make a visual descriptor rotationally invariant by assigning orientations to the key points and then rotating the patch to a canonical orientation. In SIFT this is done by constructing Histograms of Gradients in a neighborhood around each pixel in the image and assigning the largest bin as the corresponding direction of the keypoint. Later, all detected features are rotated so that the corresponding orientations are vertically aligned.
True
False
In mean shift clustering algorithm we find cluster centers as the modes of the probability density function of input features
True
False
42. Object localization is different than object detection because in object localization we could have multiple instances of the target object
True
False
45. Some of the reasons we use convnets instead of fully connected layers in images are computational efficiency (sharing of weights) and preserving spatial structure
True
False
We can solve the object detection problem as a regression problem to find all boxes of objects in images
True
False
The main idea behind resnet blocks is to add shortcuts in the network blocks so that we can learn identity mapping in the worst case scenario. This helps learning even when we increase number of layers
True
False
48. In the case of L2 regularization we have two terms in the loss function: a data term and a regularization term. The regularization term is multiplied by a parameter, let’s call it “lambda”. If we increase lambda we are risking more overfitting
True
False
50. Single shot detection (SSD) and You-only-look-once (YOLO) detection algorithms operate on regions extracted from a separate region proposal network (separate from the network that performs detection
True
False
53. The goal of the L2 loss in the following architecture is to predict the location of the object of interest
True
False
In designing cascaded object detection systems it is usually the case that the first detector in the cascade has high recall and possibly high false positive rate
True
False
56. Random forest classifier is an example of an ensemble classifier methods
True
False
If we want to build a neural network estimating 4 joint positions on an image, one possible way is to formulate it as a regression problem of x,y locations of the joints and have an 8 output unit network
True
False
57. Running a detector through an image results in many possible overlapping boxes with different detection scores, a popular method to select the best detection window(s) is called Non-maxima suppression
True
False
A condition for a point to be a corner feature in an 2-D image is that the two eigen values of the M matrix (of gradients) are small
True
False
60. Using gradient-based feature descriptors guarantees invariance against linear illumination changes
True
False
Estimating the house of a price is an example of a classification problem
True
False
61. The goal of the L2 loss in the following architecture is to predict the location of the object and the object type
True
False
In SSD/Yolo object detector we reply on a region proposal method/network
True
False
65. A condition for a point to be a corner feature in an 2-D image is that there is a large intensity gradient along the X, Y and depth directions of the image
True
False
The idea of the inception block is to test various sizes of convolution filters
True
False
Using gradient-based feature descriptors guarantees invariance against non-linear illumination changes
True
False
The goal of the L2 loss in the following architecture is to predict the location of the object of interest only
True
False
71. In the basic gradient descent optimization algorithm, we sample a batch of images in each iteration and update the weights accordingly
True
False
70. The main idea behind Resnet blocks is to add shortcuts in the network blocks so that we can learn identity mapping in the worst-case scenario. This helps learning even when we increase the number of layers.
True
False
Suppose a dataset is linearly separable, a logistic regressor with regularization parameter λ > 0 is guaranteed to separate the data
True
False
{"name":"QUIZ", "url":"https://www.quiz-maker.com/QPREVIEW","txt":"1. A 2D rotation transform requires two parameters to be fully specified, 2. SIFT is a rotation invariant descriptor, If we initialize the k-means clustering algorithm with the same number of clusters but different starting positions for the centers, the algorithm will always converge to the same solution","img":"https://www.quiz-maker.com/3012/images/ogquiz.png"}