The GATE 2026 Data Science and Artificial Intelligence (DA) Question Paper with Solution PDF will be available here following the completion of the exam. The GATE DA paper is scheduled for February 15, 2026, in the Afternoon Session (AN) from 2:30 PM to 5:30 PM.

Based on trends since its introduction in 2024, the GATE DA paper is generally expected to be moderate to hard. Candidates aiming for top-tier IITs or specialized data science programs should target a score between 60–75+ marks. Achieving this typically requires a strategic attempt of 42–50 questions with high accuracy, given the significant weightage of Machine Learning and Probability & Statistics.

GATE 2026 Data Science and Artificial Intelligence (DA) Question Paper with Solution PDF

GATE 2026 Data Science and Artificial Intelligence (DA) Question Paper download iconDownload Check Solutions

Question 1:

"If his latest movie had been a commercial success, the actor would have made enough money to sponsor his next movie." Based only on the above sentence, which one is TRUE?

  • (A) The actor will certainly sponsor his next movie
  • (B) The actor made enough money from his latest movie
  • (C) His latest movie wasn't commercially successful
  • (D) His latest movie was a commercial success

Question 2:

P, Q and R are three points on a circle of radius 10cm with O as its center. PQ = RQ and \(\angle\)PQR = 45\(^\circ\) the area of the shaded region PQRO is __________ cm\(^2\).



  • (A) 100
  • (B) 50
  • (C) 25\(\sqrt{2}\)
  • (D) 50\(\sqrt{2}\)

Question 3:

Verbosity : Brevity :: Insolence : _______

  • (A) Solace
  • (B) Respect
  • (C) Innocence
  • (D) Wealth

Question 4:

My friend and I parted _____ the door _____ the cabin that I had rented _____ the night"

  • (A) for, at, of
  • (B) In, of, for
  • (C) of, for, in
  • (D) at, of, for

Question 5:

The product of the digits of a three-digit number is 70. The sum of the digits of the three-digit number is:

  • (A) 16
  • (B) 14
  • (C) 12
  • (D) 18

Question 6:

Consider two distinct positive numbers m, n with m \(>\) n. Let x = n\(^{log_n m}\), y = m\(^{log_m n}\). The relation between x and y is-

  • (A) x \(>\) y
  • (B) x = y
  • (C) x = log\(_{10}\) y
  • (D) x \(<\) y

Question 7:

Five integers are picked from 0 to 20 with possible representation such that their mean is 12, median is 18 and they have single mode of 20. Ignoring the permutation, the number of ways to pick these five integers is ____.

  • (A) 2
  • (B) 1
  • (C) 3
  • (D) 0

Question 8:

Rishi and Swathi are students of Class 5. Pavan and Tanvi are students of Class 4. Rishi and Pavan are boys. Swathi and Tanvi are girls. These four students played a total of three games of chess. The games were played one after another. A player who lost a game did not participate in any more games. It was observed that:

1. The first game was the only game where two students of the same class played against each other.

2. The students of Class 5 won more games than the students of Class 4.

3. The boys won 2 games and the girl won 1 game.

The student who did not lose any game is:

  • (A) Tanvi
  • (B) Pavan
  • (C) Swati
  • (D) Rishi

Question 9:

Consider the visualization of a 3-dimensional data cube showing sales quantity for each combination of the attributes Product-type, Month & Country. From this, if we want to further visualize the sale quantity for each combination of PT, M & S, which of the following OLAP operator should be performed?

  • (A) Slicing
  • (B) Roll-up
  • (C) Dicing
  • (D) Drill-down

Question 10:

Which of the following NOT true? (The name of the predicate are intuitive)

  • (A) \(\forall\)x likes (x, Ice-cream) \(\Rightarrow\) \(\neg\)\(\exists\)x\(\neg\) likes (x, Ice-cream)
  • (B) \(\forall\)x \(\forall\)y classmate (x, y) \(\Rightarrow\) Classmate (y, x)
  • (C) "All humans are mortal" is equivalent to \(\forall\)x is human (x) \(\Rightarrow\) Ismortal(x)
  • (D) "Each King is a person" is equivalent to \(\forall\) Isking (x) \(\wedge\) Isperson(x)

Question 11:

A clinic specializes in testing for a Disease D. The result of the test can be either positive (+ve) or negative (-ve). A study reveals that if a person suffers from the Disease D, the test result in that clinic comes up +ve 80% of the time and negative 20% of the time. If a person is not suffering from the Disease D, the test comes out positive 10% of the time & negative 90% of the time. It is also known among the general population the disease D occurs in 30% of the individuals. If the person tests positive for D in that clinic, the probability that he/she actually suffers from the Disease D is __________ (Round off to 2 decimal places).


Question 12:

A node size is 4096B. Node pointer size is 10B. Search key is 11B. Record pointer is 12B. What is the Max node pointers that can be stored in a non-leaf node of a B+ tree?


Question 13:

Consider two relations R(A,B) and S(E,C). A is primary key and E is a FK referring A. Which of the following operations never violate FK constraint?

  • (A) Insert in s
  • (B) Delete from R
  • (C) Insert in R
  • (D) Delete from S

Question 14:

Consider two entity sets E1(A11, A12, A13) and E2(A21,A22,A23) with A11 and A21 as PK. A22 is a multivalued attribute. R12 is a many to many relationship with total participation on both side. What is the min number of relations required to convert this ER model to relational model in 3NF?


Question 15:

Consider the directed graph G = (V, E) where V is the finite set of vertices and E is the set of directed edges between the vertices. G may contain cycle but there is no self-loop, further G may not be strongly connected. Let G\(^R\) be the graph obtained by reversing direction of all edges without changing set of vertices. Assume that BFS or DFS for any given vertex V of a graph will visit only the reachable vertices from V in that graph. Which of the following statement must always be true regardless of the structure of G?

  • (A) In G\(^R\), BFS traversal from V will visit exactly the same set of vertices as the DFS from V in G.
  • (B) If U is a reachable vertex in the DFS of G from V then V is also a reachable vertex in the BFS of G\(^R\) from U.
  • (C) If U is a reachable vertex in the BFS of G\(^R\) from V then U is also a reachable vertex in the DFS of G from V.
  • (D) The order of vertices visited in the BFS of G\(^R\) from V is the reverse of the order of vertices visited in the DFS of G from V.

Question 16:

A be a sorted array containing 1000 distinct integers. You perform recursive binary search on A to find an element y. Suppose each comparison checks whether the middle element computed during the current recursive step is equal to, less than, or greater than y. The max number of comparisons that may have to be performed if y is not an element of A is __________ . (Answer in int)


Question 17:

Consider Quick sort algorithm used to sort an array of n distinct randomly ordered element. In every call the pivot is chosen as the first element of the current subarray. Let T(n) denote the expected time to sort the array. Assume that the time to partition is linear in the size of current subarray. Which of the following option represents T(n) in this scenario.

  • (A) T(n) = T(\(\frac{n}{4}\)) + T(\(\frac{3n}{4}\)) + O(n)
  • (B) T(n) = \(\frac{1}{n} \sum_{K=0}^{n-1}\) [T(K) + T(n - K - 1)] + O(n)
  • (C) T(n) = 2T(\(\frac{n}{2}\)) + O(n)
  • (D) T(n) = T(1) + T(n - 1) + O(n)

Question 18:

P = [1, 2, 3, 5, 4]
Two sorting algo Binary sort (BS), Insertion sort (IS) apply. Let N\(_1\) be the total number of comparisons done by BS on the element of P and N\(_2\) be the total number of comparisons done by IS on the elements of P. Which of the following option is/are correct?

  • (A) IS on P perform only one swap.
  • (B) N\(_1\) = 10, N\(_2\) = 4
  • (C) N\(_1\) \(>\) N\(_2\)
  • (D) Both BS and IS on P will make at least one unnecessary comparison. (i.e. comparing element that are already in correct order)

Question 19:

Let M be a randomly chosen non-empty subset of {1,2,3,..,2026}. Which of the following is a probability that product all the elements of M is even.

  • (A) \(\frac{2^{1013} (2^{1013}-1)}{(2^{2026}-1)}\)
  • (B) \(\frac{2^{1013} (2^{1013}-1)}{2^{2026}}\)
  • (C) \(\frac{1}{(2^{2026}-1)}\)
  • (D) \(\frac{2^{1013}}{2^{2026}}\)

Question 20:

Let X be an exp. distributed random variable with mean \(\lambda\)(\(>\) 0) if P (X\(>\) 5) = 0.35 then the conditional probability P(x\(>\) 10\(|\) x\(>\) 5) is _______.


Question 21:

Let 4 points in 3D: P1 = [2,3,4,], P2 = [3,1,1], P3 = [5, -2, 3] ,P4 = [3,3,3]. Hierarchical Agglomerative clustering is used to cluster the above points. If Manhattan Distance is used as the distance metric during clustering, which two points will be merged first?

  • (A) P2 P3
  • (B) P1P2
  • (C) P2P4
  • (D) P3P4

Question 22:

Let X be a discrete valued random variable with cumulative dist. F(x) is/are correct:

  • (A) F(x) is a left continuous
  • (B) always a positive F(x)
  • (C) has jump discontinuity
  • (D) is non decreasing F(x)

Question 23:

Let y1, y2, y3 eigen value of M = \(\begin{bmatrix} 1 & 0 & 0
0 & \cos t & \sin t
0 & \sin t & \cos t \end{bmatrix}\) Where t \(\in\) [-\(\pi\),\(\pi\)] & y\(_1\) + y\(_2\) + y\(_3\) = 1 + \(\sqrt{2}\) then t = ?

  • (A) \(\frac{\pi}{4}, \frac{\pi}{3}\)
  • (B) \(\frac{\pi}{3}, -\frac{\pi}{3}\)
  • (C) \(-\frac{\pi}{3}, \frac{\pi}{4}\)
  • (D) \(\frac{\pi}{4}, -\frac{\pi}{4}\)

Question 24:

A\(_{5\times5}\) each element following Bernoulli (P = 0.50) Dist. independently. The prob. that row sum of the second row and column sum of the third column are both equal to 3 is-


Question 25:

Which of the following algo is NOT an example of uninformed search?

  • (A) DFS
  • (B) A*
  • (C) Depth Limited Search
  • (D) BFS

Question 26:

For a classification problem PCA has been used to reduce the dimensionality of a feature space from 100 to 10. Which of the following option is true about the angle b/w first 2 and 10th principal components?

  • (A) 90\(^\circ\) \(<\) \(\theta\) \(<\) 180\(^\circ\)
  • (B) \(\theta\) = 0
  • (C) \(\theta\) = 90
  • (D) 0 \(<\) \(\theta\) \(<\) 90\(^\circ\)

Question 27:

Let P(x) be a predicate. Which of the following is NOT valid in first-order logic?

  • (A) \(\forall x P(x) \Rightarrow \exists x \neg P(x)\)
  • (B) \(\forall x P(x) \Rightarrow \exists x P(x)\)
  • (C) \(\exists x P(x) \Rightarrow \forall x P(x)\)
  • (D) \(\exists x P(x) \Leftrightarrow \forall x P(x)\)

Question 28:

Consider the game tree for a 2-player turn-taking minimax game shown in the figure. The value of the terminal node represent the utility of the game state if the game end there. There are two player Max & Min. At any particular point state of the game Max prefers to move to a state of maximum value, on the other hand Min prefers to move to a state of min value. Suppose Max starts the game at the root and has three strategies 1, 2, 3. Next, Min plays and also has three strategies 1, 2, 3. The game end there. Both player always take optimal strategies throughout the game. At the root, the best strategy for max is ________ (int).




Question 29:

Consider the supervised learning task. The objective function being minimized is f(w) = w \(\cdot\) x, where w \(\in\) R is the parameter. Stochastic Gradient Descent with learning rate of 0.10. Let w = 10.00 be the i\(^{th}\) iteration(w\(_i\)). The value of w at the end of iteration (i+1) is if x = 10 ________. Round off 2 decimal.


Question 30:

The value of \(\sum_{j=0}^{\infty} \sum_{i=1}^{\infty} 2^{-j}3^{-i}\) is ________.


Question 31:

Consider that you are training a classifier for a 10-class classification problem. Each I/P is represented as a 512 dimensional vector. There are 1000 samples out of which first 100 will be used for testing. Let Leave-one-out-Cross-Validation (LOOCV) be used for selection of the classifier model before testing. Which of the following option is the correct option for no. of validation split that will be generated?

  • (A) 10
  • (B) 512
  • (C) 1000
  • (D) 900

Question 32:

Consider the Ridge LRR is being used to learn prediction function y\(_{pred}\)=w\(^T\)x where w,x \(\in\) R\(^2\) & mean absolute error (MAE) is used to measure the prediction error. A weight of 0.20 is associated with the regularizer. At an intermediate step of training process assume that the parameter w = [-3.00,4.00]\(^T\) . In the next step for the I/P x=[1.00,2.00]\(^T\), the predicted value of y is noted. Let the relation b/w x=[x\(_1\),x\(_2\)]\(^T\) & the true value of y be y\(_{true}\) = x\(_1\)+x\(_2\). The value of the overall regularized loss for instance is __________ (upto 2 decimal).


Question 33:

S\(_1\) = \{x = (x\(_1\), x\(_2\), x\(_3\))\(^T\) \(\in\) R\(^3\) \(|\) x\(^T\)x \(\le\)16\. Let S\(_2\) be subspace of R\(^3\) with dimension 2 then Area of S\(_1\) \(\cap\) S\(_2\) ?

  • (A) 16 \(\pi\)
  • (B) 16\(\pi^2\)
  • (C) 4\(\pi^2\)
  • (D) 4\(\pi\)

Question 34:

M=(I\(_n\)-\(\frac{1}{n}\)11\(^T\)) be a matrix where 1 = (1,1,...,1)\(^T\) \(\in\) R\(^n\) and I\(_n\) is the identity matrix of order n. The value of max\(_{x \in S}\) x\(^T\)Mx where S = \{x \(\in\) R\(^n\) \(|\) x\(^T\)x =1\ is _________.


Question 35:

Consider a fully connected forward multi-layer perceptron. It has 30 neurons in the i/p layer foll. by two hidden layers and an o/p layer. The first hidden layer has 4 neurons and the second 3 neurons. The o/p layer has only 1 neuron. Assume that no biased parameter parameter in the mul_________.


Question 36:

Consider that 20 stories of author X and 10 stories of author Y were kept together without mentioning the names of the authors. A classifier was then asked to predict the author (X or Y) of each of the stories. Later out of X's stories 6 were classified as that of Y. On the other hand, out of Y's stories 2 were classified as that of X. Considering X and Y as two classes, then which of the following is/are true?

  • (A) Recall of class X is higher than the recall of class Y.
  • (B) Precision of class X is higher than the precision of class Y.
  • (C) Accuracy of the classifier is 11/15.
  • (D) Accuracy of the classifier is 14/15.

Question 37:

Which one of the following is true for Ridge Regression (RR)

  • (A) The regularizer of RR may increase the bias of the model, but it helps in reducing the variance in prediction.
  • (B) The reg. of RR uses L1 norm.
  • (C) RR aims to reduce the num. of parameters that have –ve value.
  • (D) The reg. in the objective fn. of RR is used to guard against scenarios where the model works well for the test data but poorly for the training data.

Question 38:

Assume that creative (C) person will succeed (S) if the person is also disciplined (D) but will not succeed otherwise.
Statement:
(i) C \(\land\) S \(\leftrightarrow\) D
(ii) C \(\rightarrow\) (S \(\rightarrow\) D)
(iii) C \(\leftrightarrow\) ((D \(\rightarrow\) S) \(\lor\) \(\neg\)S)


Question 39:

Let L = lim\(_{n \to \infty} \sum_{k=0}^{n} \frac{e^{-n} n^k}{k!}\) value of L

  • (A) 1.0
  • (B) 0.5
  • (C) 0
  • (D) e\(^{-1}\)

Question 40:

M = \(\begin{bmatrix} \cos\theta & -\sin\theta
\sin\theta & \cos\theta \end{bmatrix}\) , \(\theta\) = \(\frac{2\pi}{5}\) then M\(^{2026}\) = ?

  • (A) M\(^2\)
  • (B) M
  • (C) I
  • (D) M\(^{-1}\)

Question 41:

M = (I\(_n\)-\(\frac{1}{n}\)11\(^T\)) be a matrix where 1 = (1,1,...,1)\(^T\) \(\in\) R\(^n\) and I\(_n\) is the identity matrix of order n. then which of the following is/are true.

  • (A) M is a projection matrix
  • (B) M\(^T\) = M
  • (C) M\(^2\) = I
  • (D) trace (M) = n

Question 42:

def outer():
\indent x = []
\indent def inner(val):
\indent \indent x.append(val)
\indent \indent return x
\indent return inner
f1 = outer()
f2 = outer()
print(f1(10)) \# line P
print(f1(20)) \# line Q
print(f2(30)) \# line R
print(f1(40)) \# line S
Which of the following is/are correct?

  • (A) Output of line Q is [10, 20]
  • (B) Output of line S is [10, 20, 40]
  • (C) Output of line R is [10, 20, 30]
  • (D) f1 and f2 share the same list x

Question 43:

A recursive function is given:
def mystery(n):
\indent if n \(<=\) 0:
\indent \indent return 1
\indent else:
\indent \indent return mystery(n-1) + mystery(n-2)
Find the value of mystery(4).


Question 44:

Assume a typical runtime stack is used for the recursive function mystery(n) as defined earlier. How many total function calls (stack activations), including the initial call, are made to compute mystery(4)?

  • (A) 5
  • (B) 15
  • (C) 25
  • (D) 20

Question 45:

def fun(L, i=0):
\indent if i \(>=\) len(L) - 1:
\indent \indent return 0
\indent if L[i] \(>\) L[i+1]:
\indent \indent L[i+1], L[i] = L[i], L[i+1]
\indent \indent return 1 + fun(L, i+1)
\indent else:
\indent \indent return fun(L, i+1)
data = [5, 3, 4, 1, 2]
count = 0
for _ in range(len(data)):
\indent count += fun(data)
print(count)


Question 46:

def append_to_lst(val, lst=[]):
\indent lst.append(val)
\indent return lst
print(append_to_lst(1))
print(append_to_lst(2))
print(append_to_lst(3, []))

  • (A) [1] [1,2] [3]
  • (B) [1] [1,2] [1,3]
  • (C) [1] [2] [1,2,3]
  • (D) [1] [2] [3]

Question 47:

X is said to entail sentence Y. If whenever X is true, Y is also TRUE. Which of the following is/are correct if X entails Y?

  • (A) If x then y
  • (B) x \(\rightarrow\) y
  • (C) x \(\land\) \(\neg\)y is false
  • (D) If y then x

Question 48:

R(A B C D E)
F = {A\(\rightarrow\) BC, CD\(\rightarrow\) E, E\(\rightarrow\)A\
Which of the following is correct?

  • (A) A and ED and CK
  • (B) AD, ED, CD are CK
  • (C) A, E, C, D and CK
  • (D) A, E, CD are CK

Question 49:

You are given the following preorder and In-order traversal of binary Tree T with nodes E, F, G, P, Q, R, S-
Preorder : P, Q, S, E, R, F, G
Inorder: S, Q, E, P, F, R, G
Which of the following statements is/are true about the binary True T?

  • (A) Node Q has only once child
  • (B) Post order traversal SEQFGRP
  • (C) P is the root of T
  • (D) The left subtree of node R contains node G

Question 50:

For a given data set {X\(_1\), X\(_2\), ..., X\(_n\)\ where n = 100
\(\frac{1}{2000} \sum_{i=1}^{n} \sum_{j=1}^{n} (x_i - x_j)^2 = 99\)
Let us denote \(\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i\)
The value of \(\frac{1}{99} \sum_{i=1}^{n} (x_i - \bar{x})^2\) is __________.


Question 51:

Let X be a random variable that follows uniform (-1, 1) dist. The conditional dist. of the random variable Y given X = x is the Uniform (x\(^2\) - 0.1, x\(^2\) + 0.1) dist. The value of correlation (X, Y) is _______.