To be a successful business analyst, you need a combination of technical skills, analytical abilities, and interpersonal qualities. Here are some essential skills and pointers to excel in the field of business analysis:
1. Analytical Skills
2. Problem-Solving Skills
3. Domain Knowledge
4. Data Management:
5. Business Intelligence Tools:
6. Requirement Elicitation:
7. Documentation and Reporting:
8. Technical Knowledge
9. Critical Thinking
10. Interpersonal Skills
11. Project Management
12. Adaptability
13. Presentation Skills
1. Analytical Skills
2. Problem-Solving Skills
3. Domain Knowledge
4. Data Management:
5. Business Intelligence Tools:
6. Requirement Elicitation:
7. Documentation and Reporting:
8. Technical Knowledge
9. Critical Thinking
10. Interpersonal Skills
11. Project Management
12. Adaptability
13. Presentation Skills
👍33❤2
Different Types of Data Analyst Interview Questions
👇👇
Technical Skills: These questions assess your proficiency with data analysis tools, programming languages (e.g., SQL, Python, R), and statistical methods.
Case Studies: You might be presented with real-world scenarios and asked how you would approach and solve them using data analysis.
Behavioral Questions: These questions aim to understand your problem-solving abilities, teamwork, communication skills, and how you handle challenges.
Statistical Questions: Expect questions related to denoscriptive and inferential statistics, hypothesis testing, regression analysis, and other quantitative techniques.
Domain Knowledge: Some interviews might delve into your understanding of the specific industry or domain the company operates in.
Machine Learning Concepts: Depending on the role, you might be asked about your understanding of machine learning algorithms and their applications.
Coding Challenges: These can assess your programming skills and your ability to translate algorithms into code.
Communication: You might need to explain technical concepts to non-technical stakeholders or present your findings effectively.
Problem-Solving: Expect questions that test your ability to approach complex problems logically and analytically.
Remember, the exact questions can vary widely based on the company and the role you're applying for. It's a good idea to review the job denoscription and the company's background to tailor your preparation.
👇👇
Technical Skills: These questions assess your proficiency with data analysis tools, programming languages (e.g., SQL, Python, R), and statistical methods.
Case Studies: You might be presented with real-world scenarios and asked how you would approach and solve them using data analysis.
Behavioral Questions: These questions aim to understand your problem-solving abilities, teamwork, communication skills, and how you handle challenges.
Statistical Questions: Expect questions related to denoscriptive and inferential statistics, hypothesis testing, regression analysis, and other quantitative techniques.
Domain Knowledge: Some interviews might delve into your understanding of the specific industry or domain the company operates in.
Machine Learning Concepts: Depending on the role, you might be asked about your understanding of machine learning algorithms and their applications.
Coding Challenges: These can assess your programming skills and your ability to translate algorithms into code.
Communication: You might need to explain technical concepts to non-technical stakeholders or present your findings effectively.
Problem-Solving: Expect questions that test your ability to approach complex problems logically and analytically.
Remember, the exact questions can vary widely based on the company and the role you're applying for. It's a good idea to review the job denoscription and the company's background to tailor your preparation.
👍15❤2
Machine Learning for Business Analytics Concepts, Techniques.pdf
40.1 MB
📚 Title: Machine Learning for Business Analytics (2023)
👍12
👍3🎉1
1. How to change a table name in SQL?
This is the command to change a table name in SQL:
ALTER TABLE table_name
RENAME TO new_table_name;
We will start off by giving the keywords ALTER TABLE, then we will follow it up by giving the original name of the table, after that, we will give in the keywords RENAME TO and finally, we will give the new table name.
2. How to use LIKE in SQL?
The LIKE operator checks if an attribute value matches a given string pattern. Here is an example of LIKE operator
SELECT * FROM employees WHERE first_name like ‘Steven’;
With this command, we will be able to extract all the records where the first name is like “Steven”.
3. If we drop a table, does it also drop related objects like constraints, indexes, columns, default, views and sorted procedures?
Yes, SQL server drops all related objects, which exists inside a table like constraints, indexes, columns, defaults etc. But dropping a table will not drop views and sorted procedures as they exist outside the table.
4. Explain SQL Constraints.
SQL Constraints are used to specify the rules of data type in a table. They can be specified while creating and altering the table. The following are the constraints in SQL: NOT NULL CHECK DEFAULT UNIQUE PRIMARY KEY FOREIGN KEY
This is the command to change a table name in SQL:
ALTER TABLE table_name
RENAME TO new_table_name;
We will start off by giving the keywords ALTER TABLE, then we will follow it up by giving the original name of the table, after that, we will give in the keywords RENAME TO and finally, we will give the new table name.
2. How to use LIKE in SQL?
The LIKE operator checks if an attribute value matches a given string pattern. Here is an example of LIKE operator
SELECT * FROM employees WHERE first_name like ‘Steven’;
With this command, we will be able to extract all the records where the first name is like “Steven”.
3. If we drop a table, does it also drop related objects like constraints, indexes, columns, default, views and sorted procedures?
Yes, SQL server drops all related objects, which exists inside a table like constraints, indexes, columns, defaults etc. But dropping a table will not drop views and sorted procedures as they exist outside the table.
4. Explain SQL Constraints.
SQL Constraints are used to specify the rules of data type in a table. They can be specified while creating and altering the table. The following are the constraints in SQL: NOT NULL CHECK DEFAULT UNIQUE PRIMARY KEY FOREIGN KEY
👍18👎2❤1
Here's a snippet of code written in C:
Solution:Alias
say, you are asked to count the frequency of each color occurring in the same the table , so you can write something like this:
for(int i=0;i<n;i++)How can I write something equivalent in SQL?
{
for(int j=0;j<n;j++)
{
//some operation
}
}
Solution:Alias
say, you are asked to count the frequency of each color occurring in the same the table , so you can write something like this:
select distinct color ,(select count(*) from colors where c.color=color) from colors c;👍15❤1
Q.Autoencoder methods
A. Autoencoder is a type of neural network where the output layer has the same dimensionality as the input layer. In simpler words, the number of output units in the output layer is equal to the number of input units in the input layer. Various techniques exist to prevent autoencoders from learning the identity function and to improve their ability to capture important ' information and learn richer representations. 1.Sparse autoencoder (SAE) 2. Denoising autoencoder (DAE) 3. Contractive autoencoder (CAE) 4. Principal component analysis.
Q. L1 and L2 regularization?
A. L1 regularization gives output in binary weights from 0 to 1 for the model's features and is adopted for decreasing the number of features in a huge dimensional dataset. L2 regularization disperse the error terms in all the weights that leads to more accurate customized final models.
Q. How to measure the Euclidean distance betweeen the two arrays in numpy?
A. Euclidean distance is defined in mathematics as the magnitude or length of the line segment between two points. There are multiple methods for measuring the euclidean methods.
Method 1. In this method, we first initialize two numpy arrays. Then, we use linalg.norm() of numpy basically to compute the euclidean distance directly.
Method 2. In this method, we first initialize two numpy arrays. Then, we take the difference of the two arrays, compute the dot product of the result, and transpose of the result. Then we take the square root of the answer. This is another way to implement Euclidean distance.
Method 3. In this method, we first initialize two numpy arrays. Then, we compute the difference of these arrays and take their square. We take the sum of the squared elements, and after that, we take the square root in the end. This is another way to implement Euclidean distance.
Q.What are the support vectors in SVM?
A. Support vectors are data points that are closer to the hyperplane and influence the position and orientation of the hyperplane. Using these support vectors, we maximize the margin of the classifier. Deleting the support vectors will change the position of the hyperplane. These are the points that help us build our SVM.
Q. How do you handle categorical data?
A. One-Hot Encoding is the most common, correct way to deal with non-ordinal categorical data. It consists of creating an additional feature for each group of the categorical feature and mark each observation belonging (Value=1) or not (Value=0) to that group.
Q. What is coerrelation?
A.Correlation is a statistical measure that expresses the extent to which two variables are linearly related (meaning they change together at a constant rate). It's a common tool for describing simple relationships without making a statement about cause and effects
Q. What is covariance?
A. Covariance is nothing but a measure of correlation. Covariance is a measure of how much two random variables vary together. It’s similar to variance, but where variance tells you how a single variable varies, co variance tells you how two variables vary together
A. Autoencoder is a type of neural network where the output layer has the same dimensionality as the input layer. In simpler words, the number of output units in the output layer is equal to the number of input units in the input layer. Various techniques exist to prevent autoencoders from learning the identity function and to improve their ability to capture important ' information and learn richer representations. 1.Sparse autoencoder (SAE) 2. Denoising autoencoder (DAE) 3. Contractive autoencoder (CAE) 4. Principal component analysis.
Q. L1 and L2 regularization?
A. L1 regularization gives output in binary weights from 0 to 1 for the model's features and is adopted for decreasing the number of features in a huge dimensional dataset. L2 regularization disperse the error terms in all the weights that leads to more accurate customized final models.
Q. How to measure the Euclidean distance betweeen the two arrays in numpy?
A. Euclidean distance is defined in mathematics as the magnitude or length of the line segment between two points. There are multiple methods for measuring the euclidean methods.
Method 1. In this method, we first initialize two numpy arrays. Then, we use linalg.norm() of numpy basically to compute the euclidean distance directly.
Method 2. In this method, we first initialize two numpy arrays. Then, we take the difference of the two arrays, compute the dot product of the result, and transpose of the result. Then we take the square root of the answer. This is another way to implement Euclidean distance.
Method 3. In this method, we first initialize two numpy arrays. Then, we compute the difference of these arrays and take their square. We take the sum of the squared elements, and after that, we take the square root in the end. This is another way to implement Euclidean distance.
Q.What are the support vectors in SVM?
A. Support vectors are data points that are closer to the hyperplane and influence the position and orientation of the hyperplane. Using these support vectors, we maximize the margin of the classifier. Deleting the support vectors will change the position of the hyperplane. These are the points that help us build our SVM.
Q. How do you handle categorical data?
A. One-Hot Encoding is the most common, correct way to deal with non-ordinal categorical data. It consists of creating an additional feature for each group of the categorical feature and mark each observation belonging (Value=1) or not (Value=0) to that group.
Q. What is coerrelation?
A.Correlation is a statistical measure that expresses the extent to which two variables are linearly related (meaning they change together at a constant rate). It's a common tool for describing simple relationships without making a statement about cause and effects
Q. What is covariance?
A. Covariance is nothing but a measure of correlation. Covariance is a measure of how much two random variables vary together. It’s similar to variance, but where variance tells you how a single variable varies, co variance tells you how two variables vary together
👍15❤1
1. What is concurrency control in DBMS?
This is a process of managing simultaneous operations in a database so that database integrity is not compromised. The following are the two approaches involved in concurrency control:
Optimistic approach – Involves versioning
Pessimistic approach – Involves locking
2. What is a checkpoint in DBMS and when does it occur?
A checkpoint is a mechanism where all the previous logs are removed from the system and are permanently stored on the storage disk. So, basically, checkpoints are those points from where the transaction log record can be used to recover all the committed data up to the point of crash.
3. What are groups in Tableau?
A group is a combination of dimension members that make higher level categories. For example, if you are working with a view that shows average test scores by major, you may want to group certain majors together to create major categories.
4. How are nested IF statements used in Excel?
The function IF() can be nested when we have multiple conditions to meet. The FALSE value in the first IF function is replaced by another IF function to make a further test.
5. Do you wanna make your career in Data Science & Analytics but don't know how to start ?
https://news.1rj.ru/str/sqlspecialist/398
Here is a complete roadmap from scratch that will make you technically strong enough to crack any Data Scientist / Analyst and also learn Pro Career Growth Hacks to land on your Dream Job.
This is a process of managing simultaneous operations in a database so that database integrity is not compromised. The following are the two approaches involved in concurrency control:
Optimistic approach – Involves versioning
Pessimistic approach – Involves locking
2. What is a checkpoint in DBMS and when does it occur?
A checkpoint is a mechanism where all the previous logs are removed from the system and are permanently stored on the storage disk. So, basically, checkpoints are those points from where the transaction log record can be used to recover all the committed data up to the point of crash.
3. What are groups in Tableau?
A group is a combination of dimension members that make higher level categories. For example, if you are working with a view that shows average test scores by major, you may want to group certain majors together to create major categories.
4. How are nested IF statements used in Excel?
The function IF() can be nested when we have multiple conditions to meet. The FALSE value in the first IF function is replaced by another IF function to make a further test.
5. Do you wanna make your career in Data Science & Analytics but don't know how to start ?
https://news.1rj.ru/str/sqlspecialist/398
Here is a complete roadmap from scratch that will make you technically strong enough to crack any Data Scientist / Analyst and also learn Pro Career Growth Hacks to land on your Dream Job.
👍10
SELECT Syntax
1. SELECT column1, column2, ...
FROM table_name;
Here, column1, column2, ... are the field/column names of the table you want to select data from.
2. SELECT * FROM table_name;
Here * ( star ) means all column names/fields
3. SELECT DISTINCT Syntax
SELECT DISTINCT column1, column2, ...
FROM table_name;
The SELECT DISTINCT statement is used to return only distinct (different) values.
Inside a table, a column often contains many duplicate values; and sometimes you only want to list the different (distinct) values.
4. WHERE Syntax
SELECT column1, column2, ...
FROM table_name
WHERE condition;
The SQL WHERE Clause
The WHERE clause is used to filter records.
It is used to extract only those records that fulfill a specified condition.
WHERE Clause Example
The following SQL statement selects all the customers from the country "Mexico", in the "Customers" table:
Example
SELECT * FROM Customers
WHERE Country='Mexico' ;
1. SELECT column1, column2, ...
FROM table_name;
Here, column1, column2, ... are the field/column names of the table you want to select data from.
2. SELECT * FROM table_name;
Here * ( star ) means all column names/fields
3. SELECT DISTINCT Syntax
SELECT DISTINCT column1, column2, ...
FROM table_name;
The SELECT DISTINCT statement is used to return only distinct (different) values.
Inside a table, a column often contains many duplicate values; and sometimes you only want to list the different (distinct) values.
4. WHERE Syntax
SELECT column1, column2, ...
FROM table_name
WHERE condition;
The SQL WHERE Clause
The WHERE clause is used to filter records.
It is used to extract only those records that fulfill a specified condition.
WHERE Clause Example
The following SQL statement selects all the customers from the country "Mexico", in the "Customers" table:
Example
SELECT * FROM Customers
WHERE Country='Mexico' ;
👍12❤1
Sample Resume
Red -> action verbs
Pink -> hard skill
Yellow -> soft skill
Cyan -> impact statement
Green -> impact matrix
Resume Tips: https://news.1rj.ru/str/sqlspecialist/464
Red -> action verbs
Pink -> hard skill
Yellow -> soft skill
Cyan -> impact statement
Green -> impact matrix
Resume Tips: https://news.1rj.ru/str/sqlspecialist/464
👍13❤2👌1
Second_Language_Acquisition_Research_Series_Guilherme_D_Garcia_Data.pdf
5.7 MB
Book: Data Visualization and Analysis in Second Language Research(2021)
👍12
Few common problems with lot of resumes:
1. 𝐈𝐫𝐫𝐞𝐥𝐞𝐯𝐚𝐧𝐭 𝐢𝐧𝐟𝐨𝐫𝐦𝐚𝐭𝐢𝐨𝐧.
I understand that there are a lot of achievements that we are personally proud of (things like represented school/clg in XYZ competition or school head/class head etc), but not all of them are relevant to technical roles. As a fresher, try to focus more on technical achievements rather than managerial ones.
2. 𝐋𝐚𝐜𝐤 𝐨𝐟 𝐪𝐮𝐚𝐥𝐢𝐭𝐲 𝐩𝐫𝐨𝐣𝐞𝐜𝐭𝐬.
Many resumes have the same common projects, such as:
Creating just the front-end using HTML and CSS and redirecting all the work to an open-source API (e.g., weather prediction and recipe suggestion apps).
Most common projects are: -
Tic-tac-toe game.
Sorting algorithms visualizers.
To-do application.
Movie listing.
The codes for these projects are often copied and pasted from GitHub repositories.
Projects are like a bounty. If you are prepared well and have quality projects in your resume, you can set the tempo of the interview. It is one of the few questions that you will almost certainly be asked in the interview.
I don't understand why we can spend 2 years preparing for data structures and algorithms (DSA) and competitive programming (CP), but not even 2 weeks to create quality projects.
Even if your resume passes the applicant tracking system (ATS) and recruiter's screening, weak projects can still lead to your rejection in interviews. And this is completely in your hands.
I feel that this topic needs a lot more discussion about the type and quality of projects that one needs. Let me know if you want a dedicated post on this.
3. 𝐋𝐚𝐜𝐤 𝐨𝐟 𝐪𝐮𝐚𝐧𝐭𝐢𝐭𝐚𝐭𝐢𝐯𝐞 𝐝𝐚𝐭𝐚.
For technical roles, adding quantitative data has a big impact.
For example, instead of saying "I wrote unit tests for service X and reduced the latency of service Y by caching," you can say "I wrote unit tests and increased the code coverage from 80% to 95% of service X and reduced latency from 100 milliseconds to 50 milliseconds of service Y."
1. 𝐈𝐫𝐫𝐞𝐥𝐞𝐯𝐚𝐧𝐭 𝐢𝐧𝐟𝐨𝐫𝐦𝐚𝐭𝐢𝐨𝐧.
I understand that there are a lot of achievements that we are personally proud of (things like represented school/clg in XYZ competition or school head/class head etc), but not all of them are relevant to technical roles. As a fresher, try to focus more on technical achievements rather than managerial ones.
2. 𝐋𝐚𝐜𝐤 𝐨𝐟 𝐪𝐮𝐚𝐥𝐢𝐭𝐲 𝐩𝐫𝐨𝐣𝐞𝐜𝐭𝐬.
Many resumes have the same common projects, such as:
Creating just the front-end using HTML and CSS and redirecting all the work to an open-source API (e.g., weather prediction and recipe suggestion apps).
Most common projects are: -
Tic-tac-toe game.
Sorting algorithms visualizers.
To-do application.
Movie listing.
The codes for these projects are often copied and pasted from GitHub repositories.
Projects are like a bounty. If you are prepared well and have quality projects in your resume, you can set the tempo of the interview. It is one of the few questions that you will almost certainly be asked in the interview.
I don't understand why we can spend 2 years preparing for data structures and algorithms (DSA) and competitive programming (CP), but not even 2 weeks to create quality projects.
Even if your resume passes the applicant tracking system (ATS) and recruiter's screening, weak projects can still lead to your rejection in interviews. And this is completely in your hands.
I feel that this topic needs a lot more discussion about the type and quality of projects that one needs. Let me know if you want a dedicated post on this.
3. 𝐋𝐚𝐜𝐤 𝐨𝐟 𝐪𝐮𝐚𝐧𝐭𝐢𝐭𝐚𝐭𝐢𝐯𝐞 𝐝𝐚𝐭𝐚.
For technical roles, adding quantitative data has a big impact.
For example, instead of saying "I wrote unit tests for service X and reduced the latency of service Y by caching," you can say "I wrote unit tests and increased the code coverage from 80% to 95% of service X and reduced latency from 100 milliseconds to 50 milliseconds of service Y."
👍16❤2
Data Analyst Interview Questions
1. What do Tableau's sets and groups mean?
Data is grouped using sets and groups according to predefined criteria. The primary distinction between the two is that although a set can have only two options—either in or out—a group can divide the dataset into several groups. A user should decide which group or sets to apply based on the conditions.
2.What in Excel is a macro?
An Excel macro is an algorithm or a group of steps that helps automate an operation by capturing and replaying the steps needed to finish it. Once the steps have been saved, you may construct a Macro that the user can alter and replay as often as they like.
Macro is excellent for routine work because it also gets rid of mistakes. Consider the scenario when an account manager needs to share reports about staff members who owe the company money. If so, it can be automated by utilising a macro and making small adjustments each month as necessary.
3.Gantt chart in Tableau
A Tableau Gantt chart illustrates the duration of events as well as the progression of value across the period. Along with the time axis, it has bars. The Gantt chart is primarily used as a project management tool, with each bar representing a project job.
4.In Microsoft Excel, how do you create a drop-down list?
Start by selecting the Data tab from the ribbon.
Select Data Validation from the Data Tools group.
Go to Settings > Allow > List next.
Choose the source you want to offer in the form of a list array.
1. What do Tableau's sets and groups mean?
Data is grouped using sets and groups according to predefined criteria. The primary distinction between the two is that although a set can have only two options—either in or out—a group can divide the dataset into several groups. A user should decide which group or sets to apply based on the conditions.
2.What in Excel is a macro?
An Excel macro is an algorithm or a group of steps that helps automate an operation by capturing and replaying the steps needed to finish it. Once the steps have been saved, you may construct a Macro that the user can alter and replay as often as they like.
Macro is excellent for routine work because it also gets rid of mistakes. Consider the scenario when an account manager needs to share reports about staff members who owe the company money. If so, it can be automated by utilising a macro and making small adjustments each month as necessary.
3.Gantt chart in Tableau
A Tableau Gantt chart illustrates the duration of events as well as the progression of value across the period. Along with the time axis, it has bars. The Gantt chart is primarily used as a project management tool, with each bar representing a project job.
4.In Microsoft Excel, how do you create a drop-down list?
Start by selecting the Data tab from the ribbon.
Select Data Validation from the Data Tools group.
Go to Settings > Allow > List next.
Choose the source you want to offer in the form of a list array.
👍2