Which of the following is a framework to solve bigdata problems?
Anonymous Quiz
5%
Java
51%
Hadoop
16%
MongoDB
29%
Python
👍16😁6👏2❤1
Ways to improve the performance of Tableau
👉🏻 Use an Extract to make workbooks run faster.
👉🏻 Reduce the number of marks on the view to avoid information overload.
👉🏻 Hide unused fields.
👉🏻 Use Context filters.
👉🏻 Use indexing in tables and use the same fields for filtering.
👉🏻 Remove unnecessary calculations and sheets.
👉🏻 Use an Extract to make workbooks run faster.
👉🏻 Reduce the number of marks on the view to avoid information overload.
👉🏻 Hide unused fields.
👉🏻 Use Context filters.
👉🏻 Use indexing in tables and use the same fields for filtering.
👉🏻 Remove unnecessary calculations and sheets.
👍26🤩4👏1
What is the full form of DML in database management system?
Anonymous Quiz
14%
Data Markup Language
3%
Decision Makeup Language
1%
Distorted Making Language
82%
Data Manipulation Language
👍14⚡7😁7❤1🔥1👏1
Which of the following is not a DML command in SQL?
Anonymous Quiz
79%
REVOKE
9%
INSERT
6%
DELETE
7%
UPDATE
👍23🔥4👏4😁3🥰1
In Hypothesis testing, a Type I error occurs when the null hypothesis is rejected even if it is true. It is also known as
Anonymous Quiz
68%
False Positive
32%
False Negative
👍9🥰5
Which of the following data types is not available in SQL?
Anonymous Quiz
6%
CHAR
4%
INT
5%
DATE
85%
ALPHABET
👍22👏3❤1
Requirements for data analyst based on some job profiles from @getjobss channel
👉 Strong data handling, data modeling & data flow understanding
👉 Ability to write complex queries on SQL to manipulate, consolidate multiple data sources for the purpose of dashboarding and analysis
👉 Intuition for data and ability to handle big data sources
👉 Strong working knowledge in Excel and visualization tools like PowerBI, Tableau, QlikView
👉 Ability to work on ambiguous tasks, find suitable solutions, and seek help/advice where appropriate.
👉 Strong data handling, data modeling & data flow understanding
👉 Ability to write complex queries on SQL to manipulate, consolidate multiple data sources for the purpose of dashboarding and analysis
👉 Intuition for data and ability to handle big data sources
👉 Strong working knowledge in Excel and visualization tools like PowerBI, Tableau, QlikView
👉 Ability to work on ambiguous tasks, find suitable solutions, and seek help/advice where appropriate.
👍61😁2❤1
What is the ful form of TCL in SQL?
Anonymous Quiz
89%
Transaction Control Language
11%
Total Control Level
👍11🔥9❤1
Which of the following is not available in marks card of tableau?
Anonymous Quiz
6%
Size
6%
Color
28%
Tooltip
8%
Label
52%
Query
👍21🥰1
Which of the following clause is used to sort data in SQL?
Anonymous Quiz
13%
where
66%
order by
5%
having
16%
group by
👍26🔥8😢3❤2👏1
Which of the following clause is not available in SQL?
Anonymous Quiz
3%
SELECT
3%
ORDER BY
4%
GROUP BY
5%
WHERE
85%
INPUT
👍13❤3🔥1
1. Define the term 'Data Wrangling.
Data Wrangling is the process wherein raw data is cleaned, structured, and enriched into a desired usable format for better decision making. It involves discovering, structuring, cleaning, enriching, validating, and analyzing data. This process can turn and map out large amounts of data extracted from various sources into a more useful format.
2. What are the best methods for data cleaning?
Create a data cleaning plan by understanding where the common errors take place and keep all the communications open. Before working with the data, identify and remove the duplicates. This will lead to an easy and effective data analysis process.Focus on the accuracy of the data. Set cross-field validation, maintain the value types of data, and provide mandatory constraints.Normalize the data at the entry point so that it is less chaotic. You will be able to ensure that all information is standardized, leading to fewer errors on entry.
3. Explain the Type I and Type II errors in Statistics?
In Hypothesis testing, a Type I error occurs when the null hypothesis is rejected even if it is true. It is also known as a false positive.
A Type II error occurs when the null hypothesis is not rejected, even if it is false. It is also known as a false negative.
4. How do you make a dropdown list in MS Excel?
First, click on the Data tab that is present in the ribbon.Under the Data Tools group, select Data Validation.Then navigate to Settings > Allow > List.Select the source you want to provide as a list array.
5. State some ways to improve the performance of Tableau?
Use an Extract to make workbooks run faster.
Reduce the scope of data to decrease the volume of data.
Reduce the number of marks on the view to avoid information overload.
Hide unused fields.
Use Context filters.
Use indexing in tables and use the same fields for filtering.
Remove unnecessary calculations and sheets.
Data Wrangling is the process wherein raw data is cleaned, structured, and enriched into a desired usable format for better decision making. It involves discovering, structuring, cleaning, enriching, validating, and analyzing data. This process can turn and map out large amounts of data extracted from various sources into a more useful format.
2. What are the best methods for data cleaning?
Create a data cleaning plan by understanding where the common errors take place and keep all the communications open. Before working with the data, identify and remove the duplicates. This will lead to an easy and effective data analysis process.Focus on the accuracy of the data. Set cross-field validation, maintain the value types of data, and provide mandatory constraints.Normalize the data at the entry point so that it is less chaotic. You will be able to ensure that all information is standardized, leading to fewer errors on entry.
3. Explain the Type I and Type II errors in Statistics?
In Hypothesis testing, a Type I error occurs when the null hypothesis is rejected even if it is true. It is also known as a false positive.
A Type II error occurs when the null hypothesis is not rejected, even if it is false. It is also known as a false negative.
4. How do you make a dropdown list in MS Excel?
First, click on the Data tab that is present in the ribbon.Under the Data Tools group, select Data Validation.Then navigate to Settings > Allow > List.Select the source you want to provide as a list array.
5. State some ways to improve the performance of Tableau?
Use an Extract to make workbooks run faster.
Reduce the scope of data to decrease the volume of data.
Reduce the number of marks on the view to avoid information overload.
Hide unused fields.
Use Context filters.
Use indexing in tables and use the same fields for filtering.
Remove unnecessary calculations and sheets.
👍44❤10🔥4
Which of the following is not a join type in SQL?
Anonymous Quiz
4%
Left Join
8%
Inner Join
89%
Down Join
👍30❤1
Which of the following is not a data visualization tool?
Anonymous Quiz
4%
Tableau
5%
Power BI
91%
Javanoscript
👍24❤12😁9👏2
What is full form of DAX in Power BI?
Anonymous Quiz
87%
Data Analysis Expressions
13%
Data Acronym Experts
👍22❤5😢3🥰2👏1
What is the full form of DML in database language?
Anonymous Quiz
87%
Data manipulation language
1%
Data munging language
9%
Data management language
3%
Data machine language p
👍15🔥3👏3❤2🥰1
🗂The order of operations used in MS Excel while evaluating formulas
MS Excel follows a standard math protocol to evaluate a formula.
This protocol is called “order of operations” – PEMDAS –
~Parentheses
~Exponents
~Multiplication
~Division
~Addition
~Subtraction
MS Excel also applies some customization to handle the formula syntax.
The order in which MS Excel performs calculations can affect the return value of the formula.
First of all, Excel evaluates any expressions in parentheses.
As we have seen in mathematical formulae too, parentheses essentially override the normal order of operations. It prioritizes certain operations.
Next, Excel resolves cell references like A1 (cell address). It evaluates range references like A1:A10, making them arrays of values.
It also performs range operations like a union (comma) and an intersection (space).
Next, Excel performs –
-Exponentiation
-Negation
-% conversions
-Multiplication and division
-Addition and subtraction
-Concatenation
-Logical operators
MS Excel follows a standard math protocol to evaluate a formula.
This protocol is called “order of operations” – PEMDAS –
~Parentheses
~Exponents
~Multiplication
~Division
~Addition
~Subtraction
MS Excel also applies some customization to handle the formula syntax.
The order in which MS Excel performs calculations can affect the return value of the formula.
First of all, Excel evaluates any expressions in parentheses.
As we have seen in mathematical formulae too, parentheses essentially override the normal order of operations. It prioritizes certain operations.
Next, Excel resolves cell references like A1 (cell address). It evaluates range references like A1:A10, making them arrays of values.
It also performs range operations like a union (comma) and an intersection (space).
Next, Excel performs –
-Exponentiation
-Negation
-% conversions
-Multiplication and division
-Addition and subtraction
-Concatenation
-Logical operators
👍46❤7
We are 10k+ now before the new year 💪
Here is a special channel where you will find FREE Data Analysis Books
👇👇
https://news.1rj.ru/str/learndataanalysis
You guys are amazing
Thanks for sharing and supporting the channel ❤️❤️
Here is a special channel where you will find FREE Data Analysis Books
👇👇
https://news.1rj.ru/str/learndataanalysis
You guys are amazing
Thanks for sharing and supporting the channel ❤️❤️
❤54👍23👏10🔥8🥰7
1. What is the meaning of dropout in Deep Learning?
Dropout is a technique that is used to avoid overfitting a model in Deep Learning. If the dropout value is too low, then it will have minimal effect on learning. If it is too high, then the model can under-learn, thereby, causing lower efficiency.
2. What are sets in Tableau?
Sets are custom fields that define a subset of data based on some conditions. A set can be based on a computed condition, for example, a set may contain customers with sales over a certain threshold. Computed sets update as your data changes. Alternatively, a set can be based on specific data point in your view.
3. What is the difference between DROP and TRUNCATE commands?
DROP command removes a table and it cannot be rolled back from the database whereas TRUNCATE command removes all the rows from the table.
4. What is slicing in Python?
Ans: Slicing is used to access parts of sequences like lists, tuples, and strings. The syntax of slicing is-[start:end:step]. The step can be omitted as well. When we write [start:end] this returns all the elements of the sequence from the start (inclusive) till the end-1 element. If the start or end element is negative i, it means the ith element from the end.
Dropout is a technique that is used to avoid overfitting a model in Deep Learning. If the dropout value is too low, then it will have minimal effect on learning. If it is too high, then the model can under-learn, thereby, causing lower efficiency.
2. What are sets in Tableau?
Sets are custom fields that define a subset of data based on some conditions. A set can be based on a computed condition, for example, a set may contain customers with sales over a certain threshold. Computed sets update as your data changes. Alternatively, a set can be based on specific data point in your view.
3. What is the difference between DROP and TRUNCATE commands?
DROP command removes a table and it cannot be rolled back from the database whereas TRUNCATE command removes all the rows from the table.
4. What is slicing in Python?
Ans: Slicing is used to access parts of sequences like lists, tuples, and strings. The syntax of slicing is-[start:end:step]. The step can be omitted as well. When we write [start:end] this returns all the elements of the sequence from the start (inclusive) till the end-1 element. If the start or end element is negative i, it means the ith element from the end.
👍28❤2🥰2
1. What is the difference between the RANK() and DENSE_RANK() functions?
The RANK() function in the result set defines the rank of each row within your ordered partition. If both rows have the same rank, the next number in the ranking will be the previous rank plus a number of duplicates. If we have three records at rank 4, for example, the next level indicated is 7. The DENSE_RANK() function assigns a distinct rank to each row within a partition based on the provided column value, with no gaps. If we have three records at rank 4, for example, the next level indicated is 5.
2. Explain One-hot encoding and Label Encoding. How do they affect the dimensionality of the given dataset?
One-hot encoding is the representation of categorical variables as binary vectors. Label Encoding is converting labels/words into numeric form. Using one-hot encoding increases the dimensionality of the data set. Label encoding doesn’t affect the dimensionality of the data set. One-hot encoding creates a new variable for each level in the variable whereas, in Label encoding, the levels of a variable get encoded as 1 and 0.
3. Explain the Difference Between Tableau Worksheet, Dashboard, Story, and Workbook in Tableau?
Tableau uses a workbook and sheet file structure, much like Microsoft Excel.
A workbook contains sheets, which can be a worksheet, dashboard, or a story.
A worksheet contains a single view along with shelves, legends, and the Data pane.
A dashboard is a collection of views from multiple worksheets.
A story contains a sequence of worksheets or dashboards that work together to convey information.
4. How can you split a column into 2 or more columns?
You can split a column into 2 or more columns by following the below steps:
1. Select the cell that you want to split. Then, navigate to the Data tab, after that, select Text to Columns. 2. Select the delimiter. 3. Choose the column data format and select the destination you want to display the split. 4. The final output will look like below where the text is split into multiple columns.
5. Do you wanna make your career in Data Science & Analytics but don't know how to start ?
https://news.1rj.ru/str/sqlspecialist/94
Here is a complete roadmap from scratch that will make you technically strong enough to crack any Data Analyst and also learn Pro Career Growth Hacks to land on your Dream Job.
The RANK() function in the result set defines the rank of each row within your ordered partition. If both rows have the same rank, the next number in the ranking will be the previous rank plus a number of duplicates. If we have three records at rank 4, for example, the next level indicated is 7. The DENSE_RANK() function assigns a distinct rank to each row within a partition based on the provided column value, with no gaps. If we have three records at rank 4, for example, the next level indicated is 5.
2. Explain One-hot encoding and Label Encoding. How do they affect the dimensionality of the given dataset?
One-hot encoding is the representation of categorical variables as binary vectors. Label Encoding is converting labels/words into numeric form. Using one-hot encoding increases the dimensionality of the data set. Label encoding doesn’t affect the dimensionality of the data set. One-hot encoding creates a new variable for each level in the variable whereas, in Label encoding, the levels of a variable get encoded as 1 and 0.
3. Explain the Difference Between Tableau Worksheet, Dashboard, Story, and Workbook in Tableau?
Tableau uses a workbook and sheet file structure, much like Microsoft Excel.
A workbook contains sheets, which can be a worksheet, dashboard, or a story.
A worksheet contains a single view along with shelves, legends, and the Data pane.
A dashboard is a collection of views from multiple worksheets.
A story contains a sequence of worksheets or dashboards that work together to convey information.
4. How can you split a column into 2 or more columns?
You can split a column into 2 or more columns by following the below steps:
1. Select the cell that you want to split. Then, navigate to the Data tab, after that, select Text to Columns. 2. Select the delimiter. 3. Choose the column data format and select the destination you want to display the split. 4. The final output will look like below where the text is split into multiple columns.
5. Do you wanna make your career in Data Science & Analytics but don't know how to start ?
https://news.1rj.ru/str/sqlspecialist/94
Here is a complete roadmap from scratch that will make you technically strong enough to crack any Data Analyst and also learn Pro Career Growth Hacks to land on your Dream Job.
👍24❤2