Techtalks: Data Science

About Data Science

Data is no longer a trend or a buzzword; it’s part of the growth strategy for every business. Machine Learning, Artificial Intelligence, Business Intelligence, Big Data – these tools help us analyze the past, understand the present, and forecast the future more accurately. Join the Techtalks community to explore these possibilities, find answers to your questions, and share your experience with others.
Frances Frazier

answered  Jan 22, 2025

Any idea?

Vivek Garg

asked  Oct 7, 2022

Environment: Windows Server 2008, Tableau Desktop 7.0, local MySQL database running under XAMPP.

When attempting to connect to a localhost MySQL database server using the native MySQL data connection in Tableau Desktop, I'm getting the following error:

"The connection to the data source might have been lost. Communication with the Tableau Protocol Server process was lost.

Unable to connect to the server "localhost". Check that the server is running and that you have access privileges to the requested database. Unable to connect to the server. Check that the server is running and that you have access privileges to the requested database."

When using "Other Databases (ODBC)" in Tableau, I am able to connect and return data. However, I would prefer to use the MySQL connection to avoid the limitations of the Other Databases connection.

I created a dedicated user in phpMyAdmin and also tried the root username and password, but the same error was returned. I also created a user DSN in odbcad32.exe and tried multiple versions of the MySQL ODBC driver (3.5, 5.2.7, 5.3.4).

Does anyone have any ideas of what might be causing the error? Really appreciate any help.

In addition, I am learning Tableau through a certification course.
 
Thank you in advance.
Casey Jacqueline

answered  Sep 15, 2024

Open your SQL query and navigate to line 62. Look for any references to Field1 at or near the 33rd character of that line. You should check if Field1 is referenced without specifying which table it belongs to. Ensure that all your field names are fully qualified with their respective table aliases.
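
As a rough illustration of the ambiguity described above, here is a minimal sketch using Python's built-in sqlite3 module. The table names (orders, customers) and the aliases o and c are hypothetical and chosen only for the demo; your own query and database engine will differ, and the exact error wording varies between engines.

```python
import sqlite3

# Hypothetical schema: two tables that both contain a column named Field1.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER, Field1 TEXT);
    CREATE TABLE customers (id INTEGER, Field1 TEXT);
    INSERT INTO orders VALUES (1, 'order-a');
    INSERT INTO customers VALUES (1, 'customer-a');
""")

def run(query):
    """Return the result rows, or the error message if the query fails."""
    try:
        return conn.execute(query).fetchall()
    except sqlite3.OperationalError as exc:
        return f"error: {exc}"

# Field1 exists in both joined tables, so the engine cannot tell which one is meant.
unqualified = run(
    "SELECT Field1 FROM orders o JOIN customers c ON o.id = c.id"
)

# Qualifying each field with its table alias removes the ambiguity.
qualified = run(
    "SELECT o.Field1, c.Field1 FROM orders o JOIN customers c ON o.id = c.id"
)

print(unqualified)
print(qualified)
```

In SQLite the first query fails with an "ambiguous column name" error, while the second returns one row with both values.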

Python Code Error

7.4K views
1 answer
0 votes
James Harry, Digital Trainer at igmGuru

answered  Oct 17, 2022

Hi guys,

CCSP Course

With the help of this CCSP online training certification course, develop your knowledge of cloud security architecture, design, applications, and operations. You will get step-by-step instructions and a comprehensive lesson plan that is simple to follow covering every aspect of the CCSP, which will help you further your career in the expanding sector of cloud security.

Kajal Digital

answered  Sep 14, 2023

The cost of Robotic Process Automation (RPA) certification can vary widely depending on several factors, including the certification provider, the level of certification, and your location. Here are some general guidelines:

  • Certification Provider: Different organizations and vendors offer RPA certifications. Some of the popular ones include UiPath, Blue Prism, Automation Anywhere, and Microsoft. Each of these providers may have their own pricing structure for certification exams and training materials.
  • Certification Level: RPA certifications often come in different levels, such as beginner, intermediate, and advanced. The cost of certification may increase with the level of expertise being tested. For example, an entry-level certification might be less expensive than an advanced certification.
  • Training Materials: In addition to the exam fee, you may need to invest in training materials, courses, and study resources to prepare for the certification. These additional costs can vary depending on the provider and your chosen method of study.
  • Location: The cost of certification exams can also vary by region. Some regions may have lower or higher exam fees based on local market conditions.
  • Promotions and Discounts: Some certification providers offer promotions, discounts, or bundle deals that can reduce the overall cost of certification.

To get the most accurate and up-to-date information on the cost of RPA certification, it's recommended to visit the official website of the certification provider you're interested in or contact their customer support. They will provide you with specific pricing details for the certification program you wish to pursue. Additionally, you can check if there are any community colleges, universities, or online platforms that offer RPA courses and certification at a lower cost compared to vendor-specific certifications.

 

 

oilu miun

answered  Nov 15, 2023

The amount of training data required for machine learning can vary depending on several factors, including the complexity of the task, the complexity of the model, and the quality of the data. In general, having more diverse and representative training data tends to improve the performance of machine learning models.

However, there is no fixed rule for the exact amount of training data needed. Some machine learning algorithms can achieve reasonable performance with a few hundred or thousand labeled examples, while others may require millions of examples. It also depends on the dimensionality of the data and the complexity of the patterns to be learned.
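
One practical way to judge whether you have enough data is to plot a learning curve: train on increasingly large subsets and watch where the accuracy levels off. The sketch below is only an illustration under stated assumptions: the data is synthetic (two overlapping 1-D Gaussian classes), the model is a simple nearest-centroid classifier, and the subset sizes are arbitrary.

```python
import random
import statistics

def make_data(n, rng):
    """Draw n labeled points: class 0 ~ N(0, 1), class 1 ~ N(2, 1)."""
    data = []
    for i in range(n):
        label = i % 2  # alternate labels so both classes are always present
        data.append((rng.gauss(2.0 * label, 1.0), label))
    return data

def fit_centroids(train):
    """Nearest-centroid 'training': the mean of each class."""
    return {
        label: statistics.fmean([x for x, y in train if y == label])
        for label in (0, 1)
    }

def accuracy(centroids, test):
    """Fraction of test points whose closest centroid matches their label."""
    correct = sum(
        1 for x, y in test
        if min(centroids, key=lambda c: abs(x - centroids[c])) == y
    )
    return correct / len(test)

rng = random.Random(0)  # fixed seed so the run is repeatable
test_set = make_data(2000, rng)

learning_curve = []
for size in (4, 20, 100, 500, 2000):
    train_set = make_data(size, rng)
    learning_curve.append((size, accuracy(fit_centroids(train_set), test_set)))

for size, acc in learning_curve:
    print(f"{size:>5} training examples -> accuracy {acc:.3f}")
```

For a simple, low-dimensional problem like this one the curve typically flattens after a modest number of examples; more complex models and higher-dimensional data usually need far more before it does.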

Embert Morton

answered  May 19, 2025

Yes, it’s definitely possible, especially if you use the labels to validate or evaluate the unsupervised model rather than to train it. Techniques like clustering can identify hidden structure in the features alone, and you can then compare the resulting clusters against the known labels. It’s not the traditional workflow, but it can surface patterns and anomalies you might otherwise overlook in a purely supervised setup.
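
Here is a minimal sketch of that idea in plain Python: cluster the points without looking at the labels, then compare each cluster's majority label with the known labels. The points, the labels, the starting centroids, and the choice of k-means with a purity score are all assumptions made for illustration.

```python
import math
from collections import Counter

def kmeans(points, centroids, iterations=10):
    """Plain Lloyd's algorithm; returns a cluster index for each point."""
    centroids = list(centroids)
    assignment = []
    for _ in range(iterations):
        # Assign every point to its nearest centroid.
        assignment = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == k]
            if members:
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return assignment

# Toy data: two well-separated groups; the last point carries a suspicious label.
points = [(0.0, 0.2), (0.3, -0.1), (-0.2, 0.1),
          (5.1, 4.9), (4.8, 5.2), (5.0, 5.0), (4.9, 5.1)]
labels = ["a", "a", "a", "b", "b", "b", "a"]

assignment = kmeans(points, [points[0], points[3]])

# Majority label per cluster, used only for evaluation, never for clustering.
majority = {
    k: Counter(l for l, a in zip(labels, assignment) if a == k).most_common(1)[0][0]
    for k in set(assignment)
}
purity = sum(majority[a] == l for a, l in zip(assignment, labels)) / len(labels)
suspects = [i for i, (a, l) in enumerate(zip(assignment, labels)) if majority[a] != l]

print(f"purity: {purity:.2f}, points that disagree with their cluster: {suspects}")
```

The point flagged in suspects sits inside the "b" cluster but is labeled "a", which is the kind of possible labeling error or anomaly this comparison can bring to light.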

 

 

 

 

What is data profiling in ETL?

7.6K views
3 answers
0 votes
malan sanni

answered  Feb 26, 2024

Data profiling in the context of ETL (Extract, Transform, Load) refers to the process of analyzing and understanding the structure, quality, and content of the source data before it is transformed and loaded into a target system. Data profiling helps to uncover inconsistencies, anomalies, and patterns within the data, and it provides insights that are crucial for designing effective ETL processes.
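
To make this concrete, here is a small sketch of a profiling pass over a source extract, using only the Python standard library. The sample CSV, the column names, and the particular checks (missing values, distinct values, non-numeric values, repeated values) are assumptions for illustration; dedicated profiling tools report far more.

```python
import csv
import io
from collections import Counter

# Hypothetical source extract with a few typical quality problems.
SAMPLE = """id,country,amount
1,US,10.5
2,us,
3,DE,7
3,DE,abc
"""

def is_number(value):
    """True if the text can be parsed as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False

def profile(rows):
    """Collect simple per-column statistics for a list of dict rows."""
    report = {}
    for column in rows[0].keys():
        values = [row[column] for row in rows]
        non_empty = [v for v in values if v != ""]
        numeric = [v for v in non_empty if is_number(v)]
        report[column] = {
            "rows": len(values),
            "missing": len(values) - len(non_empty),
            "distinct": len(set(non_empty)),
            "non_numeric": len(non_empty) - len(numeric),
            "repeated": sum(c - 1 for c in Counter(non_empty).values() if c > 1),
        }
    return report

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
report = profile(rows)
for column, stats in report.items():
    print(column, stats)
```

On this sample the report points to a repeated id, inconsistent casing in country (US and us count as distinct values), and one missing and one non-numeric amount. Each finding would feed into a transformation or validation rule in the ETL design.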

chicken plusone

answered  Aug 24, 2023

Yes, there are several image recognition APIs and libraries that are either free to use (within limits) or open source:

Clarifai - A hosted image and video recognition API. It is a commercial service rather than open source, but it has a free tier.

Amazon Rekognition - Amazon's image and video analysis service. Has a free tier that offers 5,000 transactions per month.

Microsoft Computer Vision - Provides image analysis and description. The free tier offers 5,000 transactions per month.

Google Cloud Vision API - Google's image recognition service. The free tier offers 1,000 requests per month.

OpenCV - A popular computer vision and machine learning software library. Can be used to build custom image recognition models and applications.

TensorFlow/Keras - Deep learning frameworks that can be used with transfer learning to build image classifiers. Free to use and open source.

IBM Watson Visual Recognition - IBM's visual recognition service. The free tier offers 50 API calls per month.

Alteryx Computer Vision - A library of computer vision tools for tasks like object detection, image classification etc. Open source.

DeepDetect - A framework for deep learning microservices including image recognition. Free for non-commercial use.

Cloudsight - Provides image recognition via a REST API. The free tier allows 1,000 API calls per month. 

So in summary, hosted services such as Clarifai and Amazon Rekognition offer free tiers, while OpenCV and libraries like TensorFlow are the most full-featured open source options.

Akash Nager

answered  Jan 17, 2024

OLAP (Online Analytical Processing) databases and relational databases are both types of database systems, but they serve different purposes and have distinct characteristics.

  1. Data Model:

    • Relational Database: Relational databases are based on the relational model, which organizes data into tables with rows and columns. Each table typically has a primary key, and relationships between tables are established using foreign keys.

    • OLAP Database: OLAP databases use a multidimensional data model. Instead of tables, they organize data into dimensions and measures. Dimensions represent the characteristics or categories of data, and measures represent the data values.

  2. Purpose:

    • Relational Database: Relational databases are designed for transactional processing (OLTP - Online Transaction Processing). They are optimized for efficient insertion, updating, and retrieval of individual records.

    • OLAP Database: OLAP databases are designed for analytical processing (OLAP - Online Analytical Processing). They are optimized for complex queries and aggregations on large datasets to support business intelligence and decision-making.

  3. Query Performance:

    • Relational Database: Relational databases excel in handling simple queries involving individual records or small sets of records. They may struggle with complex analytical queries or aggregations on large datasets.

    • OLAP Database: OLAP databases are specifically designed for complex queries and aggregations. They can efficiently handle multidimensional queries, making them suitable for analytical tasks.

  4. Data Aggregation:

    • Relational Database: Aggregations in relational databases can be computationally expensive, especially when dealing with large datasets, as they involve scanning and summarizing numerous records.

    • OLAP Database: OLAP databases are optimized for data aggregation. They store pre-aggregated data at different levels of granularity, allowing for faster retrieval of summarized information.

  5. Schema:

    • Relational Database: Relational databases use a schema with predefined tables and relationships. The schema is typically normalized to eliminate redundancy.

    • OLAP Database: OLAP databases often use a star or snowflake schema. Data is organized into a central fact table (containing measures) surrounded by dimension tables (containing attributes). This allows for efficient querying and aggregation along dimensions.

In summary, while both OLAP and relational databases store and manage data, their design and optimization are geared toward different types of applications. Relational databases are well-suited for transactional processing, while OLAP databases are designed to efficiently handle complex analytical queries and aggregations for business intelligence purposes.
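
The star schema and pre-aggregation points above can be sketched with Python's built-in sqlite3 module. This is only an illustration: SQLite is a relational engine, not an OLAP server, and the table names (fact_sales, dim_product, dim_date, agg_sales) and sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension tables hold descriptive attributes.
    CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
    CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER);
    -- The central fact table holds the measures plus keys into each dimension.
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        date_id INTEGER REFERENCES dim_date(date_id),
        amount REAL
    );
    INSERT INTO dim_product VALUES (1, 'books'), (2, 'games');
    INSERT INTO dim_date VALUES (1, 2023), (2, 2024);
    INSERT INTO fact_sales VALUES
        (1, 1, 10.0), (1, 2, 20.0), (2, 2, 5.0), (2, 2, 15.0);
""")

# Aggregating along two dimensions means scanning and joining the fact table.
ROLLUP = """
    SELECT p.category, d.year, SUM(f.amount) AS total
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_date d ON d.date_id = f.date_id
    GROUP BY p.category, d.year
    ORDER BY p.category, d.year
"""
on_the_fly = conn.execute(ROLLUP).fetchall()

# Storing the summary once mimics the pre-aggregation an OLAP system maintains.
conn.execute("CREATE TABLE agg_sales AS " + ROLLUP)
precomputed = conn.execute(
    "SELECT category, year, total FROM agg_sales ORDER BY category, year"
).fetchall()

print(on_the_fly)
print(precomputed)
```

Both queries return the same totals, but the second reads a small summary table instead of joining and scanning every fact row, which is the trade-off (extra storage and refresh work in exchange for faster reads) that pre-aggregation makes.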