Open in app

Sign in

Write

Sign in

Erdogan Taskesen
Erdogan Taskesen

2.3K Followers

Home

Lists

About

Published in

Towards Data Science

·Pinned

Chat with Your Dataset using Bayesian Inferences.

The ability to ask questions to your data set has always been an intriguing prospect. You will be surprised how easy it is to learn a local Bayesian model that can be used to interrogate your data set. — With the rise of chatGPT-like models, it has become accessible for a broader audience to analyze your own data set and, so to speak, “ask questions”. Although this is great, such an approach has also disadvantages when using it as an analytical step in automated pipelines. This is especially the…

Bayesian Inference

13 min read

Chat with Your Dataset using Bayesian Inferences.
Chat with Your Dataset using Bayesian Inferences.
Bayesian Inference

13 min read


Published in

Towards Data Science

·Pinned

A Step-by-Step Guide To Accurately Detect Peaks and Valleys.

Peak Detection is a challenging step in many applications. Read and learn how to accurately detect peaks and valleys in 1D vectors and 2D arrays (images). — Our human brain is excellent in peak detection in relation to its context. What seems an easy task by eye can be a challenging task to automate by machines. In general, peaks and valleys indicate (significant) events such as sudden increases or decreases in price/volume, or sharp rises in demand…

Peaks And Valleys

13 min read

A Step-by-Step Guide To Accurately Detect Peaks and Valleys.
A Step-by-Step Guide To Accurately Detect Peaks and Valleys.
Peaks And Valleys

13 min read


Published in

Towards Data Science

·Pinned

The Path to Success in Data Science Is About Your Ability to Learn. But What to Learn?

The chances of successfully delivering data science projects are greatest when you keep learning, but it’s not always clear what to focus on — Many great developments in data science have been made in the last decade but despite these achievements, many projects never see the light of day. As data scientists we must not only show strong technical skills but also understand the business context, effectively communicate with stakeholders, and translate their questions…

Data Science

7 min read

The Path to Success in Data Science Is About Your Ability to Learn. But What to Learn?
The Path to Success in Data Science Is About Your Ability to Learn. But What to Learn?
Data Science

7 min read


Published in

Towards Data Science

·Pinned

How to Find the Best Theoretical Distribution for Your Data

Knowing the underlying data distribution is an essential step for data modeling and has many applications, such as anomaly detection, synthetic data creation, and data compression. — Knowing the underlying (probability) distribution of your data has many modeling advantages. The easiest manner to determine the underlying distribution is by visually inspecting the random variable(s) using a histogram. With the candidate distribution, various plots can be created such as the Probability Distribution Function plot (PDF/CDF), and the QQ…

Probability Distributions

19 min read

How to Find the Best Theoretical Distribution for Your Data
How to Find the Best Theoretical Distribution for Your Data
Probability Distributions

19 min read


Published in

Towards Data Science

·Pinned

D3Blocks: The Python Library to Create Interactive and Standalone D3js Charts.

Create interactive, stand-alone, and visually attractive charts that are built on the graphics of d3 javascript (d3js) but configurable with Python. — Python has become one of the most popular programming languages to analyze and visualize your data. Visualizing can be the key to success in projects because it can reveal hidden insights in the data, and improve understanding. The best way to understand and explain the data is by making it…

Python

11 min read

D3Blocks: The Python Library to Create Interactive and Standalone D3js Charts.
D3Blocks: The Python Library to Create Interactive and Standalone D3js Charts.
Python

11 min read


Published in

Towards Data Science

·Oct 21

Detection of Multicollinearity in Data sets using Statistical Testing.

Detecting multicollinearity in data sets is an important step but also challenging. I will demonstrate how to detect variables with similar behavior in mixed data sets and how to deeper examine the relationships with interactive charts. — Understanding the strength of relationships between variables in a data set is important because variables with statistically similar behavior can affect the reliability of models. To remove the so-called multicollinearity we can use correlation measures for continuous variables. However, when we also have categorical variables and thus mixed data sets…

Python

11 min read

Detection of Multicollinearity in Data sets using Statistical Testing.
Detection of Multicollinearity in Data sets using Statistical Testing.
Python

11 min read


Published in

Towards Data Science

·Aug 26

The Next Step is Responsible AI. How Do We Get There?

Machine learning solutions take an important place in our lives. It is not only about performance anymore but also about responsibility. — In the last decades, many AI projects focused on model efficiency and performance. Results are documented in scientific articles, and the best-performing models are deployed in organizations. Now it is the time to put another important part into our AI systems; responsibility. The algorithms are here to stay and nowadays…

Data Science

12 min read

The Next Step is Responsible AI. How Do We Get There?
The Next Step is Responsible AI. How Do We Get There?
Data Science

12 min read


Published in

Towards Data Science

·Aug 13

Maximize Your Insights by Choosing the Best Chart: Network, Heatmap, or Sankey?

Beautiful visualizations are great but to maximize the interpretability, you need to choose a chart carefully. — Visualization is an important part of data analysis as it can transform data into insights and help you with storytelling. In this blog post, I will focus on Network charts, Heatmaps, and Sankey charts. These charts have the same input, but we should keep in mind that they are designed…

D3js

9 min read

Maximize Your Insights by Choosing the Best Chart: Network, Heatmap, or Sankey?
Maximize Your Insights by Choosing the Best Chart: Network, Heatmap, or Sankey?
D3js

9 min read


Published in

Towards Data Science

·Jul 17

Effectively Optimize Your Regression Model with Bayesian Hyperparameter Tuning

Learn to effectively optimize hyperparameters, and prevent creating overtrained models for XGBoost, CatBoost, and LightBoost — Gradient boosting techniques such as XGBoost, CatBoost, and LightBoost has gained much popularity in recent years for both classification and regression tasks. An important part of the process is the tuning of hyperparameters to gain the best model performance. The key is to optimize the hyperparameter search space together with…

Regression

15 min read

Effectively Optimize Your Regression Model with Bayesian Hyperparameter Tuning
Effectively Optimize Your Regression Model with Bayesian Hyperparameter Tuning
Regression

15 min read


Published in

Towards Data Science

·Jun 8

Create and Explore the Landscape of Roles and Salaries in Data Science

A tutorial on how to create a landscape using categorical data, and perform unsupervised analysis for deeper insights — The data science field is under constant development for which new roles and functions are created. The traditional data science role is evolving into tens of new roles, from Data Engineer, ML Engineer, Product Data Analyst, Research Scientist, Cloud Data Engineer, and many more. In this blog, we will load…

Clustering

14 min read

Create and Explore the Landscape of Roles and Salaries in Data Science
Create and Explore the Landscape of Roles and Salaries in Data Science
Clustering

14 min read

Erdogan Taskesen

Erdogan Taskesen

2.3K Followers

Machine Learning | Statistics | D3js visualizations | Data Science | Ph.D | erdogant.github.io

Following
  • Tim Denning

    Tim Denning

  • Anthony Alcaraz

    Anthony Alcaraz

  • TDS Editors

    TDS Editors

  • Tony Stubblebine

    Tony Stubblebine

  • Dr. Alessandro Crimi

    Dr. Alessandro Crimi

See all (21)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams