Open in app

Sign in

Write

Sign in

Alvaro Matsuda
Alvaro Matsuda

48 Followers

Home

Lists

About

Published in

Artificial Intelligence in Plain English

·Oct 25

Building Complex ML Pipelines

Introduction A machine learning project have several steps to build the final model from preprocessing data, feature engineering, feature selection to training the model and making it ready to make predictions. Each one of these steps have their own complexity and sometimes your code can be unorganized and messy. Even worse…

Machine Learning

8 min read

Building complex ML pipelines
Building complex ML pipelines
Machine Learning

8 min read


Jun 20

Spatial Clustering

Introduction Spatial clustering is a valuable technique that can be used in various fields such as urban planning, ecology, and data analysis. It helps us identify patterns, group similar data points together, and gain insights into spatial distributions. In this article, we will explore how to perform spatial clustering using, scikit-learn…

Spatial Analysis

9 min read

Spatial Clustering
Spatial Clustering
Spatial Analysis

9 min read


May 10

NLP (Part 2): Feature Extraction

Introduction Continuing on my previus post, in this second part I will focus on techniques of feature extraction for text data. Feature extraction are techniques that encode text data into numerical data such that machine learning models can interpret it. Techniques covered: Bag of words (BoW); Term Frequency Inverse Document Frequency…

NLP

7 min read

NLP (Part 2): Feature Extraction
NLP (Part 2): Feature Extraction
NLP

7 min read


Apr 6

NLP (Part 1): Preprocessing text data

Introduction Text is everywhere and it can give us many insights about a company, product and service. However, extracting such insights is not an easy task. Text data is an unstructured data and, as the name suggest, we have to process it to a more “structured” form to be able to…

NLP

6 min read

NLP: Preprocessing text data (Part 1)
NLP: Preprocessing text data (Part 1)
NLP

6 min read


Mar 8

Exploratory Spatial Data Analysis (ESDA): Spatial Autocorrelation

Introduction In data science, an EDA (Exploratory Data Analysis) is a fundamental step of a project. It gives you better understanding of the data you are working with. It helps to identify outliers and extract insights through hypothesis testing. …

Spatial Analysis

10 min read

Exploratory Spatial Data Analysis (ESDA): Spatial Autocorrelation
Exploratory Spatial Data Analysis (ESDA): Spatial Autocorrelation
Spatial Analysis

10 min read


Feb 7

Aggregating information of two overlaying GeoDataFrames using GeoPandas with code example

Introduction In this post I will show how to aggregate information from two different GeoDataFrames (layers) using GeoPandas. I will show one of many ways that we can do that with code example. But first, let’s see a situation when we might need to do this kind of operation. Obs 01…

Data Science

10 min read

Aggregating information of two overlaying GeoDataFrames using GeoPandas with code example
Aggregating information of two overlaying GeoDataFrames using GeoPandas with code example
Data Science

10 min read


Jan 6

Spatial Data Science: 3 main data structures for GeoSpatial data

Introduction It is said that wherever that is data, there is work for data scientist. And in the geospatial field that are lots of data being generated that can be analysed to extract insights from. As Sergio J. Rey, Dani Arribas-Bel and Levi J. …

Data Science

9 min read

Spatial Data Science: 3 main data structures for GeoSpatial data
Spatial Data Science: 3 main data structures for GeoSpatial data
Data Science

9 min read


Sep 28, 2021

Learning to Rank — Another use of Machine Learning

When we think of problems that machine learning can solve, the first thing that comes to our minds are classification, regression, NLP, clustering. We rarely see someone talking about Learning to Rank. So in this post you are going to see: What is Learning to Rank; How I applied Learning…

Learning To Rank

4 min read

Learning to Rank — Another use of Machine Learning
Learning to Rank — Another use of Machine Learning
Learning To Rank

4 min read


Jun 3, 2021

My first Data Science project and what I’ve learned with it.

Hi everyone! As my first data science project I’ve learned a lot doing it and I would like to share some things, as it may help others and is a way to register my progress. This post do not have any technical knowledge and is not the intend of it…

Data Science

10 min read

My first Data Science project and what I’ve learned with it.
My first Data Science project and what I’ve learned with it.
Data Science

10 min read

Alvaro Matsuda

Alvaro Matsuda

48 Followers

Writing about Geospatial Data Science, AI, ML, Python, GIS

Following
  • Markus Stoll

    Markus Stoll

  • Ransaka Ravihara

    Ransaka Ravihara

  • Bryan R. Vallejo

    Bryan R. Vallejo

  • Sutan Mufti

    Sutan Mufti

  • Sebastiao Ferreira de Paula Neto

    Sebastiao Ferreira de Paula Neto

See all (12)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams