Programando sem cafeína

Guide: Installing many pythons in Ubuntu without uv or pyenv

2026-01-25T15:27:00.003-03:00

Run the commands to install:

sudo add-apt-repository ppa:deadsnakes/ppa
sudo apt update
sudo apt install python3.9 python3.10 python3.11

After that, configure the uptade-alternatives:

sudo update-alternatives --install /usr/bin/python python /usr/bin/python3.9 9
sudo update-alternatives --install /usr/bin/python python /usr/bin/python3.10 10
sudo update-alternatives --install /usr/bin/python python /usr/bin/python3.11 11

Then to use, just choose the default, choose the alternative:

sudo update-alternatives --config python

At the end, if you want to use venv in any project, just create the virtual env like this:

python3.xx -m venv .venv
source .venv/bin/activate

If the first command raises error, maybe you need to install:

sudo apt install python3.xx-venv

Solução para: Laptop Vaio FE16 Linux sem Wifi

2026-01-25T08:49:00.001-03:00

O notebook Vaio FE16 com Linux vem com um Policorp Linux (Linux Policorp 6.10.8-policorp-amd64) e o Wifi não consegue ser identificado. Antes de formatar e colocar um Ubuntu mais recente eu quis pesquisar se eu conseguiria resolver o problema via software ou se seria um problema de hardware. Depois de muito pesquisar e tentar corrigir o driver da placa de rede RTL8852BE através de instalações de drivers de diversos tipo, eu desisti e resolvi fazer uma restauração do sistema através do menu de boot. Após a restauração o wifi foi identificado.

Então é isso: Restaure o sistema.

Solucionado eu pude instalar o Ubuntu 24.04. Inclusive, eu poderia logo de inicio ter testado no Ubuntu Live se ele conectaria, pois sim, ele conectou.

Deno "Error: illegal value for flag --max-old-space-size=4096 --expose-gc of type size_t"

2025-12-27T15:27:00.001-03:00

I'm using an old version of Deno (version 1.38.4) and when trying to use the --v8-flags= parameter on the command line, with more than one flag, I get this error:

Error: illegal value for flag --max-old-space-size=4096 --expose-gc of type size_t

The solution is trivial: Just use comma:

example:

--v8-flags="--flag-1=xpto,--flag-2"

How to exclude fields from pydantic submodels model_dump()

2025-10-22T15:16:00.003-03:00

Everybody knows that using .model_dump() with exclude parameter we can exclude some attributes from the output. But if your model contains attributes that are other pytdantic models, and you want to hide some attributes of this submodel, the best solution, that are not so explicit in the pydantic documentation is define the attribute with annotated to be excluded, like this:

from pydantic import BaseModel, Field
from typing import Annotated

class MySubmodel(BaseModel):
not_hidden_attr: str
hidden_attr: Annotated[str, Field(exclude=True)]

class MyModel(BaseModel)
attr_of_submodel: MySubmodel

Solution to "ModuleNotFoundError: No module named '_cffi_backend'" on python

2025-09-25T14:59:00.004-03:00

Got this error executing grothbook lib on python3.11

To solve:

> pip install --upgrade cffi cryptography

Solution to "TypeError: error sending request for url (https://...) error trying to connect: invalid peer certificate: UnknownIssuer" in Deno

2025-07-07T15:21:00.002-03:00

This error occurs for me when running Deno locally. My Deno version is older, 1.38.4:

Example error:

error: (in promise) TypeError: error sending request for url (https://esm.sh/dd-trace@5.36.0&pin=v135&no-dts/package.json): error trying to connect: invalid peer certificate: UnknownIssuer

const response = await fetch("https://esm.sh/dd-trace@5.36.0&pin=v135&no-dts/package.json");

Solution, export this environment variable:

DENO_TLS_CA_STORE=system

Solution to "ImportError: cannot import name 'GrowthBookClient' from 'growthbook'"

2025-05-13T09:44:00.006-03:00

If you're using version 1.2.1 of the GrowthBook Python library, you may have seen in the documentation on GitHub or PyPI that the GrowthBookClient should be imported like this:

from growthbook import GrowthBookClient

However, this results in the following error:

api/feature_flag_service.py", line 1, in <module>
from growthbook import GrowthBookClient, Options, UserContext
ImportError: cannot import name 'GrowthBookClient' from 'growthbook'

The fix is simple: instead of following the documentation, use the correct import path based on the actual library structure:

from growthbook.growthbook_client import GrowthBookClient

Solution to sqlx prepare error: failed to lookup address information: Name or service not known

2025-05-01T10:28:00.003-03:00

I was getting the error "failed to lookup address information: Name or service not known" when running the command cargo sqlx prepare. The error seemed obvious — my DATABASE_URL environment variable was probably incorrect. However, after checking everything and confirming that it was correct (ad-hoc connections were working, and even the service startup connected successfully), I figured out the issue:

My service required additional environment variables to start. Without these env vars, it wouldn't start and would return an error. It turns out that when running cargo sqlx prepare without those variables, the error gets silently swallowed, and the database connection simply fails.

After supplying the same environment variables I use for cargo run to cargo sqlx prepare, the problem was resolved.

Solution to failed to acquire username/password from local configuration on rust git libs

2025-03-21T18:01:00.001-03:00

Let's say you need to work on a Rust project that depends on other libraries hosted in a private Git repository. This project uses the repository's HTTPS URL. For example:

your-lib = { git = "ssh://git@github.com/your-company/your-lib.git"}

Then you try to start the project and encounter the following error:

$ cargo run

Blocking waiting for file lock on package cache

Updating crates.io index

Updating git repository `https://github.com/your-company/your-lib.git`

error: failed to get `outbox-pattern-processor` as a dependency of package `anti-fraud-service v0.1.0 (/project/your-project)`

Caused by:

failed to load source for dependency `your-lib`

Caused by:

Unable to update https://github.com/your-company/your-lib.git?tag=v1.0.0#49efa247

Caused by:

failed to fetch into: /home/tiago.motta/.cargo/git/db/your-lib-31d59910066df49a

Caused by:

revision 49efa2476e2613cf809315b4b0abf16f079b5dcb not found

Caused by:

failed to authenticate when downloading repository

* attempted to find username/password via git's `credential.helper` support, but failed

if the git CLI succeeds then `net.git-fetch-with-cli` may help here

https://doc.rust-lang.org/cargo/reference/config.html#netgit-fetch-with-cli

Caused by:

failed to acquire username/password from local configuration

The solution is simple. Just edit your global .gitconfig to replace HTTPS URLs with SSH by adding the following lines:

[url "ssh://git@github.com/"]

insteadOf = https://github.com/

What is the best rust lib to post custom metric on datadog?

2025-03-10T17:49:00.002-03:00

The best library for recording custom metrics in Datadog using Rust is dogstatsd (https://crates.io/crates/dogstatsd).

The alternative statsd (https://crates.io/crates/statsd) does not allow sending tags. There is even a fork, datadog-statsd (https://crates.io/crates/datadog-statsd), that enables tag sending. However, both define the client without deriving Clone, which prevents using this client as with_state in Axum, resulting in an error similar to this:

the trait bound `...` is not satisfied

the trait `Clone` is not implemented for `...`rustcClick for full compiler diagnostic

method_routing.rs(168, 16): required by a bound in `post`

....rs(6, 1): consider annotating `...` with `#[derive(Clone)]`: `#[derive(Clone)]

the trait bound `...` is not satisfied

the trait `Clone` is not implemented for `...`rustcClick for full compiler diagnostic

method_routing.rs(168, 16): required by a bound in `post`

....rs(6, 1): consider annotating `...` with `#[derive(Clone)]`: `#[derive(Clone)]

the trait bound `fn(axum::extract::State<...>, Json<...>) -> impl futures::Future<Output = Result<hyper::Response<axum::body::Body>, AppError>> {...::handler}: Handler<_, _>` is not satisfied

the following other types implement trait `Handler<T, S>`:

`Layered<L, H, T, S>` implements `Handler<T, S>`

`MethodRouter<S>` implements `Handler<(), S>`rustcClick for full compiler diagnostic

....rs(55, 32): required by a bound introduced by this call

method_routing.rs(166, 16): required by a bound in `post`

the trait bound `fn(axum::extract::State<...>, axum::Json<...>) -> impl futures::Future<Output = Result<hyper::Response<axum::body::Body>, ...>> {...}: Handler<_, _>` is not satisfied

the following other types implement trait `Handler<T, S>`:

`Layered<L, H, T, S>` implements `Handler<T, S>`

`MethodRouter<S>` implements `Handler<(), S>`rustcClick for full compiler diagnostic

....rs(55, 32): required by a bound introduced by this call

method_routing.rs(166, 16): required by a bound in `post`

axum::routing::method_routing

pub fn post<H, T, S>(handler: H) -> MethodRouter<S, Infallible>

where

H: Handler<T, S>,

T: 'static,

S: Clone + Send + Sync + 'static,

H = fn handler(State<...>, …) -> …, S = ...

Solution to failed to run custom build command for `openssl-sys v0.9.104`

2025-02-18T16:11:00.010-03:00

When upgrading "reqwests" library I got the following error on docker build:

#18 182.8 The following warnings were emitted during compilation:
#18 182.8
#18 182.8 warning: openssl-sys@0.9.104: Could not find directory of OpenSSL installation, and this `-sys` crate cannot proceed without this knowledge. If OpenSSL is installed and this crate had trouble finding it, you can set the `OPENSSL_DIR` environment variable for the compilation process. See stderr section below for further information.
#18 182.8
#18 182.8 error: failed to run custom build command for `openssl-sys v0.9.104`
#18 182.8
#18 182.8 Caused by:
#18 182.8 process didn't exit successfully: `/usr/app/target/release/build/openssl-sys-ff7852766e78b685/build-script-main` (exit status: 101)
#18 182.8 --- stdout
#18 182.8 cargo:rustc-check-cfg=cfg(osslconf...

Could not find the solution after googling, but I tried the obvious solution, install openssl-dev into the docker image:

FROM rust:1.82-alpine AS builder
RUN apk add --no-cache openssl-dev

After this I got the following error:

x86_64-alpine-linux-musl/bin/ld: cannot find -lssl: No such file or directory
x86_64-alpine-linux-musl/bin/ld: cannot find -lcrypto: No such file or directory

And the fix for this was adding the openssl-libs-static:

FROM rust:1.82-alpine AS builder
RUN apk add --no-cache openssl-dev openssl-libs-static

Fixed

Solution to cargo-tarpaulin and missing `GLIBC_2.38' not found

2025-01-03T11:05:00.005-03:00

Using cargo-tarpaulin 0.31.4 using binstall on Ubuntu 22.04 may result in errors like the following:

cargo-tarpaulin: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.38' not found (required by cargo-tarpaulin)
cargo-tarpaulin: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.39' not found (required by cargo-tarpaulin)

I resolved the issue by avoiding binstall. Instead of running:

I resolved the issue by avoiding binstall. Instead of running

> cargo binstall cargo-tarpaulin

I used the following command:

> cargo insatall cargo-tarpaulin

While this installation method is significantly slower, it successfully resolves the issue.

I suspect similar problems could arise with other Rust libraries when using binstall.

The non-experimentation result anti-pattern

2019-03-12T23:00:00.001-03:00

Many of us have gone through the scenario of putting new software functionality into production and seeing tremendous improvement in business metrics. These results are often celebrated and attributed to this new achievement and this is used in presentations and corporate bonuses definitions.

But can we attribute causal relation between the new functionality and the new business metrics?

Before answering this question, however difficult it may be to abandon the emotive side of the functionality we have created, let us try to imagine some hypothesis that might have caused this improvement other than our change:

An advertising campaign
An unexpected viral
The functionality created by another squad
A concurrent product crashes
It is a normal seasonal movement

These are just a few assumptions that could cause improvement in metrics. It is possible to mitigate each of them through some data analysis. But can you anticipate all other possible causal assumptions to be able to rule them out? Difficultly.

Since you can not assign causality between the two events, the most you can say is that there is a correlation between them. And yet it may be a spurious correlation. Some examples of this kind of correlations can be seen on this site here. It speaks, for example, about the strong correlation between the number of Nicolas Cage films per year and the number of drownings in the United States per year.

So the answer to the initial question is no! It is not correct to attribute the improvement of the business metric just because of its new functionality. If you want to check whether a new feature causes a change in some metric, an AB experiment is required.

In an AB experiment we have a variant (or alternative) called control (usually A) and a variant containing the modification to be validated (B). A random sample of users will receive the version of their product with variant A, and another random sample of users will see variant B. Everything in the two variants must be the same, except the modification made in variant B. With this you can control the environment and all those possible causes we reported above and more the others hypothesis we could not predict before.

At the end of the experiment we ate going to be able to attribute the causality of the new functionality to the incremental metric because it was the only thing different in the whole environment. Taking into account of course all statistical variance due to the use of a sample, which can lead us to possible false positives in a minimal percentage of cases.

AB experiments are the most accepted technique currently for you to determine the causality of a change in your product. They are not a new thing and are widely used as a part of the scientific method.

However there is currently a large discussion in the statistical area regarding causal inferences, which would be models for inferring cause from one event to another without necessarily an experiment controlling all variables. This discussion became even stronger after the publication of The Book of Why.

However, if you do not master the techniques of causal inference, it is best not to disclose cause and effect without being sure. I know it's very hard for us to leave our emotions aside for having participated in the development of that piece of software. But it's important to stay cool. And of course, always do AB experiments before putting something into production.

This post is part of a series about Experimentation Anti-Patterns, a talk that I presented recently.

Spark "first" function behavior on pandas dataframe

2019-01-13T21:51:00.000-03:00

Spark first function is used to choose a value after aggregating some dataset value.

In python pandas we don't have this behavior as default after aggregating some dataframe, but we can do it easily we a few lines of code.

Since I could not find this solution on Stack Overflow. But first, let's see what happen with a column with string type when we do not use a function like firtst:

import pandas as pd
df = pd.DataFrame.from_records([
       dict(k=1, i=10, t="a"),
       dict(k=1, i=20, t="b"),
       dict(k=1, i=20, t="c"),
])
df.groupby("k", as_index=False).sum()

If we run the code bellow, we get this result:

  k i
0 1 50

You can see that the column t was removed since you cannot sum it.

Now, let's add the first aggregation function to this column:

first = lambda a: a.values[0] if len(a) > 0 else None
df.groupby("k", as_index=False).agg({'i': sum, 't': first})

If we run the code bellow, we get this result:

  k i  t
0 1 50 a

Solved

"Learning to ranking" with xCLiMF python implementation

2017-09-03T12:35:00.000-03:00

At Globo.com we try to optimize our personalized recommendation providing better content in a ranked list. We have a hybrid system that can automatically essemble collaborative filtering, content based and non personalized algorithms.

But none of our implemented algorithms are intented to rank. Our best collaborative filtering algorithm, based on the paper Collaborative Filtering for Implicit Feedback Datasets, doesn't optimizes a ranking metric like MRR or NDCG.

Even so we have good results of those algorithms choosing the best hyper parameters focusing in MRR, we thought we could get better results using algorithms focusing in optimizing a ranking metric: Our first hypothesis is xCLiMF.

xCLiMF is a latent factor collaborative filtering variant wich optimizes a lower bound of the smoothed reciprocal rank of "relevant" items in ranked recommendation list. It is a evolution o CLiMF algorithm that can handle using multiple levels of relevance data instead of using only unary user preferences.

Implementation references

Since xCLiMF is very similar to CLiMF, I implemented xCLiMF based on this python CLiMF repository ( https://github.com/gamboviol/climf ). During the process I found an error and submitted a pull request ( https://github.com/gamboviol/climf/pull/2 ).

I have also found a C++ xCLiMF implementantion by other brazilian guy ( https://github.com/gpoesia/xclimf ). But this solution have two problems: The first one is a bug reported here ( https://github.com/gpoesia/xclimf/issues/1 ) and the second one is related to performance, since it updates user and item vectors separately as two loops of other two chained loops.

My implementation

So my implementation also have some differences from original papers:

In the paper they don't mention any kind of previously normalization, but it's always a good practice to do it for machine learning algorithms input data. So I normalized dividing all rating by the maximum rating in the dataset. Using original ratings lead to some math errors:

Exponential function of large values causes math range error;
Exponential function of very large negative numbers get 0, and lead to some division by zero error;
Log function of negative numbers, caused by large exponentials, causes math domain error

The paper experimentation setup uses N random ratings by user in training and in the cross validation data. Even so I have implemented this way, I have put an option to select 2N top ratings by user, randomly distributed in training and cross validation. For me, this last protocol reflected a more realistic kind of problem that xCLiMF is going to solve.
In the original paper they described the algorithm as two chained loops. Firstly I implemented as described in the paper. But them I updated the code to use a more vectorized solution, having a 54% improvement in the performance. I maintained the chained loop solution for comparison purpose ( https://github.com/timotta/xclimf/blob/master/tests/looping_update.py )
For the objective function, the paper describes that it's not necessary to divide by the number of users, because it is only used for validating if the gradient ascending is still increasing after many interactions. But I made this division for interpretability purposes.

Results

Using the experimentation protocol of N random ratings by user in training and in the cross validation, applied to the MovieLens 1M dataset, I got MRR variying from 0.08 to 0.09 at the 50 iteration using the hyper-parameters described in the paper.

Using the top K experimentation protocol in the same dataset I got MRR varying from 0.2 to 0.26. I tried those two experimentation protocol with the Alternative Least Squares implemented at Spark and the maximum MRR I got was 0.01.

The sequence of experiments, and how to run can be seen at https://github.com/timotta/xclimf

Conclusions and future

Now we have a working xCLiMF implementation in python, but it is not feasible to train it with Globo.com dataset. Running it using MovieLens 1M dataset that have 6k users took 4 minutes. With MovieLens 20M dataset that have 130k users took more than one hour. Globo.com datasets are much bigger than that, if we count only the logged users, our dataset of users preferences on shows has more than 4 millions users, it would take more than 40 hours to run.

That's why my next step is implementing this same algorithm on Spark. Since the updating user by user step is independent, it is possible to scale it running in parallel on our hadoop cluster.

I appreciate any help, correction, suggestion or criticism

References

CLiMF: Learning to Maximize Reciprocal Rank with Collaborative Less-is-More Filtering
Yue Shi, Martha Larson, Alexandros Karatzoglou, Nuria Oliver, Linas Baltrunas, Alan Hanjalic
ACM RecSys 2012

xCLiMF: Optimizing Expected Reciprocal Rank for Data with Multiple Levels of Relevance
Yue Shia, Alexandros Karatzogloub, Linas Baltrunasb, Martha Larsona, Alan Hanjalica
ACM RecSys 2013

Collaborative Filtering for Implicit Feedback Datasets
Yifan Hu, Yehuda Koren, Chris Volinsky
IEEE, 2008

Dados abertos do Web Democracia

2015-11-28T19:23:00.001-03:00

Nas minhas conversas com os participantes do RecSys de 2015 em Viena foi possível notar o esgotamento deles em relação aos dados existentes para análise. Os acadêmicos longe de grandes empresas como Linkedin, Google, Facebook e Netflix, estavam sempre em busca de mais dados para tunar e implementar novos algoritmos. Em resumo, ninguém aguenta mais o MovieLens.

Por essa razão resolvi disponibilizar todos os dados de avaliações de políticos no Web Democracia publicamente.

Para garantir a privacidade dos usuários utilizei a total aleatorização dos IDs de forma a evitar qualquer engenharia reversa, deixando então as avaliações anônimas.

Os dados podem ser baixados aqui.

Agora então os quase meio milhão de avaliações de políticos do Web Democracia é mais uma fonte de informação e de um domínio bem diferente que os tradicionais livros e filmes.

Meu primeiro aplicativo Android

2013-10-14T23:29:00.003-03:00

Publiquei essa semana no Google Play meu primeiro aplicativo Android. O 9Gag Offline trata-se de um aplicativo que permite que você baixe os gags do site 9Gag para momentos em que não haja conexão com internet. Por exemplo se você for viajar, no avião você ficará isolado do mundo mas poderá rir bastante com os gags que estarão guardados no seu celular.

Já fazia mais ou menos uns seis meses que estava estudando esporadicamente o sistema operacional da Google e fazendo diversos testes. A um mês atrás resolvi fechar de vez o aplicativo e colocá-lo ao público. Deu pra aprender bastante sobre o framework e reviver os problemas de sincronia e concorrência que haviam na época em que programava para desktop com Delphi, Kylix e Java Swing.

Espero que gostem e se acharem qualquer problema ou tiverem alguma sugestão é só avisar. Para baixar o 9Gag Offline basta clicar aqui e ir ao Google Play.

Plugin de Fonética Portuguesa para ElasticSearch

2013-07-25T15:22:00.000-03:00

O ElasticSearch possui um plugin fonético que abrange diversas regras e padrões. No entanto nenhuma dessas regras é boa o bastante para as peculiaridades da lingua portuguesa.

Pensando nisso e de posse de uma gramática portuguesa, forkei o repositório e criei um encoder com as regras de nosso estimado idioma lusitano. Como alguns meses depois ainda não obtive resposta alguma da equipe do plugin original, promovi então este encoder a um novo plugin: Portuguese Phonetic Plugin.

Para utilizá-lo você precisa clonar o projeto no github e instalá-lo:

git clone https://github.com/timotta/elasticsearch-fonetica-portuguesa.git
cd elasticsearch-fonetica-portuguesa
./script/install.sh ~/Programas/elasticsearch-0.20.5

Depois é preciso configurar um analyser em config/elasticsearch.yml:

index :
    analysis :
        analyzer :
            fonetico :
                type : custom
                tokenizer : standard
                filter : [ standard, lowercase, foneticaportuguesa_filter, asciifolding ]
        filter :
            foneticaportuguesa_filter:
                type : foneticaportuguesa
                replace : false

Com tudo configurado, você pode criar um novo indice usando este analyser, como é mostrado abaixo:

$ curl localhost:9200/comfonetica -XPUT -d '
{
  "mappings" :{
     "document" : {
        "analyzer" : "fonetico"
     }
   }    
}'

Com o indice criado e configurado é possivel verificar as transformações de texto da seguinte forma:

$ curl -XGET 'localhost:9200/comfonetica/_analyze?analyzer=fonetico&pretty=true' -d chiclete
{
  "tokens" : [ {
    "token" : "XICLETE",
    "start_offset" : 0,
    "end_offset" : 8,
    "type" : "",
    "position" : 1
  }, {
    "token" : "chiclete",
    "start_offset" : 0,
    "end_offset" : 8,
    "type" : "",
    "position" : 1
  } ]

Repare que a palavra se transformou em duas. Isso acontece porque a configuração replace do filter está como false.

Atualmente só testei o plugin com a versão 0.20.5 do elasticsearch, se testarem em outras versões peço que reporte no github. Além disso, nem todas as regras fonéticas foram implementadas, então se você precisar de alguma que esteja faltando, colabore ou solicite lá também.

Complexidade Ciclomática com Pygenie recursivo

2013-07-16T00:20:00.001-03:00

Pygenie é uma biblioteca python bem simples para calcular a complexidade ciclomática de seus método. É tão simples, mas tão simples que para projetos mais complexos é preciso scriptar um pouco para que todos os diretórios sejam analisados.

Para facilitar então a nossa vida aqui na empresa, fiz uma contribuição (com pouca esperança de ser aceita pois o projeto está parado a dois anos) para que seja possível obter os resultado de forma recursiva. Com esta contribuição é possivel informar o parâmetro -r que analisará recursivamente todos os arquivos .py dentro de um diretório.

$ pygenie complexity mylib -r

Se dentro de seu projeto houver algum arquivo ou diretório que você não deseja que seja analisado você pode utilizar um outro parâmetro adicionado, o -e, onde você informar um pattern para ser excluído. Por exemplo se houver um diretório de testes:

$ pygenie complexity mylib -r -e tests

Caso você deseje utilizar desde já pode instalar o pygenie através do meu fork e neste meio tempo tentar incentivar nosso amigo Matthew Von Rocketstein a aceitar o pull request.

SlimIt, Head.js e django compressor

2013-07-16T00:05:00.000-03:00

Ao configurar o django compressor com o filtro do SlimIt, o arquivo do head.js comprimido pela templatetag compress era gerado com um caracter ï ao ínicio. Utilizando o SlimIt na mão eu obtinha o seguinte erro:

$ slimit head.js
Illegal character '\xbb' at 1:1 after LexToken(ID,'\xef',1,0)
Illegal character '\xbf' at 1:2 after LexToken(ID,'\xef',1,0)

Criei uma issue no repositório do SlimIt acreditando ser um problema desta lib. E determinado a corrigir tal problema acabei descobrindo que a raíz deste era o Head.js. No arquivo fonte desta biblioteca javascript havia realmente um caracter estranho, que nenhum editor exibia, nem mesmo o Vi. No entanto, se abríssemos o arquivo via python com um simples:

open("head.js").read()

Víamos claramente o caracter escondido. Espantosamente o único editor que testei e que exibiu o caracter estranho foi o editor online do github.

Forked, pull resqueted e merged: Felizes para sempre.

Liberando o GIL do python para paralelizar seu código com threads

2012-11-21T22:56:00.000-02:00

No Python o Global Interpreter Locker impede que duas threads executem ao mesmo tempo. Uma thread só é executada quando nenhuma outra estiver executando. A solução mais comum para aproveitar todos os cores de uma máquina em python é abandonar threads e utilizar vários processos. No entanto há uma maneira de aproveitar todos os cores com threads no python. Para isso vamos precisar criar uma extensão em C.

Primeiro vamos criar uma extensão que não libere o GIL para podermos comparar e conferir a melhoria de performance depois. O pivô dessa extensão é a seguinte função:

static int reduce_com_gil(int max, int (*f)(int x, int y)) {
int retorno = 0;
int i;
for(i=0; i < max; i++){
retorno = (*f)(retorno, i);
}
return retorno;
}

Essa função faz algo similar ao que um reduce faria, mas indo de 0 ao valor indicado em max. Para cada iteração a função enviada como segundo parâmetro é executada. A idéia é causar um grande processamento para que meu core fique travado. O uso dela está descrito no código abaixo:


static PyObject *antigil_calcular_com_gil(PyObject *self) {

   int valor = reduce_com_gil(100*1000, *antigil_calculos);

   char numero [5000];

   sprintf(numero, "%d", valor );

   return Py_BuildValue("s", numero);

}

Repare que eu passo para a função reduce_com_gil o ponteiro da função antigil_calculos. Essa outra função faz diversos cálculos a cada iteração do reduce. O nome antigil é o nome da extensão de exemplo. O código completo da extensão pode ser visto aqui.

Instalada a extensão, podemos testar a performance da lib com o seguinte código:


import antigil

antigil.calcular_com_gil()

E o seguinte comando:

$ time python teste.py

real 0m2.702s

user 0m2.692s

Mas o que a gente quer é saber como se comportam as threads. No caso o seguinte script abre 4 threads para aproveitar os 4 cores da minha máquina:


import antigil

from threading import Thread



threads = []

for i in xrange(4):

   t = Thread(target=antigil.calcular_com_gil)

   t.start()

   threads.append(t)



for t in threads:

   t.join()

No entando, acaba não aproveitando. Repare que ao executar 4 threads, o GIL age e impede que duas sejam executadas ao mesmo tempo. Dessa forma só um core da máquina é aproveitado. Isso pode ser observado pelo tempo total que é aproximadamente quatro vezes o tempo de execução de uma:


$ time python teste.py

real 0m10.910s

user 0m10.881s

Bom, vamos então à extensão não bloqueante. A chave do sucesso nesse caso são as macros Py_BEGIN_ALLOW_THREADS e Py_END_ALLOW_THREADS que respectivamente liberam o GIL e obtem o GIL de volta. Essas macros estão definidas em Python.h. O código da função reduce ficaria assim:


static int reduce_sem_gil(int max, int (*f)(int x, int y)) {

   int retorno = 0;

   Py_BEGIN_ALLOW_THREADS

   int i;

   for(i=0; i < max; i++){

       retorno = (*f)(retorno, i);

   }

   Py_END_ALLOW_THREADS

   return retorno;

}

Pronto, basta criar a função antigil_calcular_sem_gil similar à antigil_calcular_com_gil, utilizando a função reduce_sem_gil e então podemos repetir o teste. No caso parametrizei o script de teste para que você possa escolher qual função deseja executar:


import antigil

from threading import Thread

import sys



if 'com-gil' in sys.argv:

    calcular = antigil.calcular_com_gil

elif 'sem-gil' in sys.argv:

    calcular = antigil.calcular_sem_gil

else:

    print "Informar com-gil ou sem-gil"

    exit(0)



threads = []

for i in xrange(4):

   t = Thread(target=calcular)

   t.start()

   threads.append(t)



for t in threads:

   t.join()

O resultado é bem animador e mostra bem que o código rodou em paralelo:

$ time python teste.py sem-gil

real 0m3.414s

user 0m13.413s

Como o código utiliza apenas CPU, sem IO algum, ao aumentar para 8 threads, mesmo a versão que libera o GIL dobra de tempo pois só tenho disponível 4 cores na minha máquina. No entanto acredito que se fizer alguma operação de IO entre as macros Py_BEGIN_ALLOW_THREADS e Py_END_ALLOW_THREADS outras threads poderão ser executadas no caminho. Isso eu ainda preciso validar.

Só é preciso ter muito cuidado pois código entre essas macros está em território perigoso. A alteração de váriaveis globais ou ponteiros que podem ser compartilhados entre outras threads podem causar erros inesperados a qualquer momento. Portanto, é importante utilizar somente váriaveis locais e dados copiados.

O código completo da extensão em C antigil pode ser visto aqui.

Ganhe convites de graça para a SEMCOMP

2012-08-09T22:43:00.001-03:00

Este ano o maior evento de tecnologia da Bahia, a Semana da Computação da UFBA, será fantástica. A lista de palestrantes já confirmados é um dos pontos que chama a atenção, com presença de Nívio Ziviani, Osvaldo Matos, Fábio Akita, Sérgio Cavalcante entre outros. Você pode ver a lista completa dos palestrantes aqui: http://infojr.com.br/semcomp/palestrantes.

Os dois primeiros lotes de entradas ao evento se esgotaram logo nos primeiros dias de lançamento, e o terceiro pode estar em vias de acabar. Como a Globo.com está patrocinando o evento, ela me deu cinco convites para que eu distribuisse por conta própria. Resolvi então fazer de uma maneira divertida.

Para garantir o seu convite grátis para a SEMCOMP, basta fazer três coisas:

Curtir o Música.com.br (novo site de música da Globo.com) no Facebook.
Criar uma playlist no Música.com.br.
Enviar a url dela para timotta@gmail.com indicando em quais quesitos ela se encaixa, lembrando de colocar no assunto do e-mail: Playlist SEMCOMP.

O criador da melhor playlist em cada um dos seguintes quesitos abaixo leva um convite para o evento.

Melhor playlist para programar em par
Melhor playlist para resolver bugs
Melhor playlist para configurar servidor
Melhor playlist para ajeitar layout web
Melhor playlist para programar sozinho

Você pode criar quantas playlists quiser. O próprio time de desenvolvimento do Música.com.br escolherá, em um método científico de experimentação, ou seja programando usando as playlists enviadas. Os vencedores serão anunciados apartir de 10 de Setembro de 2012 na página do facebook da SEMCOMP.

Mas veja que são apenas os convites. Não está incluído aí passagem ou hospedagem. Ou seja, caso você more em outro estado ou cidade, você irá arcar com esses custos. Será oferecido apenas o convite.

Para servir de inspiração, segue algumas das playlists que eu criei:

Que a diversão comece!

Divagações sobre GIL, threads e IO em python e ruby

2012-07-22T18:18:00.000-03:00

Uma coisa que eu sempre me confundi sobre ruby e python é a questão do Global Interpreter Locker (GIL). Na verdade, a dúvida maior é se operações de IO realmente bloqueiam os processos, impedindo a execução de outras threads.

Recentemente li um pouco mais sobre a versão 1.9 do Ruby e como ela passou a utilizar threads do sistema operacional, ao contrário das chamadas Green Threads da versão 1.8. No entanto, o GIL do ruby continua impedindo que duas threads executem ao mesmo tempo. A menos que uma esteja parada executando alguma operação de IO não bloqueante.

No caso do Python, o pouco de informação que tenho me leva a crer que a linguagem utiliza Green Threads. Fica então a minha dúvida se mesmo assim é possivel que o processo execute uma outra thread enquanto aguarda um retorno de IO.

Para começar fiz um pequeno script python para simular uma query pesada do MySql sendo executada 4 vezes. Se o script demorar por volta de dois segundo significa que durante uma operação de IO, o python prosseguiu executando as outras threads:


import _mysql

from threading import Thread



def executa():

    db = _mysql.connect(host="localhost",

                        user="root",

                        passwd="",

                        db="teste")

    db.query("select sleep(2)")

    r=db.use_result()

    r.fetch_row()



threads = []

for i in range(4):           

    t = Thread(target=executa, args=())

    t.start()

    threads.append(t)



for t in threads:

    t.join()

O resultado da execução pode ser visto abaixo, mostrando que o IO não bloqueou o programa:


> time python teste.py

real 0m2.030s

user 0m0.016s

sys 0m0.012s

Executei o mesmo teste com ruby, com o script similar abaixo:


require 'mysql2'



threads = []

4.times do |i|

    thread = Thread.new do

        my = Mysql2::Client.new(host: "127.0.0.1",

username: "root",
database: "teste")

        my.query("select sleep(2)").collect{|i|i}

    end

    thread.run

    threads << thread

end



threads.each do |thread|   

    thread.join

end

E o resultado também foi satisfatório:


> time ruby teste.rb

real 0m2.178s

user 0m0.076s

sys 0m0.008s

Ou seja, tanto python como ruby estão lidando bem com execução paralela. Mesmo se não utilizar todos os cores disponiveis, no mínimo o IO para o MySql não está bloqueando.

Com esse bom resultado, resolvi então subir um pouco mais de nível e verificar se colocando o Django na equação poderíamos aproveitar esse bom desempenho. Criei essa pequena view para simular uma query lenta, assim como nos scripts acima, e iniciei o Django (versão 1.3) com o gunicorn.


from django.db import connection

def debug(request):

    cursor = connection.cursor()

    cursor.execute("select sleep(2)")

    return HttpResponse('ok')

E o resultado, como pode ser visto abaixo, foi ruim:


> ab -n 4 -c 4 http://127.0.0.1:8000/debug/

Time taken for tests:   8.161 seconds

O mesmo teste executado para a dupla ruby on rails tem resultado parecido:


> ab -n 4 -c 4 http://localhost:3000/politicos/

Time taken for tests:   8.043 seconds

Conclusão:

Embora python e ruby permitam IO não bloqueante, os frameworks Django e Rails ainda são bloqueantes. Um dos motivos daqueles memes de "Rails não escala" e "Django não escala". Felizmente sempre há alternativas.

É possivel escalar via processos como explicado em Solucionando IO bloqueante do mysql para Rails, e utilizando vários workers do gunicorn para Django. Caso seja necessário uma escalabilidade maior ainda, o ideal então é partir para soluções como Event Machine do ruby, GEvent para Python, ou até mesmo NodeJs. Combinando essas soluções com múltiplos processos.

Estratégias de persitência do Redis

2012-07-09T01:55:00.001-03:00

Nestes ultimos meses eu ministrei algumas palestras sobre o uso de redis na prática, usando como exemplo algumas funcionalidades do Musica.com.br. Uma das dúvidas mais constantes nesses eventos é a questão das opções de persistência deste banco. Confesso que não havia estudado o suficiente sobre o assunto. Eis que com o crescimento do site, surgiu a necessidade de uma estratégia melhor sobre a manutenção desses dados. Então aproveitei minhas horas em aeroportos para realizar alguns testes e responder algumas dúvidas.

Pra começar o Redis possui três formas de persitência de dados: não persistir, dump no disco e append only. O dump no disco, basicamente consiste em gravar uma cópia da memória em disco de tempos em tempos. O append only grava todos os comandos em um log, de forma que para recuperar em caso de restarte ele refaz todo o caminho percorrido.

Vamos então às perguntas que eu me fazia, e as respostas que obtive com meus testes:

1) É possível converter o rdb (dump) para um aof (appendonly)?

Sim para isso é preciso executar o comando BGREWRITEAOF estopar o redis server e estartar de novo com a nova configuração habilitada para appendonly.

2) O tempo para gravar e carregar o rdb (dump) é grande?

Para os padrões do Redis gravar é sim um tempo grande. Mas talvez não seja algo que possa atrapalhar. Depende muito da quantidade de dados que sua base terá. Fiz um teste adicionando três vezes 1 milhão de registros e comparando o tempo que demorava para gerar o dump em disco. Essa tabela pode servir de guia para saber quando é o momento de colocar o seu redis em uma máquina separada, e quando é hora de trocar de estratégia de gravação.

Registros	Memória	Disco	Tempo para salvar	Gravação de Memória/segundo
1.000.000	99.94M	24M	0.56s	178.46 M/s
2.000.000	191.50M	42M	0.93s	205.37 M/s
3.000.000	283.03M	62M	1.32s	214.41 M/s

No entanto, o load do dump com 3 milhões de registros foi irrisório.

3) O tempo para carregar o aof (appendonly) é grande?

Sim, muito grande. Fiz um teste similar ao do dump, com 3 milhões de registros colocando 313.57M em memória e gerando um arquivo aof de 192M. Ao restartar o servidor ele demorou 4 segundos para carregar o arquivo. Um resultado que achei péssimo. No entanto, se pensarmos que restartar servidor é algo que não faremos com tanta constância, pode não ser algo tão ruim. Só deve-se ficar em mente que manutenções desse tipo em uma base grande devem ser em horários com pouco ou nenhum uso do seu servidor master.

4) É possivel compactar o aof (appendonly) com BGREWRITEAOF?

Um dos problemas do aof é que se um registro for incluido e removido, as duas instruções serão gravadas, de forma que não há uma razão clara entre memória do servidor e tamanho do arquivo em disco gerado. Minha dúvida era se eu poderia limpar o aof reescrevendo somente aquilo que estava na memória. E sim, isso é possível com o comando BGREWRITEAOF.

5) Os comandos ficam mais lentos com aof (appendonly) ligado?

Sim. Fiz um teste com concorrência. Três processos inserindo 1 milão de itens em listas diferentes. Com rdb (dump) habilitado a média de tempo foi de 0.33s. Com aof (appendonly) habilitado e rdb (dump) desabilitado a média de tempo foi de 0.43s. Essa proporção de 30% mais demorado para aof foi constante nos três testes seguintes que fiz como tira-teima.

6) Se o redis slave ficar down por um tempo ele recebe os dados do master?

Sim, fiz uma série de testes e percebi que o redis slave ignora os arquivos salvos por ele mesmo. Sempre que se inicia ele recebe todos os dados novamente do master, seja ele configurado como rdb ou como aof. Portanto, se estiver utilizando master slave, uma boa é configurar o seu slave para não gravar nada.

7) Gravar o rdb (dump) afeta a performance na resposta a outros comandos?

Sim, fiz um teste gravando três vezes uma memória de 2GB enquanto estava inserindo 1 milhão de novos registros. A insersão deles demorou 25% mais que inserir a mesma quantidade de registros sem salvar nenhuma vez o rdb. Só para se ter uma idéia, o tempo de gravação deste dump foi de 9 segundos e o tempo de load foi de 4 segundos.

Conclusão:

- Manter seus dados com rdb deixa o redis mais rápido, mas pode exigir muito de IO nos intervalos de persistência. Se a memória estiver muito grande, vale a pena colocar o servidor master em uma máquina dedicada para que não atrapalhe outros serviços.

- Manter seus dados com aof deixa o redis mais lento, mas garante que nenhuma transação será perdida. É preciso ficar atento para o arquivo gerado e periodicamente compactar ele com BGREWRITEAOF. Também tomar cuidado nos restarts que podem demorar.

Bate-papo sobre o Redis no 3o Dev in Santos

2012-05-02T01:44:00.001-03:00

Neste sábado dia 5 de Maio participarei do maior encontro de desenvolvedores da baixada santista, falando um pouco sobre o Redis, e como o utilizamos pra tornar o desenvolvimento do Musica.com.br mais fácil, estável e seguro.

O programa completo do Dev in Santos com todas as apresentações você pode ver aqui: http://www.mktvirtual.com.br/mailing/2012/dev-in-santos/. Repare que teremos ótimas apresentações sobre uma grande variedade de assuntos como NodeJS, Python, IOS, Unity3D e é claro, Redis.

Se você estiver pela baixada santista, ou até mesmo em São Paulo neste fim de semana, será uma bela oportunidade para bater um papo sobre desenvolvimento e outras nerdices. Não deixe de se inscrever enquanto ainda há vagas.

Vejo vocês lá.