A tutorial on how to count hapaxes (words that occur only once in a text or corpus) using NLTK.
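To make the idea concrete, here is a minimal sketch of hapax counting. NLTK's `FreqDist` has a `hapaxes()` method for this; the version below deliberately uses only the standard library's `collections.Counter`, so it is a stand-in rather than the tutorial's code. The regex tokenizer and the sample sentence are assumptions made for illustration.

```python
import re
from collections import Counter


def hapaxes(text):
    """Return the words that occur exactly once in `text`, in first-seen order."""
    # Naive tokenizer (an assumption): lowercase, keep runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    # Counter keeps insertion order, so the result follows the text order.
    return [word for word, n in counts.items() if n == 1]


sample = "the cat sat on the mat and the dog sat too"
print(hapaxes(sample))
```

On real corpora, a proper tokenizer (such as the ones NLTK ships) would replace the regex, since punctuation and casing choices change which words count as hapaxes.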
Some alternatives mentioned:
- Pattern : Python package for datamining the WWW which includes submodules for language processing and machine learning
- Polyglot : language library focusing on "massive multilingual applications"
- spaCy : an "industrial strength" NLP library focused on performance with a streamlined API
Recipe on how to build a $35 cloud-hosted distributed Python web scraper.
There is also really interesting advice on the legal aspects of scraping on this blog: https://benbernardblog.com/web-scraping-and-crawling-are-perfectly-legal-right/
Thomas Durey and I are organizing a meetup in Nantes on Tuesday, October 3, at the Voyages-Sncf.com offices, together with the AFPY, to chat about the Python language!
On the agenda:
https://en.wikipedia.org/wiki/Bullet_graph with matplotlib & seaborn
I will give 2 short talks at PyConFr in Toulouse this coming September 23 & 24!
The topics:
- pre-commit hooks for git
- a web mind map controlled by SMS with Python & Twilio
I can't wait to be there and to take part in the sprints :)
Online tool to add nice effects to photos
OSS Python script: https://github.com/anishathalye/neural-style
Pants supports Java, Scala, Python, C/C++, Go, Javascript/Node, Thrift, Protobuf and Android code.
Meson is an open source build system meant to be both extremely fast, and, even more importantly, as user friendly as possible.
Multiplatform support for Linux, OSX, Windows, GCC, Clang, Visual Studio and others. Supported languages include C, C++, Fortran, Java, Rust.
Optional Ninja backend
That's it! The switch is done!
This blog is now a static blog, generated with Pelican.
And by the way, it's its birthday! 3 years old :)
Why migrate to a static blog with Pelican?
- for simplicity:
make publish
and all that's left is to serve the generated HTML files
- for security …
"logging.basicConfig configures the root logger: this is not something you want to do in practice, because it causes debugging output for all loggers in your program, including every library that uses logging.
In general, you'll want to leave the root logger alone and configure a specific "main" logger for your program and put all the other loggers under that logger."
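A minimal sketch of what that advice could look like in practice. The logger names `myapp` and `myapp.db` are hypothetical, and the handler and format are just one reasonable choice, not something taken from the quoted post.

```python
import logging

# "myapp" is a hypothetical name for the program's own "main" logger.
app_logger = logging.getLogger("myapp")
app_logger.setLevel(logging.DEBUG)

# Attach the handler to the main logger only; the root logger is left alone.
handler = logging.StreamHandler()
handler.setFormatter(logging.Formatter("%(name)s %(levelname)s %(message)s"))
app_logger.addHandler(handler)

# Dotted names place other loggers under the main one ("myapp.db" is hypothetical).
db_logger = logging.getLogger("myapp.db")
db_logger.debug("connection opened")  # handled by the "myapp" handler
```

Loggers created by third-party libraries live outside the `myapp` hierarchy, so they do not inherit this DEBUG level or the handler's format.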
Open source grammar checker dedicated to the French language, for Writer (LibreOffice, OpenOffice), Firefox & Thunderbird.
mkdir grammalecte && cd grammalecte
fossil clone http://code.grammalecte.net grammalecte.fossil && fossil open grammalecte.fossil
./make.py fr
PYTHONPATH=$PWD ./cli.py -f tests/fr/text1.txt
Using:
- Librosa - Python library for audio and music analysis
- MoviePy - Python library for video editing
Python-based framework for configuring, compiling and installing applications
VIA: http://taint.org
function lazyRender(el, html) {
    let cloneEl = el.cloneNode();
    cloneEl.innerHTML = html;
    window.requestAnimationFrame(function () {
        morphdom(el, cloneEl);
    });
}
I ended up not using this code, but it may be useful to others:
mock_setup_provider.py:
import sys
from unittest.mock import Mock


class MockSetupProvider(Mock):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # Requirements collected from the setup_requires argument of setup() calls
        self.captured_setup_requires = set()

    def setup(self, *args, **kwargs):
        # setup_requires may be absent: fall back to an empty tuple
        self.captured_setup_requires.update(kwargs.get('setup_requires') or ())

setup_extractor …
Used by netsuite.com to build and upload packages of Python libs depending on C extensions, before pushing them to a Nexus with repositorytools
ProximityHash: Geohashes in Proximity (with the option of compression using Georaptor)