zhihu-comment-wordcloud
-
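Word cloud analysis of Zhihu comment sections
Code overview
- 2.download_comment.py: downloads the full set of comments
- 2.word_cloud_by_dt: generates the word cloud
- 2.to_csv: extracts the raw comment data to a CSV file
- 2.to_json: extracts the raw comment data to a JSON file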
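
The CSV and JSON extraction steps listed above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the project's actual code: the record layout, the field names (`id`, `author`, `created_time`, `content`), and the helper names `extract`, `to_csv`, and `to_json` are all assumptions, since this README does not document the format that 2.download_comment.py saves.

```python
import csv
import io
import json

# Hypothetical raw records; the real field names depend on what
# 2.download_comment.py saves and are not documented in this README.
RAW_COMMENTS = [
    {"id": 1, "author": "user_a", "created_time": 1609459200,
     "content": "写得很好", "extra": {"x": 1}},
    {"id": 2, "author": "user_b", "created_time": 1609545600,
     "content": "学习了,\n感谢分享", "extra": {}},
]

# Assumed subset of fields worth keeping for analysis.
FIELDS = ["id", "author", "created_time", "content"]


def extract(raw):
    """Keep only the assumed fields, filling missing ones with ''."""
    return [{key: record.get(key, "") for key in FIELDS} for record in raw]


def to_csv(rows, fp):
    """Write the rows as CSV with a header line."""
    writer = csv.DictWriter(fp, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


def to_json(rows, fp):
    """Write the rows as JSON, keeping Chinese text readable."""
    json.dump(rows, fp, ensure_ascii=False, indent=2)


rows = extract(RAW_COMMENTS)

# In-memory buffers stand in for real files here.
csv_buf = io.StringIO(newline="")
to_csv(rows, csv_buf)

json_buf = io.StringIO()
to_json(rows, json_buf)
```

When writing to a real file instead of a buffer, opening it with `open(path, "w", newline="", encoding="utf-8")` keeps the `csv` module's line endings intact, and `ensure_ascii=False` stops `json` from escaping Chinese characters.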