Jonathan Reeve
@JonathanReeve

Infrastructures for Cultural Analytics, Digital Humanities, Text Analysis, and NLP

JonathanReeve receives USD 0.00 per week from 0 patrons.

Description

I build tools and infrastructures for analyzing, collecting, and manipulating texts, so that we can better understand books and other textual cultures. Some of my recent projects have included Macro-Etym, a tool for analyzing the etymologies of a text; Text-Matcher, a text reuse detection tool, good at identifying when a text quotes from another; Corpus-DB, an API for Project Gutenberg and other text repositories; and Chapterize, a tool for splitting a book into its chapters. I also lead the Open-Editions project, which aims to produce richly annotated editions of classic works of literature, and the Git-Lit project, which publishes the British Library's digital books through GitHub.

I'm a PhD candidate in English and Comparative Literature at Columbia University, where I work in the Literary Modeling and Visualization Lab of the Group for Experimental Methods in the Humanities. Our group has no funding of its own, and my graduate student funding is very modest, so donations (of money, cryptocurrency, and/or code) are deeply appreciated.

Linked accounts

JonathanReeve owns the following accounts on other platforms:

JonathanReeve (Jonathan Reeve)

j0_0n (Jonathan Reeve / @JonathanReeve@hcommons.social)

Repositories

text-matcher Stars 120 Updated 10 months ago

A simple text reuse detection CLI tool.

corpus-db Stars 57 Updated 4 years ago

A textual corpus database for the digital humanities.

late-style-PCA Stars 10 Updated 4 years ago

An attempt to experimentally test Edward Said's claims about late style using computational text analysis and principal component analysis.

chapterize Stars 82 Updated 6 years ago

A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books for computational text analysis.

chapter-experiments Stars 0 Updated 6 years ago

Quantitative analyses of novelistic chapters. Diachronic analyses of chapter lengths, numbers of chapters, linguistic patterns within chapters.

sentence-trees Stars 1 Updated 7 years ago

Experiments with sentences as trees.

character-attribution Stars 2 Updated 7 years ago

Probabilistic attribution of character voices in fiction.

allusion-detection Stars 9 Updated 8 years ago

Computational intertextuality detection in Python. Fuzzy string matching, approximate string matching.

History

JonathanReeve joined 5 years ago.

Income per week (in US dollars)

Weekly number of patrons
