John Johnson

XML

XML is the mark-up language most often used to store dictionary data. For over twenty years I have been involved in the conversion, structuring and management of large lexicographical datasets in a range of languages, from Italian to Arabic.

  • Developer of internal editorial tools using XML (Python, Elementtree, XPath) for dictionary analysis and comparisons
  • Project owner for the development of dictionary editing software
  • Project owner for data conversion from a legacy format into XML
  • Project owner for the restyling of the dictionary graphic layout from XML
  • Knowledge of character management in Unicode
  • General knowledge of HTML, CSS and Javascript

Python

I'm a self-taught coder, which has enabled me to produce a whole range of software tools for data management and language-analysis based on Natural Language Processing principles

  • Large corpora in English, Spanish, German and Italian for lexicographical purposes, creating using web crawlers.
  • Language analysis, collocation, trending information for the identification of new words and meanings.
  • Web GUI with infographics.
  • The Python modules used: (app) Flask, (database) MongoDB via pymongo, (grafica) networkx, mathplotlib, plotly, (NLP) spacy, NLTK.
  • CFER levels for English
  • Disambiguation and automatic glossary generation for Italian
  • Mock-up platform for dictionaries with spellcheck and tip function

Click on the links to see some examples of the software tools I have developed

Publishing software

  • Everyday use of all the standard software packages used in publishing. Adobe, Office: Word, Excel, Access, PowerPoint, email providers, Google Drive, etc.
  • Advanced user of Word and Excel, in particular use of the track changes and compare functions in Word for versioning revisions, and Excel formulas for budgeting and so on.
  • Internal company systems for project management and admin, using Notes and web portals.
  • Software and other platforms used for project management and collaboration with third parties (Slack, Coda, Jira, etc.).

Dictionary software

I am the coordinator of all the digital aspects of dictionary production, a list containing more than 40 works in more numerous digital formats, offline and online versions, as well as other products produced with third party digital developers. The principal tasks covered include:

  • Update and maintenance of dictionary interfaces
  • Commercial software: offline for Windows, Mac, iOS and Android, Kindle and web
  • management of the platforms for digital publication
  • Chatbot for Messenger and Telegram

Click on the links to see some examples of the projects I run

CMS

I have used numerous content management systems:

  • management and development of a web-based platform for language learning entitled Eliza
  • management of content in WordPress for DizionariPiù
  • management of the Word of the Day service, which includes both mailing and web publication services