Versatile
  • Home
    • Subscribe
  • After IELTS
  • Blog
  • Books
  • Resources
  • VersaText
    • About Versatext
  • About Versatile
    • About me
  • Versatile Lessons for Teachers
  • VersaText Questionnaire

VersaText Information and Resources
​

Picture
​The VersaText e-learning course introduces teachers to many aspects of teaching through texts. 
Check out the course here. 
​learn.versatile.pub/b/ziYhk

  • Introduction
  • Language
  • Word Cloud
  • Concordancer
  • Profiler
<
>
VersaText is a new web-based tool for learners of English to explore the language of a single text. 

Versatile's YouTube channel has some demo videos.
​
The basic steps:
  • paste a text into the frame on the VersaText homepage.
  • click the Word Cloud button to see a word cloud.
  • click a word in the Word Cloud to see it in concordance lines.
  • click the Profiler button for information about the words in the text.
  • More details about each of these steps can be found under each of the tabs here.
VersaText was created by James Thomas and Vít Baisa. James Thomas, the author and/or editor-publisher of  the books on this website, has been using word clouds, Concordle and Vocabulary Profiler with his students and teachers for many years. He also uses SkELL extensively with teachers and students. Vít Baisa, the main programmer of SkELL, offered to bring James' dream of combining and extending Concordle and Vocabulary Profiler to life.  Vít has not only programmed it, but has enriched it with features not previously imagined.  
Grammar, Vocabulary and Everything in Between is an activity book for studying English vocabulary and grammar using SkELL, VersaText and other online tools. Unit 2,  Meet VersaText, can be downloaded as a PDF below. 
Two web-based tools for learning language from language is an article by James Thomas written in  April 2020 for a conference that was cancelled due to the corona virus. It is both a practical guide to using SkELL and VersaText and a discussion of the aspects of linguistics and language acquisition that relate to learning language from language.  The pdf can be downloaded below.
Picture

VersaText

The inspiration

​VersaText combines features of two online text tools, Concordle and Vocabulary Profiler:
  • Concordle makes a word cloud from a text and then shows how individual words are used in that text in concordance lines.
  • Vocabulary Profiler provides frequency statistics about the words in the text.
VersaText has many additional features that expand on the resources of both.

Concordle

Concordle was created by Ladislav Kocbach, a Czech physicist working at the University of Bergen in Norway. More information about his program is available in this website and at its blog.

Vocabulary Profiler

Vocabulary Profiler is an online text tool that is based on an earlier downloadable tool by Paul Nation and Batia Laufer called Range. It is one of many tools in the Compleat Lexical Tutor suite which Tom Cobb from L'Université du Québec à Montréal has been developing for several decades. There is some information in this website.
gveb_unit_2_versatext_16.4.2020_.pdf
File Size: 1294 kb
File Type: pdf
Download File

two_web-based_tools_for_learning_language_from_language._thomas_2020_v2.pdf
File Size: 1398 kb
File Type: pdf
Download File

Picture
Picture
Picture
Picture
(v2 - a change to the second graphic on p.12.)
VersaText works as a corpus for a single text.  The very idea of a single text being a corpus is not consistent with most definitions of corpus. Definitions typically include features such as large collections of authentic language, sampled and stored, representing either a genre or register, or being a cross-section of a whole language. Large used to mean millions of words but nowadays means billions.

However, using corpus techniques with a single text provides insights into the words and phrases, patterns, grammatical features and some aspects of frequency of the chosen text. This is quite different from exploring features of the whole language.

Preloaded texts

Click on the titles under the input field to set them in the input field. These are samples of authentic texts and graded readers. Sources and links related to these texts is on the Text Information page. 

Your own texts

More interesting are the texts you choose yourself. They might be:
  • a topic of personal interest to you
  • an online news story
  • a Wikipedia article
  • a film review or synopsis
  • a short story
  • a text required for study
  • examples of interesting use of language 

The texts can be of any length. You can paste  several texts into the frame.

 VersaText does not save your text. If you want to work with it again, you need to save it as a file on your computer. 

What to do
  1. Paste your text into the frame.
  2. Click the Word Cloud button to get started.

Part of speech tags

CCcoordinating conjunctionand
CDcardinal number1, third
CDZ possesive pronounone’s
DTdeterminerthe
EXexistential therethere is
FWforeign wordd’hoevre
INpreposition, subordinating conjunctionin, of, like
JJadjectivegreen
JJRadjective, comparativegreener
JJSadjective, superlativegreenest
LSlist marker1)
MDmodalcould, will
NNnoun, singular or masstable
NNSnoun pluraltables
NNSZpossessive noun pluralpeople’s, women’s
NNZpossessive noun, singular or massyear’s, world’s
NPproper noun, singularJohn
NPSproper noun, pluralVikings
NPSZpossesive proper noun, pluralBoys’, Workers’
NPZpossesive noun, singularBritain’s, God’s
PDTpredeterminerboth the boys
PPpersonal pronounI, he, it
PPZpossessive pronounmy, his
RBadverbhowever, usually, naturally, here, good
RBRadverb, comparativebetter
RBSadverb, superlativebest
RPparticlegive up
SENTSentence-break punctuation. ! ?
SYMSymbol/ [ = *
TOinfinitive ‘to’togo
UHinterjectionuhhuhhuhh
VBverb be, base formbe
VBDverb be, past tensewas, were
VBGverb be, gerund/present participlebeing
VBNverb be, past participlebeen
VBPverb be, sing. present, non-3dam, are
VBZverb be, 3rd person sing. presentis
VHverb have, base formhave
VHDverb have, past tensehad
VHGverb have, gerund/present participlehaving
VHNverb have, past participlehad
VHPverb have, sing. present, non-3dhave
VHZverb have, 3rd person sing. presenthas
VVverb, base formtake
VVDverb, past tensetook
VVGverb, gerund/present participletaking
VVNverb, past participletaken
VVPverb, present, not 3rd persontake
VVZverb, 3rd person sing. presenttakes
WDTwh-determinerwhich
WPwh-pronounwho, what
WPZpossessive wh-pronounwhose
WRBwh-abverbwhere, when
Zpossessive ending‘s

What is a word cloud?

​A word cloud is an image made up of the words from a text that you provide. Word clouds are popular in presentations, on webpages, coffee cups and are used in books and lessons as attractive assemblies of words. But they are not just decorative. 

Size
​
The main feature of a word cloud is that the more frequent a word is in the text, the bigger it appears in the word cloud. Most word clouds contain content words only: nouns, verbs, adjectives and adverbs. This is why it is usually possible to guess the topic of a text just be looking at its word cloud. 

The biggest words in word clouds are not, however, the most frequent words in the text. Function words are: prepositions, pronouns, determiners, conjunctions, auxiliaries.
​
On the right, under the word cloud, you can choose which parts of speech to show in the word cloud.

Colour

While the colours  in most word clouds are just decorative, in VersaText, they indicate parts of speech.

Below the word cloud, 
  • deselect Colourize if the only word colour you want is black, 
  • use the slider to make the colours darker or lighter.

Use the number buttons to choose how many words to show. 

Use the three radio buttons to change what the word cloud is made up of: 
  • words: the words as they appear in the text
  • lemmas: the base forms of the words, e.g. name (singular) and names (plural) appear under name. 
  • tags: the part of speech codes, e.g. NNS for plural noun, VVG for verbs in their -ing forms. There is a list of tags under the Language tab.

Focus on a word

Click on a word or tag in the word cloud to see its concordance.  For more information about looking at words in context, look under the Concordance tab 
Picture
Picture

Picture

Picture
A concordance is a page of lines of text that show how a word, lemma or tag is used in the text provided. 

The target items run down the middle, a format known as Key Word in Context (KWIC). These contexts allow us to explore various language features. The word in the middle is referred to as the Node.

This concordance was created by clicking on social in the word cloud. 
Sorting concordance lines reveals patterns in the word’s usage​ if there are enough lines.

Click the column heading, KWIC to sort the lines in alphabetical order. 
  • If the concordance was created from a Tag in the word cloud, there will be different words. 
  • If the concordance was created from a Lemma in the word cloud, there will be different words forms.​
Click on  Left context and Right context to sort the lines in alphabetical order of the words that come before or after the node. ​

Picture

More buttons
  • Click the Colourize button to show the words in the part of speech colours.
  • Click Monochromatic to return to the original colour scheme.
  • Click the Sentence button to see whole sentences instead of the KWIC view.
  • Click KWIC to return to the KWIC view.

In Sentence view,
  • Click Sort by length to sort the sentences in order of the number of words in the sentences.
  • Click Text order to see the sentences in the order that the occur in the text.​​

Picture

Picture
The Profiler shows lexical statistics about the text.

Types and Tokens

The Naming text has 532 words and is said to have 532 tokens. A lot of these words are repeated, some content words and especially function words. The different words are referred to as types. 

The type-token ratio (TTR) is the number of tokens divided by the number of types times one hundred. The TTR  is a measure of lexical density. It varies in different registers and genres, such as fiction vs. non-fiction, teenagers vs. adults and native vs. non-native speakers.
Frequency bands
​

The Top 1,000 words are the most frequent 1,000 words in English generally. The table says that 55.45% of all the words in the text are among the most frequently used 1,000 words of English and that only 5.64% are in the second thousand: 1,000 to 2,000.
Picture

​​


The Academic Wordlist (AWL) is a list of words that are not in the top 2,000 but are commonly found in many kinds of academic texts. The AWL does not include terminology. ​

Text-specific words are the words in the text that are not in the other three groups. 

Picture
The pie chart shows the relative frequencies of parts of speech in the text.

Under the table and the pie chart, the four tables contain all of the words grouped according to their bands and their parts of speech. 

The two buttons under the four tables show lists. ​

Show Lemmas shows a list of all the words in the text in their base or dictionary forms. These are lemmas. 

Clicking the column headings sorts the list. 
  • Lemma: all of the lemmas in alphabetical order. In computer terms, numbers and punctuation come before A in the alphabet.
  • Frequency: all of the lemmas in order of their frequency in the text. 
  • HPM: this stands for Hits Per Million. If your text had a million words in it, this number is how often the lemma would occur. HPM is used to compare word usage in different texts and large bodies of text, i.e. corpora. 
  • General HPM: the HPM of the lemma in general English. This is not related to your text.
  • Score: this compares words in the text with English in General. The calculation is HPM + 1 / Gen HPM + 1.

Show Analysis shows all the words in the text in the order of the text. You can read the text vertically.  
The columns show how VersaText  has analysed the text to provide the colours, tables and charts.  If you find something strange, you can find it in this table to see what VersaText has done. No programs that analyse text are perfect. ​ ​These columns cannot be sorted.

Services

Versatile Books
Courses
Resources
​
Moodle site

Organisation

About Versatile
James Thomas
​Privacy Policy
​Contact
​
Lulu
Picture


​
​© COPYRIGHT 2018. ALL RIGHTS RESERVED.

  • Home
    • Subscribe
  • After IELTS
  • Blog
  • Books
  • Resources
  • VersaText
    • About Versatext
  • About Versatile
    • About me
  • Versatile Lessons for Teachers
  • VersaText Questionnaire