Part of the purpose of this website is to make it easy for people to experiment with RST. Various sorts of tools can help.
Tools for analysts
Programmed by Mick O'Donnell. The tool can help a great deal in constructing diagrams of RST analyses, and most published analyses (including the ones on this site) are produced with RSTTool. We gratefully acknowledge Mick O'Donnel's contribution to RST research through this tool.
Please check the page on Troubleshooting RST. These are issues that seem to have arisen with RSTTool 3.43 (for Windows) under Ubuntu.
Originally conceived to do analysis in the Systemic Functional Linguistics framework, it now incorporates RST capabilities, in addition to a host of other annotation and analysis tools. An excellent resource for any linguist. Also programmed by Mick O'Donnell.
Download UAM CorpusTool.
rstWeb - an online and local annotation tool for RST: https://gucorpling.org/rstweb/info/
GUM - a multilayer corpus containing (among other things) RST analyses for 193 documents and counting https://gucorpling.org/gum
AMALGUM - a larger multilayer corpus containing analyses of 4 million tokens, automatically annotated https://gucorpling.org/amalgum.html
Mick O’Donnell's tool was modified by Daniel Marcu, and is available from his web site: http://www.isi.edu/licensed-sw/RSTTool/
David Reitter has created a tool to generate RST-style diagrams using the LaTeX text processing software. The package produces an RST tree and marks its corresponding text with the appropriate span labels: http://www.david-reitter.com/compling/rst/index.html
Tables for accessing RST relation definitions
Part of this website. For use in manual analysis of texts. Relation Lists and Definitions.
There are RST analysis files for the texts and diagrams exhibited on the website. There are 15 text analyses currently on the web site, with the Mother Teresa text having 3 analyses, so there are 17 analyses in all. The collection can be downloaded as a pdf document, or in .RS3 files, to be viewed with Mick O'Donnell's RSTTool (see above).
The RST Corpus is a collection of Wall Street Journal articles annotated using (a version of) RST by Lynn Carlson, Daniel Marcu and Mary Ellen Okurowski. The corpus is available through the Linguistic Data Consortium, free for members, and at a cost for non-members (Catalog number: LDC2002T07). Further information:
A RST-annotated corpus in Brazilian Portuguese, coordinated by Thiago Pardo. The texts have been annotated with RSTTool.
Papers describing the corpus:
Discourse Relations Reference Corpus
The materials in the Discourse Relations Reference Corpus are taken from three different sources: texts from this web site; annotated Wall Street Journal articles from the RST Discourse Treebank (see above); and review texts from the SFU Review Corpus. The documents in each of the subcorpora have been annotated with RSTTool (see above). Although the background to all subcorpora is Rhetorical Structure Theory, and they have been annotated with RSTTool, we believe that the corpus is useful to anyone interested in discourse relations, from whatever perspective. The annotations provide rich information on what relations are more common; how they are commonly signalled; and how relations are distributed in different genres. A description and the full collection are available from the Discourse Relations Reference Corpus page.
Spanish RST Discourse Treebank
A new RST-annotated corpus in Spanish, coordinated by Iria da Cunha. To access the corpus:
Potsdam Commentary Corpus
A corpus of 220 German newspaper commentaries annotated with different types of linguistic information, including RST, downloadable from:
The corpus is described in:
Stede, Manfred and A. Neumann (2014). Potsdam Commentary Corpus 2.0: Annotation for Discourse Research. Proceedings of the Language Resources and Evaluation Conference (LREC), Reykjavik.
Text generation research
In addition, there are various kinds of computer programs that have been oriented by RST. Several efforts have benefited from the orientation of RST, some following the first published work much more closely than others. To learn about text generation and find a demonstration of text generation on line on the net, go here: Text Generation.
A set of slides that can serve as a basic introduction to RST. The slides come out of courses taught by Manfred Stede and Maite Taboada. You are free to use and/or modify them. We would appreciate an acknowledgement of the source.
A wiki article on RST in German.
go to top
©2005-2023 William C. Mann, Maite Taboada. All rights reserved.