Dr George R S Weir, Department of Computer & Information Sciences, University of Strathclyde, Glasgow
This workshop explores the use of simple software-based textual analysis techniques for extracting useful information from existing document collections. Using Unix shell processing and readily available open-source Perl applications, we will extract quantitative data from a variety of textual resources and consider how to apply these methods to practical information problems such as document indexing, readability and authorship analysis.
This will be conducted as a practical workshop using Linux-based computers.
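As a flavour of the kind of shell processing involved, the sketch below counts word frequencies in a text file, a typical first step toward indexing or readability measures. It is an illustration only and not taken from the workshop materials: the function name `word_freq` and the two-sentence sample text are assumptions made for this example, and splitting on non-alphabetic characters is a deliberate simplification (it breaks apart words containing apostrophes or hyphens).

```shell
#!/bin/sh
# word_freq FILE: hypothetical helper (not from the workshop materials).
# Prints each distinct word in FILE with its count, most frequent first.
word_freq() {
    # 1. Turn every run of non-letters into a single newline (one word per line).
    # 2. Fold to lower case so "The" and "the" are counted together.
    # 3. Drop any empty line left by leading punctuation.
    # 4. Sort, count duplicates, then order by descending count.
    tr -cs '[:alpha:]' '\n' < "$1" |
        tr '[:upper:]' '[:lower:]' |
        sed '/^$/d' |
        sort | uniq -c | sort -rn
}

# Illustrative input only: a tiny sample written to a temporary file.
sample=$(mktemp)
printf '%s\n' 'The cat sat on the mat.' 'The dog sat on the log.' > "$sample"

# Show the five most frequent words in the sample.
word_freq "$sample" | head -n 5

rm -f "$sample"
```

For this sample, the most frequent word is "the", with a count of 4. The same pipeline can be pointed at any plain-text file, and the resulting counts could feed further quantitative measures.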
Format of Workshop: Half Day