InfoTypography: Information Visualisation through and for Text
Lead Research Organisation:
University of St Andrews
Department Name: Computer Science
Abstract
In several fields where text is the main source of information, it is important to be able to represent associated data about the text alongside it. For example, in the speech recognition field, it is often necessary to display how certain we are that a word has been recognised accurately. Similarly, in advanced thematic search, we might want to provide information about how much a certain word, sentence, or paragraph, relate to the topic we are interested in.
In this work I will investigate a new visualisation technique that allows representing data that is associated to text in-situ (within the text itself), without requiring additional visual artefact. The technique, called InfoTypography, is based on the idea that we can modify typographic parameters of the letters (glyphs) to efficiently represent information. For example, the letters of a word can be made to have more prominent serifs if that word has been recognised with more certainty or if it is more relevant to the topic being searched.
The technique might offer singular advantages beyond the fact that it does not require additional visual artefacts to "pollute" the text. For example, there are many different typographic parameters that could be used to represent many different variables. The purpose of this research is to investigate the properties of the technique, to characterise the best ways to take advantage of it, and to build a system that will make it available to people who need to represent this kind of data. Finally, the technique will be put in practice through an application within the digital humanities field.
In this work I will investigate a new visualisation technique that allows representing data that is associated to text in-situ (within the text itself), without requiring additional visual artefact. The technique, called InfoTypography, is based on the idea that we can modify typographic parameters of the letters (glyphs) to efficiently represent information. For example, the letters of a word can be made to have more prominent serifs if that word has been recognised with more certainty or if it is more relevant to the topic being searched.
The technique might offer singular advantages beyond the fact that it does not require additional visual artefacts to "pollute" the text. For example, there are many different typographic parameters that could be used to represent many different variables. The purpose of this research is to investigate the properties of the technique, to characterise the best ways to take advantage of it, and to build a system that will make it available to people who need to represent this kind of data. Finally, the technique will be put in practice through an application within the digital humanities field.
Planned Impact
Impact to the Economy:
This proposal is aimed at enabling new ways to present and understand data through a novel technique based on typography. A fundamental component of the promise of big data and evidence-based decision making is the ability of humans to make sense of information in an efficient way. Novel techniques that are useful to represent data have therefore the potential to facilitate the big data and evidence-based decision making endeavours, especially if this is for types of data that have been traditionally difficult to represent (the case of text).
The proposed technique can be leveraged by the growing industry of data visualisation businesses, which is also increasingly relevant in the UK but that, unfortunately, is currently led mostly by non-European businesses. For an impression of the size and economic importance of the field, Tableau Software, a U.S.-based start-up visualisation company, went public recently (NYSE:DATA) and it has a current market capitalization of well over $US 4 billion. The application of this technique can provide advantages to U.K. based projects looking to visualise text-based data.
Impact to People:
This project will employ a research assistant and will count with the collaboration of a PhD student and an undergrad research intern. The assistant will closely collaborate with the PI and the PhD student to develop a new system and gain new evidence about the benefits and specifics of the technique. The RA will therefore gain experience in supervision and the development of cutting-edge visualisation systems, their application to real data, as well as in the design, execution and analysis of experiments. This wide-range of skills and balance between empirical and applied research will help the RA in the process of developing as an academic, with unique options to become a practitioner in the use of a new technique.
The PhD student will benefit from the experience of working within a team of top-level academics, and will extent the potential reach of his research. The undergraduate intern will get the opportunity to work on a real research topic, and learn to do research by doing.
Impact to Society:
The application of techniques to visualize data within text such as the one that I propose can enable analysis in economic areas that have not been able to fully benefit from the potential of information visualisation and visual analytics. This is particularly relevant for the digitisation and commercial exploitation of textual historical records (e.g., newspapers see: http://news.bbc.co.uk/1/hi/scotland/tayside_and_central/8688897.stm ).
Indirectly, applications of the technique in the digital humanities can also indirectly benefit our knowledge of the British culture, current society, and our identity. This is particularly reflected by the Palimpsest project, the case study chosen as first application of the technique, which explores literary Edinburgh.
This proposal is aimed at enabling new ways to present and understand data through a novel technique based on typography. A fundamental component of the promise of big data and evidence-based decision making is the ability of humans to make sense of information in an efficient way. Novel techniques that are useful to represent data have therefore the potential to facilitate the big data and evidence-based decision making endeavours, especially if this is for types of data that have been traditionally difficult to represent (the case of text).
The proposed technique can be leveraged by the growing industry of data visualisation businesses, which is also increasingly relevant in the UK but that, unfortunately, is currently led mostly by non-European businesses. For an impression of the size and economic importance of the field, Tableau Software, a U.S.-based start-up visualisation company, went public recently (NYSE:DATA) and it has a current market capitalization of well over $US 4 billion. The application of this technique can provide advantages to U.K. based projects looking to visualise text-based data.
Impact to People:
This project will employ a research assistant and will count with the collaboration of a PhD student and an undergrad research intern. The assistant will closely collaborate with the PI and the PhD student to develop a new system and gain new evidence about the benefits and specifics of the technique. The RA will therefore gain experience in supervision and the development of cutting-edge visualisation systems, their application to real data, as well as in the design, execution and analysis of experiments. This wide-range of skills and balance between empirical and applied research will help the RA in the process of developing as an academic, with unique options to become a practitioner in the use of a new technique.
The PhD student will benefit from the experience of working within a team of top-level academics, and will extent the potential reach of his research. The undergraduate intern will get the opportunity to work on a real research topic, and learn to do research by doing.
Impact to Society:
The application of techniques to visualize data within text such as the one that I propose can enable analysis in economic areas that have not been able to fully benefit from the potential of information visualisation and visual analytics. This is particularly relevant for the digitisation and commercial exploitation of textual historical records (e.g., newspapers see: http://news.bbc.co.uk/1/hi/scotland/tayside_and_central/8688897.stm ).
Indirectly, applications of the technique in the digital humanities can also indirectly benefit our knowledge of the British culture, current society, and our identity. This is particularly reflected by the Palimpsest project, the case study chosen as first application of the technique, which explores literary Edinburgh.
People |
ORCID iD |
Miguel Nacenta (Principal Investigator) |
Publications
Lang J
(2022)
Perception of letter glyph parameters for InfoTypography
in ACM Transactions on Graphics
Description | We have performed a series of experiments that measure which parameters of type are perceivable and to which extent. We have found that general audiences are sensitive to these parameters, and that different parameters are more perceivable than others. We have generally shown that using typographic parameters is possible technically. We are currently in the process of analysing the data to share with the wider community. |
Exploitation Route | It is a bit premature at the moment. We are exploring a number of avenues. |
Sectors | Digital/Communication/Information Technologies (including Software),Education |
URL | http://infotypography.com |
Description | IAA Award |
Amount | £6,000 (GBP) |
Organisation | University of St Andrews |
Sector | Academic/University |
Country | United Kingdom |
Start | 11/2017 |
End | 06/2018 |
Title | Reading small scalar data fields: color scales vs. Detail on Demand vs. FatFonts (dataset) |
Description | |
Type Of Material | Database/Collection of data |
Year Produced | 2017 |
Provided To Others? | Yes |
Description | Prototypo |
Organisation | Prototypo.io |
Country | France |
Sector | Private |
PI Contribution | Debugging of the standard platform before release. Use scenarios. Purchase of a license. |
Collaborator Contribution | Access to the source code before release. Consulting and expertise. Changes to the platform to facilitate our work. |
Impact | Accellerated development of our experimental platform. Creation of an infotypographical font based on Prototypo.io's base font. |
Start Year | 2015 |
Title | Community Design Registrations of 5 fonts |
Description | Community Design Registration Nos 002752063 -0001 Raven (Character set) Community Design Registration Nos 002752063 -0002 Gannet (Character set) Community Design Registration Nos 002752063 -0003 Puffin (Number set) Community Design Registration Nos 002752063 -0004 Puffin (Number set) Community Design Registration Nos 002752063 -0005 Gannet (Number set) |
IP Reference | |
Protection | Copyrighted (e.g. software) |
Year Protection Granted | 2016 |
Licensed | No |
Impact | Impact not visible yet. |
Title | VIZFONTS |
Description | VIZFONTS name that describes the wider set of fonts being generated as part of infotypography |
IP Reference | |
Protection | Trade Mark |
Year Protection Granted | 2016 |
Licensed | No |
Impact | Impacts not yet achieved. |
Title | TypoCartographer |
Description | Enables the creation of maps with continuously varying parameters. |
Type Of Technology | Webtool/Application |
Year Produced | 2016 |
Impact | No impacts yet. |
URL | https://infotypography.com/software-and-data/ |
Description | BBC |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Industry/Business |
Results and Impact | Presentation to the Audiences group at the BBC, partially describing the interest and uses of the research. |
Year(s) Of Engagement Activity | 2018 |