AI & Machine Learning

Four ways to apply NLP in financial services

April 28, 2020

11 min

Jo Stichbury

Freelance Technical Writer

Natural language processing (NLP) is increasingly used to review unstructured content or spot trends in markets. How is Refinitiv Labs applying NLP in financial services to meet challenges around investment decision-making and risk management?

Natural language processing models can be trained to review unstructured content, and spot issues or trends that may impact financial markets.
Content enrichment and sentiment analysis can help financial institutions make more informed investment decisions, and streamline risk management and compliance, especially in response to COVID-19.
NLP in financial services is a key focus for Refinitiv Labs, with work ongoing to quantify sentiment on more than 100 key drivers of equity performance across different content types.

Refinitiv Labs leverages natural language processing (NLP) to optimize data curation, enrich unstructured content, and improve content workflows and data management.

This article looks at some of the benefits of applying NLP in financial services, as well as practical use cases, including Refinitiv Labs projects described to me by Kelvin Rocha, Lead Data Scientist at Refinitiv Labs.

Tackling a firehose of information is a familiar problem in the financial services industry.

Traders and investment managers have numerous sources to comb through, such as research reports, company filings, and transcripts of quarterly earnings calls.

The amount of this kind of unstructured content is accelerating at an unprecedented rate, making it time consuming to analyze.

As a result, unstructured content is underused as a source of insight. It may contain hints that would quantify a trading strategy, but the overwhelming volume of data makes it impossible to spot the nuances that could drive a decision-making process.

Natural language processing (NLP) offers opportunities to uncover meaningful insights from under-used content.

“NLP is a growing area of artificial intelligence, in part assisted by rapid growth in infrastructure, such as computing power and data handling capacity.

In addition, there have been a number of key algorithmic improvements, and a proliferation of open libraries such as the BERT NLP framework, released by Google in 2018,” explains Rocha.

Efficiency: automating the analysis of volumes of unstructured content in real-time

Speed: the value of the information declines rapidly so insights need to be harvested swiftly

Consistency: a single model achieves consistency that is not achievable if performed by a number of human analysts, each of whom may interpret aspects of text slightly differently

Accuracy: unstructured documents can be lengthy, and human analysts can potentially miss or misinterpret information

A model can be trained to learn how to extract meaning from text, allowing applications and services that understand human language to be developed.

Practical examples of NLP in financial services include speech recognition and intent parsing used by voice assistants and chatbots in customer services, and information retrieval and sentiment analysis of corporate documents and news feeds.

Speech recognition and intent parsing
Content enrichment - retrieval
Content enrichment - trends and relationships
Sentiment analysis

Speech recognition is a key piece of the analysis of companies’ quarterly or semi-annual earnings calls.

Corporate conference calls usually start with the company making a presentation on the performance of the previous quarter and the outlook for the following one, followed by a Q&A session in which analysts ask direct and specific questions to the company.

“What and how they ask the questions, and what and how the company answers, including their tone, are likely to reflect on the company’s stock price. Profiling the tone of speech, and converting it to text to quantify it across different key topics, such as revenue, is extremely useful.”

NLP can also be used to retrieve information from unstructured text. This approach is known as named entity recognition (NER), and is used to detect and label entities, that is, real-world concepts, such as people or companies.

NER effectively overlays context on the content by tagging it with machine-readable metadata aligned with an ontology. It’s like having a very detailed Dewey library system, and it means that information retrieval is efficient and accurate.

NLP can also be used to support banks’ compliance processes. Tagging unstructured data facilitates searching across thousands of digital documents, allowing compliance officers to swiftly determine whether regulations have been followed.

“NLP can also be used to create explicit links between supply chain relationships. If the demand for certain products is likely to increase in the near future, then identifying key raw material suppliers would be extremely useful from an investor’s point of view,” adds Rocha.

“Similarly, if a supply chain is expected to be disrupted for some reason, topically, by COVID-19, NER could identify which companies would be affected and to what degree.”

In the investment sphere, applying tags to highlight the main topics covered by text, or topic modeling, is valuable when analyzing earnings calls to establish a main theme, or to compare against previous, similar calls to identify trends.

NER offers additional value, since it can be used to link entities and build a graph of relationships. For example, an entity-modelling system can pick out mentions of specific topics within a range of unstructured text and build new connections.

It can help track relationships between entities, with the potential to detect money laundering or fraud.

Another area of NLP is sentiment analysis, which can extract the subjective meaning from text sufficiently well to be able to determine its attitude, or sentiment. It is an ideal tool for reviewing unstructured content about a particular company to look for inconsistencies and anomalies.

Refinitiv Labs is currently training a new model to identify potential signals of equity performance from thousands of research reports and company transcripts, by identifying changes in outlook over time as potential drivers of equity performance.

“We are currently quantifying the sentiment on more than 100 key drivers of equity performance across different content types: equity research reports, transcripts, news, and company filings,” shares Rocha.

Sentiment analysis can help classify news stories based on positive and negative sentiment to indicate the likely impact on a stock price, but also has more nuanced uses.

Refinitiv Labs believes that future advances in neural networks will be key to the development of NLP, with the potential to transform financial services.

“By combining equities’ sentiment scores across different dimensions with a variety of metrics on the evolution of COVID-19, such as the number of cases, death rates, recovery rates, or active cases per capita, we could potentially identify the key drivers of equity performance, including what stocks are affected most, and to what degree.

“NLP could be used to pair COVID-19 mentions in unstructured content to sentiment-based signals. Although the strength of the signal could vary based on geography and industry, having an aggregated sentiment on COVID-19 at the equity level and across different content types could be used to predict future, market-adjusted stock returns.”

See all Insights

LSEG Labs combines the best data, technology, talent and customer partnerships to deliver validated solutions to financial markets at speed.

Learn more Opens in a new tab

Email address

Country/territory

Subscribe to an email recap from:

FTSE Russell

Data & Analytics

Republication or redistribution of LSE Group content is prohibited without our prior written consent.

The content of this publication is for informational purposes only and has no legal effect, does not form part of any contract, does not, and does not seek to constitute advice of any nature and no reliance should be placed upon statements contained herein. Whilst reasonable efforts have been taken to ensure that the contents of this publication are accurate and reliable, LSE Group does not guarantee that this document is free from errors or omissions; therefore, you may not rely upon the content of this document under any circumstances and you should seek your own independent legal, investment, tax and other advice. Neither We nor our affiliates shall be liable for any errors, inaccuracies or delays in the publication or any other content, or for any actions taken by you in reliance thereon.

The content of this publication is provided by London Stock Exchange Group plc, its applicable group undertakings and/or its affiliates or licensors (the “LSE Group” or “We”) exclusively.

Neither We nor our affiliates guarantee the accuracy of or endorse the views or opinions given by any third party content provider, advertiser, sponsor or other user. We may link to, reference, or promote websites, applications and/or services from third parties. You agree that We are not responsible for, and do not control such non-LSE Group websites, applications or services.

The content of this publication is for informational purposes only. All information and data contained in this publication is obtained by LSE Group from sources believed by it to be accurate and reliable. Because of the possibility of human and mechanical error as well as other factors, however, such information and data are provided "as is" without warranty of any kind. You understand and agree that this publication does not, and does not seek to, constitute advice of any nature. You may not rely upon the content of this document under any circumstances and should seek your own independent legal, tax or investment advice or opinion regarding the suitability, value or profitability of any particular security, portfolio or investment strategy. Neither We nor our affiliates shall be liable for any errors, inaccuracies or delays in the publication or any other content, or for any actions taken by you in reliance thereon. You expressly agree that your use of the publication and its content is at your sole risk.

To the fullest extent permitted by applicable law, LSE Group, expressly disclaims any representation or warranties, express or implied, including, without limitation, any representations or warranties of performance, merchantability, fitness for a particular purpose, accuracy, completeness, reliability and non-infringement. LSE Group, its subsidiaries, its affiliates and their respective shareholders, directors, officers employees, agents, advertisers, content providers and licensors (collectively referred to as the “LSE Group Parties”) disclaim all responsibility for any loss, liability or damage of any kind resulting from or related to access, use or the unavailability of the publication (or any part of it); and none of the LSE Group Parties will be liable (jointly or severally) to you for any direct, indirect, consequential, special, incidental, punitive or exemplary damages, howsoever arising, even if any member of the LSE Group Parties are advised in advance of the possibility of such damages or could have foreseen any such damages arising or resulting from the use of, or inability to use, the information contained in the publication. For the avoidance of doubt, the LSE Group Parties shall have no liability for any losses, claims, demands, actions, proceedings, damages, costs or expenses arising out of, or in any way connected with, the information contained in this document.

LSE Group is the owner of various intellectual property rights ("IPR”), including but not limited to, numerous trademarks that are used to identify, advertise, and promote LSE Group products, services and activities. Nothing contained herein should be construed as granting any licence or right to use any of the trademarks or any other LSE Group IPR for any purpose whatsoever without the written permission or applicable licence terms.