There is a history of hybrid machine learning approaches where the result of an unsupervised learning algorithm is used to provide data annotation from which ILP can learn in the usual supervised manner. Here we consider the task of predicting the property of cointegration between the time series of stock price of two companies, which can be used to implement a robust pair-trading strategy that can remain profitable regardless of the overall direction in which the market evolves. We start with an original FinTech ontology of relations between companies and their managers, which we have previously extracted from SEC reports, quarterly filings that are mandatory for all US companies. When combined with stock price time series, these relations have been shown to help find pairs of companies suitable to pair trading. Here we use node2vec embeddings to produce clusters of companies and managers, which are then used as background predicates in addition to the relations linking companies and staff present in the ontology, and the values of the target predicate for a given time period. Progol is used to learn from this mixture of predicates combining numerical with structural relations of the entities represented in the data set to reveal rules with and predictive power.
|Title of host publication||Inductive Logic Programming, Proceedings of the 30th International Conference|
|Number of pages||14|
|Publication status||Accepted/In press - 20 Sep 2021|
|Event||30th International Conference on Inductive Logic Programming: ILP2020-21@IJCLR - Online, Athens, Greece|
Duration: 25 Oct 2021 → 27 Oct 2021
Conference number: 30
|Conference||30th International Conference on Inductive Logic Programming|
|Period||25/10/21 → 27/10/21|
Bibliographical note© 2021 Springer Nature Switzerland AG.
- financial forecasting
- SEC financial reports
- Inductive Logic Programming (ILP)
- unsupervised learning
- graph embedding