Log in or Sign up

ET News & Sponsor Info

General Topics

Markets

Technical Topics

Brokerage Firms

Company Specific

Interactive Brokers

Tools of the Trade

Trading for a Living

Community Lounge

Site Support

Feedback

What's the point of machine learning?

Discussion in 'Trading' started by barkbark, Mar 15, 2023.

M.W.
- 4,047
  Posts
- 1,452
  Likes
I think you fail to grasp the advances of "AI" over the past few years. Gpt-4 does a lot more than finding a "best fit" . Not sure whether your post was meant to be cynical or ironic or serious.

Specterx said:
The basic use case of machine learning is classification - identifying sets of existing observations which are in some way related, and telling you which of those sets (if any) new observations belong to. (Good on you for realizing that, once you strip away the hype and gobbledegook, "AI" as it exists today amounts to finding the best-fit line for a set of points) For trading, one could (for example) identify the set of "times where I want to have exposure", run an algo to try and pick out salient features of those times, then check whether "right now" falls into the set.

This kind of thing is playing right on the home court of quant funds who can hire huge teams of math PhDs and far more data than you could ever obtain, let alone process - so I'd spend some time thinking about where you as retail might have a structural edge.
More...

Last edited: Mar 16, 2023

#21 Mar 16, 2023

Share
M.W.
- 4,047
  Posts
- 1,452
  Likes
"just"? Finding the best model, parameterized and fitted to actual observations as well as learning from mistakes and being rewarded for doing the right thing is exactly how humans and to a more limited degree animals learn. I would not call this "just". We are at a juncture where the deep neural networks are not the bottleneck to better models anymore but hardware resources and the amount of training data we throw at such networks.

Zwaen said:
Machine learning is just modelling an outcome to variables. An outcome can be lineair or categoric. Machine learning is just modelling these outcomes from your given variables, and "the machine" returns you the best model. Basically when correlation is high your model will be well fitted to the data. This is always "known data", the past. Models cannot measure causality. The problem you had is that you modelled the past data but the correlation break with future data because they are not causal, or have changed.

Btw ANN's are also build upon lineair and logestic regression functions.
More...

#22 Mar 16, 2023

Share
trade4succes
- 1,154
  Posts
- 75
  Likes
Zwaen said:
Interesting, will look it up! Was unknown with this field.
With to post i was reffering to the 'standard' ml models, svm, knn etc, fwiw
More...

Fair enough. I liked "the book of why" regarding causal AI.

On another note, my opinion on the subject is that financial data is too noisy and sparse (except in very high frequency) for any realistic advantage with AI for most investors. Never say never though.

#23 Mar 16, 2023

Share

barkbark likes this.
werewitt
- 10
  Posts
- 4
  Likes
I’m sure this has been said before but there is much much more to ML than linear regressions. LR is sort of a basis from which everything, even singular “neurons” are constructed but saying that’s this is all there is to ML is like saying all maths is simply arithmetic

#24 Mar 16, 2023

Share
TrAndy2022
- 1,615
  Posts
- 1,022
  Likes
Actually ML is for me just a new way for optimization and more curve fitting. Garbage in Garbage out as usual here.

#25 Mar 19, 2023

Share
M.W.
- 4,047
  Posts
- 1,452
  Likes
Of course the input data are very important and to a large degree decide over a model's viability or not. But isn't optimization exactly what we as humans aim for? As a baby you could not walk. Then your brain formed connections based on your experiences (data) to optimize how you move and coordinate your legs to stand up and walk, then sprint, then run. This is what this models in ML and DL aim to do as well. Obviously, the capability to handle complexity will have a large impact on the ability to handle complex relationships between data. "optimization" is not a drawback of those model, no matter how simplistic or complex, it is the end-goal.

TrAndy2022 said:
Actually ML is for me just a new way for optimization and more curve fitting. Garbage in Garbage out as usual here.
More...

#26 Mar 19, 2023

Share

rb7 likes this.
barkbark
- 11
  Posts
- 5
  Likes
https://swngui.medium.com/python-tutorial-using-lasso-to-predict-stock-prices-ee71f82aa698

But this tutorial is what I'm taking about. What do I do with the mean squared error? Is it that he's comparing adj close to close purely for demonstration purposes? Because I don't get it. The wikipedia article about it isn't much help, as it seems aimed at someone who has a 201ish knowledge of statistics. I did take a statistics class or two in college, but while I did well I no longer remember any of it.

#27 Mar 8, 2024

Share
ph1l
- 3,546
  Posts
- 2,210
  Likes
barkbark said:
https://swngui.medium.com/python-tutorial-using-lasso-to-predict-stock-prices-ee71f82aa698

But this tutorial is what I'm taking about. What do I do with the mean squared error? Is it that he's comparing adj close to close purely for demonstration purposes? Because I don't get it. The wikipedia article about it isn't much help, as it seems aimed at someone who has a 201ish knowledge of statistics. I did take a statistics class or two in college, but while I did well I no longer remember any of it.
More...

Disclaimer: I don't know python.

The tutorial says

We will be using the Adj Close column as our target variable and the other columns as our features.
More...

I assume by other columns, it means date, open. high, low. close. and volume. So, it looks like it creates the weights for a linear model of these columns to predict AdjClose using the lasso method as

Code:

AdjClose = (W1 * date) + (W2 * open) + (W3 * high) + (W4 * low) + (W5 * close) + (W6 * volume)

And the mean squared error would be the sums of

Code:

( (PredictedAdjClose - ActualAdjClose) ^ 2 )

on the test data instances divided by the number of test data instances.

My guess is the mean squared error will be really, really small given that the input in the example is GOOGL.
Last edited: Mar 8, 2024

#28 Mar 8, 2024

Share

barkbark likes this.
v-shape-0DTE
- 1,266
  Posts
- 462
  Likes
No offense, but you can’t complain about ML and then talk about a linear regression. That is not going to get it done and is not some holy grail. It’s going to take way more. Looks to me like you just hoping to spot out something that’s profitable. Why would linear regression be the the right approach in the market. It’s essentially drawing a line that matches data points. It’s far more complex than that.

#29 Mar 8, 2024

Share

MarkBrown likes this.
ph1l
- 3,546
  Posts
- 2,210
  Likes
v-shape-0DTE said:
No offense, but you can’t complain about ML and then talk about a linear regression. That is not going to get it done and is not some holy grail. It’s going to take way more. Looks to me like you just hoping to spot out something that’s profitable. Why would linear regression be the the right approach in the market. It’s essentially drawing a line that matches data points. It’s far more complex than that.
More...
- Stock_Price_Prediction_Linear_Regression_Machine_Learning_Antad.pdf
  
  File size:
  
  749.8 KB
  
  Views:
  
  18
#30 Mar 8, 2024

Share

(You must log in or sign up to reply here.)

Search