Forecasting stock prices with a feature fusion LSTM-CNN model using different representations

Discussion in 'Automated Trading' started by nooby_mcnoob, Dec 5, 2019.

  1. IAS_LLC

    IAS_LLC

    #11     Dec 6, 2019
  2. #12     Dec 6, 2019
    IAS_LLC likes this.
  3. IAS_LLC

    IAS_LLC

  4. IAS_LLC

    IAS_LLC

    Logical people do logical things.

    Your background in engineering or computer science? I'm a Control Systems guy...so state-space modeling and kalman filtering is at the forefront of how I approach problems.
     
    #14     Dec 6, 2019
  5. Computer science, but I do gravitate towards signal processing. The only problem is I have no formal introduction to it and I don't really know how to become an expert in it. Is there a "signal processing" book that you would recommend?
     
    #15     Dec 6, 2019
  6. IAS_LLC

    IAS_LLC

    "Fundamentals of Object Tracking" by Challa et al is what I used to use as a reference for bayesian filtering (Kalman,etc..). It explains the theory, but I primarily used it as a recipe book. Its applications are obviously geared towards aircraft tracking...but a Bayesian Filter is a Bayesian Filter...only the state transition and observations models change.
     
    #16     Dec 6, 2019
    nooby_mcnoob likes this.
  7. Awesome, will check it out.
     
    #17     Dec 6, 2019
  8. geneticien

    geneticien Guest

    From the paper:
    "In the case of Naive model, since it is judged that Pt is the same at the time point t + 5, it cannot be determined whether it will rise at the time point t + 5. Therefore, we assumed trading at each time point and as a result we recorded 15.48% revenue."

    I don't get it. If P_{t+5}=P_t, i.e., the naive strategy says that the price will not change, then how exactly it can be used to make profit?

    "Buy and hold strategy is a simple trading method to buy shares at the time of starting trading and sell shares at the end of the trading period. We confirmed that revenue was 3.09% when we implemented this strategy."

    So it seems that the time interval is so small that the market is not really positive in that time window. In other words, even a few years of US equities would lead to a large positive % (except from 2008-2009). Short out-of-sample time series are not really convincing.

    Then, their approach shows 17.38% compared to 15.48% of the naive strategy. Can the difference can be attested to overfitting of hyperparameters? They don't show a single figure with evolution of training and validation curves, that is weird.

    I hope you find the paper useful but after my quick check it looks like a mess to me. Well, it is PLOS One, that is probably why.
     
    #18     Dec 6, 2019
  9. Just looking at that paper raises several flags, and makes it not really worthy of continuing (other than academic theory)...

    Some examples
    1) Use 1 year of data??? 2016 that's great for the one year of data. Too bad next year may be nothing like it.
    2) RMSE is kind of a useless metric to your trading system. So, you got an RMSE improvement of .34%, so what. How does that translate to profit, or some other suitable metric?
    3) Why compare system 1 RMSE to system 2 etc? It's ok, but in many ways, useless. Better to compare all to some kind of baseline (with a more profit oriented objective/loss function).

    They may have addressed some of this as I didn't read further.

    Markov chains are very useful. Defining the states and optimization criteria, are the tough parts (as is the case with all ML).
     
    Last edited: Dec 6, 2019
    #19     Dec 6, 2019
    IAS_LLC likes this.
  10. Like most papers, they probably have mistakes that prevent the results from being reliable. I usually read for ideas and assume if they don't provide all the code that they don't really believe the results work.
     
    #20     Dec 6, 2019