Do Neural Networks overfit less than Decision Trees?

Discussion in 'Strategy Building' started by damien.dualsigma, Oct 28, 2017.

  1. Sprout

    Sprout

    At the risk of being off- topic (and a novice in this domain to boot), is Alphago from Deepmind curve fitting as it teaches itself?

    https://www.wired.com/story/this-more-powerful-version-of-alphago-learns-on-its-own/


    Three Main components of Alphago:

    Policy Network-
    Trained on high level games to imitate the opponents.

    Value Network-
    Evaluates board positions and determines probability of winning from these positions.

    Tree Search -
    Looks through the different variations of the game from current positions to determine probability of future outcomes.


    Netflix currently has a documentary on the competition between Alphago and the world’s best Go champion - Lee Sedol.

    What was surprising is the crowd’s realization when confronted with the idea of human’s being ‘out-thought’ by a machine and the huge humble pie that was served.
     
    #11     Jan 7, 2018
  2. gon

    gon

    T
    When you read "policies", "values" or "rewards", "agents" and so on, you are speaking most likely about reinforcement learning. If it uses trees, probably it's using gradient descent.
     
    #12     Jan 7, 2018
    Sprout likes this.
  3. All right so thanks everyone..
    Now, the key question is.. do you need to make sure parameters introduced in random forest or neural networks or whatever ML algorithm are statistically relevant in driving the output function? By themselves or as an interaction with others? Or is ML going to figure out further meaningful relationships not actually statistically relevant at first sight?
     
    #13     Jan 7, 2018
  4. gon

    gon

    Introducing redundant parameters will reduce the performance of your system.

    For instance, if you introduce and index and one stock very representative of that index that has also the same dynamics, it will reduce the chances of a good model.

    If you insert open and close prices, since they may be similar, that will probably reduce the performance too.

    You can try using a correlation matrix and other feature selection techniques to avoid overloading your model with:
    - Redundant predictors.
    - Non-significative predictors.

    Also, some models may require you to standarize the data you use. And others will require you to also normalize it. Others may require you scale your data within a certain range such as [0,1]. And for categorical data, you need always to transform it to binary representation...

    If you are tempted to use preparation methods such as Principal Component Analysis, you must be also aware that it does not necessarily will reduce all the features into something useful.
     
    #14     Jan 7, 2018
    Sprout likes this.
  5. sle

    sle

    The issue with random forests is the discrete (piece-wise linear) nature of the splits. That's a problem with almost all "classifier" methods, you end up with discrete populations which is not necessarily optimal in finance.

    Looks like a pretty good book, especially if one is just getting started. Thanks!

    Actually, PCA is a great tool for financial applications, if you know how to use it.
     
    #15     Jan 7, 2018
    gon likes this.
  6. gon

    gon

    Interesting point.

    Just for curiosity, do you know about any specific NN topology especially good for finance?

    I am curious about it.

    In the past I have been tempted to use NNs to parse directly charts, similar to this:

    https://arxiv.org/ftp/arxiv/papers/1111/1111.5892.pdf
     
    #16     Jan 7, 2018
  7. userque

    userque

    Yes.

    A NN will build a network designed to forecast lottery drawings, based upon past drawings. It will deem some past drawing inputs more relevant that others.

    NN's will find all sorts 'relationships' in the training data, even if they are meaningless on unseen data.

    I haven't used random forest much, but I believe the same holds true, but to a lesser degree.
     
    #17     Jan 7, 2018
  8.  
    #18     Jan 7, 2018
  9. userque

    userque

    @damien.dualsigma , I see you've quoted me, but I don't see any response you've made--other than just quoting my post. Am I missing something?
     
    #19     Jan 7, 2018
  10. sle

    sle

    A good R^2 does not mean a good strategy, obviously, but this would be a good start. At the very least, I'd run it with IRL execution assumptions and see how it fares.
     
    #20     Jan 7, 2018