Fully automated futures trading

Discussion in 'Journals' started by globalarbtrader, Feb 11, 2015.

  1. They are not assets per se, but rule variations. If I recall correctly, they are ewmac 2x8, normalised momentum 2x8 and breakout 10.

    BTW I have added a "PS" part in the previous post about the method sensitivity to the clustering algorithm.
     
    #2371     Nov 13, 2020
  2. This is what I am looking at. If I run a clustering algorithm on pooled returns for rules, I get the following 9 clusters (3x3, I named the top 3):

    -- fast rule cluster --
    1 momentum2
    1 normmom2
    1 breakout10

    2 momentum4
    2 normmom4

    3 breakout20

    -- medium speed rule cluster ---
    1 momentum8
    1 normmom8
    1 breakout40

    2 momentum16
    2 normmom16

    3 breakout80

    -- slow rule cluster --
    1 momentum32
    1 normmom32
    1 breakout160

    2 normmom64
    2 momentum64

    3 breakout320


    But if I did this manually, I'd choose the following clustering - the 3 top clusters would be by rule type and each rule cluster would have 2 clusters, 1 for faster and 1 for slower variations:
    --- momentum rule ---
    1 momentum2
    1 momentum4
    1 momentum8

    2 momentum16
    2 momentum32
    2 momentum64

    --- normalised momentum rule ---
    1 normmom2
    1 normmom4
    1 normmom8

    2 normmom16
    2 normmom32
    2 normmom64

    --- breakout rule ---
    1 breakout10
    1 breakout20
    1 breakout40

    2 breakout80
    2 breakout160
    2 breakout320


    If I ran correlation algorithms on both clustering styles, I'd get very different weights.
    Or am I overthinking this too much?
     
    #2372     Nov 13, 2020
  3. So, the data is telling you that faster momentum rules have more in common with each other than they do with their variations. Is the right response to ignore this, or to listen to it? (I don't actually know if there is a right response, but it's interesting information none the less).

    Would you really get very different weights? IDM does some of the heavy lifting in balancing out different clustering results. More to the point, even if the weights really were very different, I'd bet serious money that the backtested SR came out pretty similar.

    I'm not saying you're overthinking, as I think it's important to understand what's going on under the hood and you probably (like me!) find this intellectually interesting. Just to warn that it's unlikely anyone will get any serious alpha from tinkering with optimisation. Get a set of reasonably diversified rules. Get a set of diversified instruments (most important of all). Get weights that are roughly correct. And that is as good as it gets.

    GAT
     
    #2373     Nov 13, 2020
  4. Yes, it's 90% intellectual curiosity and 10% need to construct a portfolio I could use in live trading.

    I'll do weightings of both clustering hierarchies and compare the weights. Perhaps you are right and there won't be much difference. I will use the new method with the preferred parameters of 4 x stdev of correlation estimation, 500 data points (~10 years of weekly data) and min weight adjustment.

    I also have an additional idea I wish to test. The new method is not limited to 3 asset problem. I'm thinking of expanding it to 6 assets and running the weighting for clusters of all 6 variations per rule style. It does increase computing time, but not to astronomical levels. That would be interesting to compare against the above clustering schemes.

    P.S. I do agree that it will very likely have no discernible difference in the actual long term backtest results.
     
    Last edited: Nov 13, 2020
    #2374     Nov 13, 2020
  5. Alternatively you could say "why bother?" if the correlation between A, B and C is so high. Just choose one of them could be a simple strategy.
     
    #2375     Nov 13, 2020
  6. Even with high correlations there is some, albeit very small, diversification advantage.
     
    #2376     Nov 14, 2020
  7. sef88

    sef88

    Would some sort of dimension reduction like PCA be useful to get further insights? (Or did you do that for the clustering part?)
     
    #2377     Nov 14, 2020
  8. Just did the first comparison: calculating weights (with diversification multipliers applied) for clustering generated by the algorithm (hierarchical complete-linkage clustering with max cluster size = 3) VS manual clustering using human judgement and warm feelings towards neat and clean categories:

    clustering_weights_comparison.png

    It appears that the final weighting is very sensitive to the clustering output.

    In the next post I will check if there is any sort of significant differences in SR between the two outputs.
     
    #2378     Nov 15, 2020
  9. Do you mean applying PCA to the correlation matrix?
    I did not do it. The clustering is hierarchical complete-linkage algorithm with max cluster size of 3 (to accommodate the handcrafting "candidates").
     
    #2379     Nov 15, 2020
    sef88 likes this.
  10. Auto VS Manual clustering weightings turned out quite different. However when checking basic stats, I'm getting:

    auto clustering:
    mean = 5.6671
    stdev = 168.80
    mean / stdev = 0.033573


    manual clustering:
    mean = 5.6206
    stdev = 172.09
    mean / stdev = 0.032661


    equal weights:
    mean = 4.3331
    stdev = 135.27
    mean / stdev = 0.032034


    Based on 47793 data points - weekly returns for 30 years for 37 futures markets. No trading costs or SR adjustment done.

    It does look like auto clustering gives the best result (in the backtest...), but is it significantly better, that's another question.
     
    #2380     Nov 15, 2020