Best place to purchase historical stock data for backtesting??

Discussion in 'Data Sets and Feeds' started by sneakoner, Sep 4, 2011.


  1. Based on this link I found on Google:

    http://www.ninjatrader.com/support/forum/showthread.php?t=47387

    and this link provided by Kyle @ Ninjatrader customer service:

    http://www.ninjatrader.com/support/helpGuides/nt7/index.html?importing.htm


    [Open Quote
    Tick Format
    Each tick must be on its own line and fields must be separated by semicolon (;).

    The format is:
    yyyyMMdd HHmmss;price;volume

    Sample data:
    20061107 000431;1383.00;1
    20061107 000456;1383.25;25
    20061107 000456;1383.25;36
    20061107 000537;1383.25;14
    End Quote]



    I don't see any sub-second time stamp ability from their API. I can promise you that there is zero chance that five levels deep of market data on ES amounts to 20MB per day unless it is cut down substantially.

    For example, Bloomberg data for SPY pulled via their API ranges from 56MB to 86MB per day (over the last 5 years) and that's stripped down - quotes and prints only. No cancels, no broker codes, no partial/full designation, no crosses, no order type, no crosses, etc.

    If you are at 20mb per day and you only have market depth five levels deep you are probably not getting every exchange and are certainly not getting the full trade details with every message.
     
    #51     Feb 19, 2012
  2. dyson

    dyson

    I disagree with this. I use minute based data for intraday backtesting and daily data for swing based testing. As long as you know and can adjust your code to avoid the weakness of the data precision, you should be ok. For the average trader, tick data actually has many disadvantages.

    One caveat with unfiltered tick data is a bad tick on one feed may not be present on another feed. Thus, comparing data from multiple, unfiltered vendors will yield different test results. Although this is true will all brokers and feeds, many fail to recognize this fact.

    TickData cleans their data. They even have a white paper on the subject "High Frequency Data Filtering - Tick Data".
     
    #52     Feb 22, 2012
  3. How so?

    I've been using pi's data for backtesting and haven't had any problems with it. I've compared it against IB's historical data, and they seem to match up.

    Also, to those of you suspicious of how cheap it is: how much do you think it costs them to burn you a DVD and send it to you? Some places charge a shitload of money because they know that a lot of people correlate cost with quality, and want to help those people get rid of that extra cash. Other places just want to attract a lot of customers. In the end, I doubt it costs any of them that much to subscribe to a data source, save it to a disk, and then burn it to a DVD for subscribers.
     
    #53     Feb 23, 2012
  4. #54     Feb 24, 2012
  5. ===============
    Mr.Sne;
    Another way to reach your Nasdaqqq goal;
    divide the market into bull markets/bear markets/200 day moving average. Then Test. You may want to buy some books recommended [top green line this website/page], or spend$ $8,400+ - but that just the first step in a lengthy process.

    And get your hands dirty with the data[hourly, 4/5 hours,40 or 50 days...;
    sure its exspensive in time, but it stays in you mind so much better.There are no shortcuts, but differences in bull & bear is much.
    Some who are visual like charts[picture=2,000 words];
    my old CPA doesnt like charts, he likes numbers.............:cool:
     
    #55     Feb 25, 2012
  6. Another reason i prefer the time exspense of printed/paper charts;
    several different data companys ''forgot'' to mention Citigroup split 10 times reverse, its really a $3.25 penny stock.LOL:D Big BIG difference.......................................

    MSFT split once or more...............................................;
    i noted that by hand on my charts-amazing the mistakes that ''hide'' or are simply concealed in lots of data.:cool:
     
    #56     Feb 25, 2012
  7. dyson

    dyson

    Is this site a joke? Who would pay for this?
     
    #57     Feb 25, 2012
  8. Yeah, it was way too expensive. $34.95 per symbol? I don't need that when all I have to do is click edit data.
     
    #58     Feb 25, 2012
  9. +++

    $99/mo if you open an account. First three months are free.

    And the backtesting engine is the quickest / easiest I've tried.

    And it will paper trade your algos when you're ready to forward test.

    And it can walk forward optimize on a portfolio....

    :D
     
    #59     Feb 25, 2012


  10. Interesting!
    I'll bring it up with them and see what they say
    Thanks!
     
    #60     Feb 25, 2012