Log in or Sign up

ET News & Sponsor Info

General Topics

Markets

Technical Topics

Brokerage Firms

Company Specific

Interactive Brokers

Tools of the Trade

Trading for a Living

Community Lounge

Site Support

Feedback

How do you avoid overfitting or over-optimization in your backtest?

Discussion in 'Strategy Building' started by mizhael, Feb 24, 2010.

mizhael
- 1,386
  Posts
- 0
  Likes
I heard about "Walk Forward Analysis",

to me it's just like a dynamic rolling-window out-of-sample test coupled with optimization.

The result of this "Walk Forward Analysis" is a set of "optimized" parameters that are suitable for all history(it's robust, but kind of conservative).

Am I right?

Any good ways of avoiding over-fitting or over-optimization?

Thanks!

#1 Feb 24, 2010

Share
schizo
- 17,980
  Posts
- 11,024
  Likes
Trade with real money and see how much you can actually lose. Seriously.

#2 Feb 24, 2010

Share
Arthur Deco
- 1,897
  Posts
- 3
  Likes
The first caveat is don't optimize if the bare strategy without stops isn't significantly profitable.

#3 Feb 24, 2010

Share
psytrade
- 2,597
  Posts
- 0
  Likes
create a complexity number for each function... give the functions with the least complexity the most risk capital, and test the results.

I have no idea what the results you'll get, but by evaluating the complexity of parameters should avoid over optimization

You might be able to evaluate different strategies at the same time as well:

http://en.wikipedia.org/wiki/Cyclomatic_complexity

#4 Feb 24, 2010

Share
intradaybill
- 2,961
  Posts
- 11
  Likes
Quote from mizhael:

Any good ways of avoiding over-fitting or over-optimization?

Thanks!
More...

One way is by using price action formations. You replace the problem though with selection bias. One can have either curve-fitted signals or signals that are selected over others but you cannot avoid both, at least I do not know of a way of doing that. I prefer selection bias over curve-fitting.

Examples

(1) Curve-fitting: SMA(a) > SMA(b), adjust a and b for profitability

(2) Selection-bias: close > high(2) has been profitable but not close > high(1), etc. You select the profitable one.

#5 Feb 24, 2010

Share
stevegee58
- 3,580
  Posts
- 474
  Likes
When optimizing system variables, I look for "fragile" settings. This is where say 37 is profitable but 36 and 38 perform abysmally. The 37 setting is probably profitable due to one or two lucky trades it took advantage of.

I like to see ranges of system variable values that are profitable. Then I pick one in the middle rather than the one that produced the best results.

#6 Feb 24, 2010

Share
Dacamic Guest
- 30
  Posts
- 0
  Likes
Quote from stevegee58:

When optimizing system variables, I look for "fragile" settings. This is where say 37 is profitable but 36 and 38 perform abysmally. The 37 setting is probably profitable due to one or two lucky trades it took advantage of.

I like to see ranges of system variable values that are profitable. Then I pick one in the middle rather than the one that produced the best results.
More...

Agreed, which is why I prefer thinking of this process as evaluating robustness rather than optimizing parameters.

#7 Feb 24, 2010

Share
intradaybill
- 2,961
  Posts
- 11
  Likes
Quote from stevegee58:

When optimizing system variables, I look for "fragile" settings. This is where say 37 is profitable but 36 and 38 perform abysmally. The 37 setting is probably profitable due to one or two lucky trades it took advantage of.

I like to see ranges of system variable values that are profitable. Then I pick one in the middle rather than the one that produced the best results.
More...

There is no such a thing as a "somewhat pregnant" woman. The same goes about optimization. The issue is whether a system is optimized or not. Any system that utilizes a function to generate signals is optimized with respect to some objective function.

#8 Feb 24, 2010

Share
stevegee58
- 3,580
  Posts
- 474
  Likes
Well, right. Even a moving average crossing system has to be run through an optimizer to find which combinations of MA periods work and which don't. You can't just pick 2 at random.

#9 Feb 24, 2010

Share
intradaybill
- 2,961
  Posts
- 11
  Likes
Quote from stevegee58:

Well, right. Even a moving average crossing system has to be run through an optimizer to find which combinations of MA periods work and which don't. You can't just pick 2 at random.
More...

Well, the point is that regardless of the choice of paramaters, such system is always optimized. You can always find an objective function that any such system maximizes. The lesson is that if you have any parameters at all in your system, it is optimized, whether you actually optimized it or not.

I tell you, very few understand what I wrote above but I know a few do.

#10 Feb 25, 2010

Share

(You must log in or sign up to reply here.)

Search