Combining diversified alpha to deliver superior Sharpe

By Marcus Williamson

In this article I show that very basic quantitative trading strategies that generate returns from different market behaviours, when combined, can provide a more desirable and stable returns stream, as reflected in a Sharpe ratio higher than any individual strategy. We show how absolute returns can be some-what ‘sacrificed’ for an improved risk adjusted return stream, which then can be later leveraged as per the investors risk appetite.

The strategies in this analysis were previously optimised, please find the supplementary optimisation Notebooks here.

Analysis has been trimmed for brevity, please find the full piece including all details and figures from backtesting here

  1. Introduction
  2. Backesting Optimised Strategies
  3. Combining Optimised Strategies
  4. Review of Results

1. Introduction

This Notebook highlights the backbone to the approach of the Artificial Investor ideology:

Combining diversified alpha generators to deliver consistent returns

We aim to show here that very basic quantitatively driven strategies that generate returns from different market behaviours when combined can provide a more desirable and relaiable return stream that can be leveraged to generate a superior return to the market, although we must be clear that this is not the objective.

Below we compare the different strategies alone vs when combined in any particular combination is somewhat inferior to the quality of returns stream generated when all are combined together.

Desirable Returns:

Desirable returns is understandably a very subjective and a weak term, here I shall briefly outline what we determine as reliable. Risk free rate is based on US Treasuries, see here. An algorithm’s returns ideally have:

  • High Sharpe Ratio – Deviation risk measure (Reward to variability)
  • High Stability – R-squared of a linear fit to the returns
  • High Alpha – Active return of investment
  • High Calmar Ratio – Compound annual returns divided by Max Drawdown
  • High Sortino Ratio – Variation on Sharpe penilising downside volatility only
  • High Omega Ratio – Probabilty weighted ratio of returns vs losses
  • Low Max Drawdown – Highest to lowest point since inception
  • Low Volatility – Degree of variation in returns
  • Low Beta – Exposure to general market movements
  • Low Kurtois – “Peakedness” of returns vs Normal Distribution (High = Fat tails
  • Positive Skew – “Skewdness” of returns vs Normal Distribution

The explicit valuations depend entirely on circumstances such as strategy being tested, the current market regime it is tested over, and the slippage or market impact models being implemented. However one is able to compare apples to apples when looking at the same algorithms’s strategies in various combinations over the same periods.

Please note: the securities chosen to trade are simply for illustrative purpose, no thought has gone into what pair of stocks to trade, it has simply been taken from wikipedia.

Dynamic Rebalancing

A dynamic monthly rebalancing has been implemented for the first two strategies in this algorithm, the frequency was chosen after reviewing the peformance. This is based closely on the Kelly Criterion, looking at the covariance of returns, see more here.

In the backtests I compare the peformance of the strategies with and without this dynamic rebalancing. There has been no independent leverage control implemente which could theoretically (and should in practice) force a strategy to zero leverage if it consistently underperforms. This is something I hope to implement in my next project, along with fixing two potential problems, which I cover below.

Known Issues

  • Initial investigation presents the potential issue that the daily returns may be being incorrectly calculated for the strategies which hold some positions over night.
  • Leverage control ensures the algorithm does not exceed a leverage of 1.05, however on three occasions over 8 years this leverage is excedeeded momentarily, I believe this is a bug in the timing of orders vs leverage calculation.

Although both these issues arguably have little affect to my performance it is of the upmost importance that quantitatively driven strategies have failsafe watertight architecture that does not have bugs. Flaws in the programming present the greatest danger to quantiatively driven models.

Transaction Costs & Slippage

  • Applicable fees can be found here, the minimum order charge is 1 USD per order and 0.005 USD per share for fixed tier pricing
  • Slippage model implemented can be read about here

In both cases the actual algorithm contains a more generous setup as compared to the backtesting environment, it is in line with the realistic costings as defined by the broker API however.

2. Backtesting Optimised Strategies

The original algorithm has been optimised over a total six year period in two three year segments, the parameters have been selected with a focus on consistency and stability of returns rather than maximal performance

In Sample Backtest: 01/01/07 – 01/01/13

Out-of-sample Backtest: 01/01/13 – 01/01/15

Mean Reversion

Security: S&P 500 ETF (SPY)

Strategy Overview:

  • Based on the assumption that a stock’s price will tend to move to the average price over time
  • Stock price analysis involves both identifying the trading range for a stock and computing the average price using analytical techniques
  • Trading logic:
    • When current market price is less than the average price, purchase on expectation of a return of the stock price to the average price
    • When current market price is greater than the average price, sell on expectation of a return of the stock price to the average price


  • Simple forms of this strategy typically have little in the way of risk management, failing to mean revert due to exceptional circumstances, such as bankruptcy, can be catastrophic without human intervention or very advanced semantic interface
  • Order feathering, absolute stop losses, and more advanced methods to detect suitable entry points would vastly improve strategy performance

Read more on Mean Reversion here.

I chose to include this strategy in my algorithm as consistent returns should theoretically be generated in QE tainted environments. My rationale for this lies in the theory that the stock market will revert to a support/resistance level proxy of Governemnt balance sheets, as they expand to support the market growth (prices roughly follow the expansion). I use the word ‘theory’ here as I have not personally performed significance testing on this, nor have seen any papers so far proving so.

Markus1The original PDF containing this chart can be found here.

Backtest & Out-of-sample Performance


The strategy has a naturally contrarian approach, it appears to profit well from sudden drops, however its simple logic structure results in it giving back much of its gains in trending periods. These drawdowns could be reduced with some of the implmentations mentioned above.

In the out-of-sample period the strategy struggles with the bull run in 2013, considerable retuns over the Oct 15th 2014 Treasury flash crash event. Simialrity of returns are good, despite the actual consistence of returns being poor.


Security: Nasdaq 100 ETF (QQQ)

Strategy Overview:

  • Based on the empirically observed tendency fo rising asset prices to continue rising and falling asset prices to continue falling
  • Finance theory and the efficient-market hypothesis has difficulty in fully explaining the existence of this ‘market anomaly’
  • It is believed that the majority of this behaviour can be attributed to behavioural economics, cognititve biases and irrational investors!
  • Trading logic:
    • When current market price is greater than the average price, purchase on expectation of a continued outperformance by the stock, rolling a stop up
    • When current market price is less than the rolling stop, close and wait for re-entry point


  • This strategy has been implemented in a long only fashion, this is to ensure simplicity and to ensure only one security is traded
  • Long-Short variations trading individual stocks would yield superior returns with a reduced market exposure
  • The rolling stop utilised has not been optimised, and is not dynamically set over periods of volatility

Read more on Momentum here.

I chose to include this strategy in my algorithm as it is the natural rival to Mean Reversion, which as previously discussed should theoretically operate well in QE tainted environments. Ideally the less correlated the strategies the better.

Backtest & Out-of-sample Performance


This strategy generates solid returns on bullish trends, it’s stoploss design prevented it from enduring a large drawdown over the subprime crisis. Over various events the returns are largely flat as the stoploss is hit and the algorithm is waiting for a re-entry signal.

Pairs Trading (Statistical Arbitrage)

Securities: Coca Cola (KO), Pepsi (PEP)

Strategy Overview:

  • Based on a market neutral convergence strategy where two historically correlated securitys are traded on the weakening / strengthening of their price correlation
  • The trading of a pair’s divergence / convergence requires good position sizing and market timing as opportunities are limited. The strategy has limited downside however
  • Trading logic:
    • When pair of securities diverge (spread widens), algorithm goes long on one arm of the pair and short on the other
    • When spread narrows sufficiently positions are closed


  • This strategy has been implemented across a single pair which is very well known to the industry, very few opportunities actually exist in this circumstance with trading costs
  • Strategy can be vastly enhanced by using multiple pairs and using more advanced spread divergence signal generation such as Kalman Filters

Read more on Pairs trading here.

I chose to include this strategy as it provides a non-correlated diverse returns stream to the portoflio, which theoretically should have the ability to generate returns in any circumstance due to market neutrality. Exposure here lies in company risk and violent market moves rapidly expanding / contracting spreads. I feel this strategy is very much not correlated with Mean Reversion returns.


Backtest & Out-of-sample Performance


The pairs trading strategy performs well over the total duration of the backtest and out-of-sample forward test. Sizeable drawdowns are realised however, despite the algorithm remaining relatively stable and not correlated to index returns. It appears that in extreme index moves the strategy experiences losses or is unable to generate returns, this is quite the oppposite to Reversion strategy where majority of returns are generated in these circumstances.

3. Combined Strategies

Here we examine the performance of the three strategies when combined in different combinations, some with and without the dynamic rebalancing. The advantage of stacking these strategies (as covered previously) are that the returns are diversified and stabilisied in aim of generate consistent returns over varying market behaviour.

Mean Reversion & Momentum – No Rebalance

Here we have combined the Mean Reversion and Momentum strategy, the portfolio control has now been shared equally between the two strategies.


The combination of Mean reversion and Momentum provides a complementary returns stream which delivers on violent drops and trending periods. We can still see the large moves generated by the Mean Reversion strategy, however the Momentum strategy has improved the cumulative return’s line of fit coefficent.

Mean Reversion & Momentum – Dynamic Rebalancing

Here we are utilising the same strategies combined as above, but in this case we have enabled dynamic rebalancing as mentioned at the beginning of this analysis. The rebalancing results in an outperforming strategy gaining more control of the portfolio.


The rebalancing version of the strategy outperforms the non-rebalanced version, and has smoothed the returns stream nicely. This simplistic machine learning-esque modification has resulted in a superior performance even with very basic strateigies.

Momentum & Pairs Trading

Here we combine the Momentum strategy with the Pairs trading strategy and review its performance. We are unable to use rebalancing on the Pairs strategy due to the usage of two securities. This is something that will be fixed in my next project!


we see a very steady and consistent returns stream with only one major sustained drawdown around 2009 which causes the strategy to dip into negative returns.

Mean Reversion & Pairs Trading

Combining the Mean Reversion strategy and Pairs trading strategy.


Here we see a less stable cumulative return, however we have noticably avoided neative cumulative returns over the ’09 period. The combination of Mean Reversion generating returns on large moves and Pairs generating returns on most occasions other than in violent market moves is a complimentary combination.

All Strategies – Mean Reversion, Momentum, Pairs Trading – No Rebalance

For the grand finale we bring all three of our basic strategies together to review the performance firstly without rebalancing and then later with rebalancing.


Without rebalancing we see a returns stream with the qualities of the various strategies combined, we have not sustained negative cumulative returns or experience a large sustained drawdown in 2011. As a compromise (as there must always be one) we have sacrified absolute returns. The overall alorithms is steady and reliable with periods of underperformance followed by sharp increases in returns.

All Strategies – Mean Reversion, Momentum, Pairs Trading – Dynamic Rebalancing

Turning now to the dynamically rebalanced version of the above, we are looking for performance improvements like seen with the Mean Reversion and Momentum strategies combination. Below I output a full backtest tearsheet like what can be found in the full analysis (see link at top).




Rebalanced we can see the overall strategy has generated superior returns are to be expected from prior backtests. We are seeing a slightly less stable returns stream, as you can noticably see by the dip into negative cumulative returns in ’09 once again. Rebalancing has given us on face value a compromise in quality for quantity when it comes to returns, we shall have to examine the data later to see what the true impact has been.

Further review of out-of-sample performance over each of the following periods:

  • 2013-2015 – Out-of-sample
  • 2013-September’15 – Out-of-sample

In each case the algorithm has been run explicitly over these periods, this generated different returns compared to the overall period performance. The differences are due to a variety of factors not limited to:

  • Backtesting environment calculations of performance
  • Rebalancing warmup period
  • Various calculations warmup period

I can confirm the source code is identical and the time periods are accurate, please use the below as illustrative examples of how the strategy would behaved with different market behaviour.

2013 – 2015


We can see the strategy tracks the index fairly closely with reduced violent price movements. There is a period of underperformance around May and June 2014 which largely effects the overall cumulative returns. We have sacrificed returns for reduced risk and consistency of performance. Its not glamarous, but garnering risk chasing absolute unleveraged returns is unresonable.


2013 – September


This a repeat of the previous backtest but including the forming market top of 2015 and the “Black Monday” event that hit China and the rest of the world (1000pt drop in Dow Jones). It is very key to note the performance at the far right of the chart.

Although in reality any position that may have been opened or closed on that day reaslistically would have been catastrophically affected by violent market moves, broker difficulties and slippage. However its clear to note the inital sharp drop was not felt severely by the algorithm, it only forms the second largest drawdown (around 4%). This is due to a diversified long and short holding with limited market exposure despite only trading three securities, two being major indicies.

4. Review of Results

Here we review a selection of the strongest backtests side by side with commentary:


To summarise the above table we have a few key points (moving from left to right):

  1. Reversion strategy returns are negatively correlated to market returns (as represented by Beta) and has lowest stability and Sharpe
  2. Momentum strategy is highly correlated with market returns and has high stability and returns
  3. Pairs strategy has no correlation to the market and has the lowest volatility and max drawdown
  4. Combining the various stategies appears to have the effect of ‘averaging out’ weaknesses and strengths of the various strategies resulting in a more consistent performance

We find there are three top performing strategies:

  • Momentum & Pairs
  • Mean & Momentum Rebalanced
  • All Strategies Rebalanced
  1. Comparing Momentum & Pairs vs other two strategies, we see the following key points:
    • Joint largest Max Drawdown
    • Lowest Calmar, Sharpe, Sortino, Omega ratio
    • Highest Stability
  2. Comparing Mean Revesion & Momentum Rebalanced vs other two strategies, we see the following key points:
    • Joint largest Max Drawdown
    • Largest Alpha and lowest Beta
    • Highest Volatility
    • Lowest Stability
    • Largest Kurtosis of returns, but the most positive Skew
  3. Comparing All Strategies Rebalanced vs other two strategies, we see the following key points:
    • Joint lowest Volatility
    • Highest Sharpe, Calmar, Sortino
    • Joint highest Omega
    • Lowest Max Drawdown


Although it seems a very close performance between Mean Reversion & Momentum Rebalanced and All Strategies Rebalanced, examining the data carefully shows we have sacrificed some Alpha and aboslute returns for stability and consistency. We are in search of leveraged returns which yields superior returns for each unit of risk added. I am not venturing into quantifying the extent to which one strategy is supeior over the other. The various ratio metrics highlight how returns are being generated on a better risk adjusted basis, we can see them as a proxy to this quantifying.

Overall Outcome:

We have combined three relatively diversified and simplisitic strategies together, shared portfolio control with dynamic rebalancing between two of the three stratgies, this has lead to an improved risk adjusted return stream as measured by metrics in the table above.

Performance Against Benchmark

It is of my opinion that comparing a multi strategy algorithm against an index is illogical, however this is industry standard. This is especially true for outside investors, who may rate the performance of a portfolio against a buy-and-hold return of the S&P 500. For this reason I have written up all of my above analysis against the standard benchmark. However in the chart below you can see the unleveraged performance against an ETF which tracks Multi-Strategy Hedge Fund performance, you can see more about this here. This is by no means a perfect proxy, however it a sure step toward reflecting actual institutional performance.

As a side note: I personally disagree with the ideaology of “out performing the benchmark”, this would be an unrealistic bonus (whilst unleveraged), rather than a target. Instead the intelligent investor will ideally seek consistent and reliable compounding of wealth above the risk free rate. Arguablly, this has been easy to achieve over the past 7/8 years due to experimental monetary policy and ZIRP (Zero Interest Rate Policy) specifically. Market regieme change is frequent and unpredictable, this is a blessing for Speculative Traders and a curse for Quantitiative Traders, however in my opinion uncertainty is most often than not an opportunity.

Here is the backtest + out-of-sample period starting just after ETF inception:




Against the ETF benchmark we see a far superior unleveraged performance. It must be made clear that the hedge fund industry is becoming more crowded, and within the benchmark there will be some components that vastly out/under perform so representing an average performance. We can see that despite this algorithm being extremely basic in construction with very simplistic and “proof-of-concept” style design and strategies, it is not totally obsolete. Compounding at over 8% per year with a Sharpe Ratio just under 1.5 in the out-of-sample period.


1.  Do you believe higher Sharpe Ratios are generally more important than Absolute returns to educated outside investors ?
2.  Do you think eliminating volatility (including upside volatility) is always beneficial to quantitatively driven portfolios?
5 replies
  1. KAiJin
    KAiJin says:

    Hey Marcus – Great write up. Very thorough and informative. However, I am not sure if I am missing something obvious here. How are you combining the different alphas? In other words, whats the criteria or method use to give weight to Momentum + Pairs, for example.

    Let’s say you have five or six optimised alphas : mean reversion, pairs, momentum, sector cross momentum, price volume, etc. You know your results for these in out of sample tests and want to move forward with combining these. How would you create the right combination of these into one ‘strategy’ with these alphas, that satisfies the low volatility+highest sharpe criteria for instance? For instance, you won’t just give 1/5 weight to each of these and allocate capital equally to each. Apologies, if I have missed something obvious.

    • Marcus Williamson
      Marcus Williamson says:

      Hi there Kaijin,

      Thank you for your kind words! In the introduction section I had a subsection on “Dynamic Rebalancing”. This does not go into any detail on what exactly my calculations are, but covers the basic theory of my approach. I did not implement it for the Pairs Trading strategy due to difficulties with some of the recording of orders. Its on my to-do list to revisit and fix.

      The basics idea is: If strategy A generates superior daily returns to strategy B it over time it will gradually allocate increasing weight to Strategy A’s requested positions. This will therefore reduce Strategy B’s as a result. Its along the lines of Kelly Criterion for gambling funds allocation.

      In terms of five or six alphas in an out of sample period, I would look to start with even allocations and then have a dynamic allocation based on what is returning consistently strong returns, where each strategy has to fight to maintain its share of portfolio weighting.

      Hopefully this has cleared up your question! Please let me know if there is anything else,



      • KAiJin
        KAiJin says:

        Hey Marcus – Thanks a lot for the extremely informative response. That definitely clears the concept.
        Just an addition from my end : You are right, it is imperative and natural to look at absolute returns and yes in that case, it would make sense to start equal-weighted and then dynamically rebalance.
        However, let’s say you have one alpha that has a sharpe of 2.3 and a turnover of 50% and another one with an alpha of 2.1 but a turnover of 15% (both out-of-sample results) I suppose you would want to weigh heavier on the side of the alpha with slightly lower sharpe. This would be ideal in a scenario where you want a lower turnover strategy so that you can keep your trading execution costs low. Would you reckon that the same Kelly criterion could be applied in this case as well? As per the quant-strat article it says : All of the strategies are statistically independent (there is no correlation between strategies). But let’s say you have a bunch of alphas in price-volume space that you want to formulate a strategy around, so there might be a little bit of correlation between them. This could lead to cross-alpha effects on each one. Hence some more sophistication would be needed surely. Correct?
        Just thinking out loud 🙂 Please let me know if somehow I have overthought this and underestimated the Kelly criterion.

        • Marcus Williamson
          Marcus Williamson says:

          Hey Kaijin,

          Not a problem, good to hear.

          Depending on the strategy, a higher turnover may be required to keep volatility down and returns steady, one size never fits all however. In combining multiple differentiated strategies one would hope to include a whole range operating over different frequencies (in my opinion), even some which gives a considerable proportion back in costs, but generates consistent regular returns. I guess it depends on whether your explicitly defining transaction costs as an evil or necessary evil (have you got outside investors…). I personally would stick to an Alpha with a higher Sharpe, with the transaction costs included in the returns on which the Sharpe is calculated. With increased sophistication in the Kelly Criterion style management you could penalise a strategy generating good returns with excess turnover, however you wish to define excess.

          Of course more sophistication would be required, in my elementary thinking we could treat each strategy as an asset with its own risk / return profile, ignoring the underlying operations. Then applying Modern Portfolio Theory – Markowitz Optimisation style management you can yield a superior risk adjusted portfolio allocation, please see for more on this. Your perfectly correct here; Kelly Criterion is a very basic tool for cash allocation, although its simple to implement, it lacks sophistication for the issues you have rightly mentioned here. Give me an add on LinkedIn, would be great to stay in contact!



          • KAIJIN
            KAIJIN says:

            Hey Marcus –

            Thanks a lot for clarifying. Yes, that’s a good idea about the personal tolerance on the transaction costs and factoring that into Sharpe. As well as this: “treat each strategy as an asset with its own risk / return profile,” Basic and effective.

            And yes, a linkedin invite coming up 🙂


Leave a Reply

Want to join the discussion?
Feel free to contribute!

Leave a Reply

Your email address will not be published. Required fields are marked *