I'm pretty sure the model picked up some significant splines, especially during the 2005 to 2008 range.
I do a number of these models, going back to different periods. Yes, you have a good argument that when you go back in time, sale age becomes a more important factor in the variance of prices, and thus a high R2 may simply be reflecting more how well your model reflects the change in price with respect to time rather than other features such as GLA. That is true, but unavoidable because it is still necessary to get a good resolution on current property characteristic differences. When you go back in time you are doing that for only one reason - to get a greater variety of homes that differ in the composition of bedrooms, bathrooms, GLA, Lot Size and so on. Another way to put this is if I do a regression only going back two years, more of my R2 relates to how well the model does in reconciling the impact of features other than sale age on price than a model that goes back 15 years. Yet when you apply the first model to the last year's comps, you will find it doesn't do as good of a job reconciling differences. More importantly, with fewer samples, you are going to wind up overfitting your model so that it is far weaker in predicting what a similar home would sell for if say a different buyer came along. My experience, is that MARS in any case can do a pretty good job of removing the impact of time on sale prices. IF you believe that there are factors occurring over an extended period of time that impact buyers tastes and market value with respect to home style, GLA, bath count and so on, you can a 2 or 3 way regression that just includes SaleAge vs GLA, SaleAge vs LotSize, SaleAge vs Bathrooms and so on. Then you will get a more complex model. I always run a number of these MARS runs with 2 and 3 way interaction if I suspect something like that is going on.
But, to be truthful, I would never put 3-way interactions in a report, they are too hard to explain. And I would very much try to avoid putting even 2-way interactions in the report. On this latest report, yesterday, I reran without the 2-way interaction and came up with a tighter fit of 8 comps all under 0.20% deviation from the average. I find this hard to believe myself.
To reiterate a previous post, I emphasize that with respect to the comps, I always let the buyer decide what the subjective features are worth. That means, I use that residual to get the value of all features not covered by the first stage regression. I have a special Excel template that shows all the comps. It jumps to a table that has all my CQA rating 0.0, 0.5, 1.0, .... 10.0 next to the adjust amount based on my scoring of the 220+ comps. I put in the scores for the subject in the first column, the CQA generated based on the residuals for all of the comps going into the sales grid, and then go in and factor those out into individual adjustments. The factoring is not important in the final adjusted sale price, its only purpose to to explain to the reader what it is between each comp and the subject that leads to the adjustment. But look, if I think the patio is average looking from looking at the MLS pictures for the comp, but the buyer has paid a lot more for the property than predicted by the model, I may very well have to rate that backyard higher than I otherwise would simply based on my observation of pictures. To reiterate, the buyer has absolute control over the total of all subject value contributions, I only divvy them up based on my analysis. The actual subject feature adjustment is
[Comparable Feature Adjustment] = [Subjective Feature Value Contribution] - [Comparable Feature Value Contribution]
What is absolutely critical to everything here, is my subjective assignment of a CQA score to the subject. But only the buyers control the comparables. That is the difference between the way I do things and the way most other appraisers do things. Most other appraisers will fly by the seat of their pants on the the comparable adjustments - and that is a major source of inaccuracy for them.