- Joined
- Jun 27, 2017
- Professional Status
- Certified General Appraiser
- State
- California
A couple of weeks ago, an AI instructor told the me that you have to put all comparables for your subject in the sales adjustment grid. I asked if she meant all properties the adjustments are based on and she said "Yes." I'm not sure though that that is really what she meant.
But if so,, that's a big deal if that's what reviewers really think.
Typically I would run regression on all the neighborhood sales within a certain GLA sq. ft. range of the subject to extract patterns and the impact of various features on price. I would then select from these the 6 most comparable for the grid. I actually think that is the way it has to go. You could add maybe another 6 comps; but that is going to be more work and more "exposure" and probably affect things for the worse. So, I don't think that is really an issue. In any case, I wouldn't myself use more than 12 comps.
More importantly:
1. Regression only models the attributes that you input - and only if there are a certain number of non-null (known) values for each attribute. So, typically we do not get certain values like Condition, Quality and View from the MLS as quantities. You may get these indications inexactly. For example, view may be listed as categories: "Ocean, Mountains, Neighborhood, ..". In the latter case you just need to go through all the data and make sure they are consistent. Before running the model, you have to check the attribute/variable as "categorical". MARS will put a base function in your model something like BF8 = ( VIEW$ in ("Ocean" )) and then the adjustment would be value = BF8 * 20000 + ..... or BF9= (CONDITION$ in ("C-1", "C-2")) and value = BF9 * 30000 + .....
2. Condition and Quality, or similar important variables, should be coded with numbers based on your best guess from what you know about the neighborhood, the MLS and street view.
3. Then you run your regression to create a model.
4. Next you run the model against the input data to predict the prices of comps.
5. Then you calculate the difference between the model prediction and the actual sale price. If your model is good, the difference should be small. If it is not then, look at the sales with large differences and see if you can determine why there is such a large difference.
6. Make improvements and iterate until you can't do any better.
7. Then choose the comps you want to put in the grid. How many? Well that seems open to debate.
In any case, I think you can see, it can be a lot of work to add in the variable values that may be missing from your MLS data and then tweak the model by seeing if you can improve your supplied (most likely subjective) values. Also, you should be able to understand why AVMs are so far off - as they don't know the values of all the important contributors to value that are typically not in the MLS or at least cannot be estimated very well from the data in the MLS. [ But interestingly, if the AVM companies can get some of that data, they can use it to incrementally improve their models - thus the interest in getting their hands on appraiser data.]
But if so,, that's a big deal if that's what reviewers really think.
Typically I would run regression on all the neighborhood sales within a certain GLA sq. ft. range of the subject to extract patterns and the impact of various features on price. I would then select from these the 6 most comparable for the grid. I actually think that is the way it has to go. You could add maybe another 6 comps; but that is going to be more work and more "exposure" and probably affect things for the worse. So, I don't think that is really an issue. In any case, I wouldn't myself use more than 12 comps.
More importantly:
1. Regression only models the attributes that you input - and only if there are a certain number of non-null (known) values for each attribute. So, typically we do not get certain values like Condition, Quality and View from the MLS as quantities. You may get these indications inexactly. For example, view may be listed as categories: "Ocean, Mountains, Neighborhood, ..". In the latter case you just need to go through all the data and make sure they are consistent. Before running the model, you have to check the attribute/variable as "categorical". MARS will put a base function in your model something like BF8 = ( VIEW$ in ("Ocean" )) and then the adjustment would be value = BF8 * 20000 + ..... or BF9= (CONDITION$ in ("C-1", "C-2")) and value = BF9 * 30000 + .....
2. Condition and Quality, or similar important variables, should be coded with numbers based on your best guess from what you know about the neighborhood, the MLS and street view.
3. Then you run your regression to create a model.
4. Next you run the model against the input data to predict the prices of comps.
5. Then you calculate the difference between the model prediction and the actual sale price. If your model is good, the difference should be small. If it is not then, look at the sales with large differences and see if you can determine why there is such a large difference.
6. Make improvements and iterate until you can't do any better.
7. Then choose the comps you want to put in the grid. How many? Well that seems open to debate.
In any case, I think you can see, it can be a lot of work to add in the variable values that may be missing from your MLS data and then tweak the model by seeing if you can improve your supplied (most likely subjective) values. Also, you should be able to understand why AVMs are so far off - as they don't know the values of all the important contributors to value that are typically not in the MLS or at least cannot be estimated very well from the data in the MLS. [ But interestingly, if the AVM companies can get some of that data, they can use it to incrementally improve their models - thus the interest in getting their hands on appraiser data.]