Eric Rasmusen's Weblog: Weighted Least Squares and Why More Data is Better

Friday, September 28, 2007

Weighted Least Squares and Why More Data is Better

In doing statistics, when should we weight different observations differently?

Suppose I have 10 independent observations of $x$ and I want to estimate the population mean, $\mu$. Why should I use the unweighted sample mean rather than weighting the first observation .91 and each of the rest by .01?

Either way, I get an unbiased estimate, but the unweighted mean gives me lower variance of the estimator. If I use just observation 1 (a weight of 100% on it) then my estimator has the variance of the disturbance. If I use two observations, then a big positive disturbance on observation 1 might be cancelled out by a big negative on observation 2. Indeed, the worst case is that observation 2 also has a big positive disturbance, in which case I am no worse off by having it. I do not want to overweight any one observation, because I want mistakes to cancel out as evenly as possible.

All this is completely free of the distribution of the disturbance term. It doesn't rely on the Central Limit Theorem, which says that as $n$ increases then the distribution of the estimator approaches the normal distribution (if I don't use too much weighting, at least!).

If I knew that observation 1 had a smaller disturbance on average, then I *would* want to weight it more heavily. That's heteroskedasticity.

Labels: statistics

To view the post on a separate page, click: at 9/28/2007 06:21:00 AM (the permalink).

Selected Archive Topics >

Blog Policies.

I've set up this blog for myself, as a commonplace book, with the idea that it might also be useful for outside readers. That is why the topics are idiosyncratic. I see that most of my readers are directed here by Google searching rather than being regular readers.

I will delete rude comments, and will give less leeway to anonymous comments than to signed ones. I will for now at least allow stupid and ill-informed comments, though other readers don't enjoy them unless they are so ignorant as to be funny.

I will revise my posts freely, usually without any note that they've been revised. If I make an important mistake in a post that I think people might refer to, I will note the mistake and correction. But I'm not trying to make this a historical record. In fact, I'd like to merge posts on the same topic and delete posts not of interest a year later, except that I never get round to doing that.

Subscribe to
Comments [Atom]

Eric Rasmusen's Weblog

Friday, September 28, 2007

Weighted Least Squares and Why More Data is Better

About Me

Previous Posts

Selected Posts of Special Interest >

Selected Archive Topics >