Hi.
Your cluster analysis methodology is a good beginning, and if you like the clusters it gives you, then you're set. The beauty of having a small data set to work with (it sounds like you've got each of the states in the USA as a single point, so you only have around 50 data points, a few more if you include D.C., Puerto Rico, etc.) is that you can try a bunch of different things, and they all work reasonably quickly.
If I understand correctly, you used PROC FASTCLUS after PROC CLUSTER in order to generate centroids? Again, this is the luxury afforded by a small data set. Did you check afterwards to see how close the FASTCLUS clusters agreed with the clusters you chose from CLUSTER? If the agreement is not so good then the FASTCLUS centroids may not be very useful.
When you have medium to large data sets (more than 10,000 records), you can't really run PROC CLUSTER first. What some people do is to run PROC FASTCLUS to generate a lot of "small" clusters (say, 500 to 1,000) and then use the centroids as new data points, and the number of points in each cluster as frequencies for the centroids. Then run PROC CLUSTER on that data set.
And, of course, there is the "forgotten" clustering procedure, PROC MODECLUS, which is faster than CLUSTER and uses nonparametric tests to try to determine the best number of clusters. But I'm getting ahead of myself, because data preparation is so critical to achieving your results. You understand the reasons for standardization, but as you've noted in your related posts, standardizing can swamp out non-spherical shapes (such as elongated ellipses). If you play around with shapes on a piece of paper, you can see for yourself the kinds of delicate situations that can arise. Draw two elongated, needle-like parallel clusters. As they get closer together and more elongated, the typical "vanilla" options like k-means will drastically rearrange what the clusters are. (It's tough to guard against this in higher tensions, because it's hard to visualize the data.) ACECLUS can help you maintain ellipsoidal clusters; read the documentation and experiment.
I'm sorta rambling here, but there are two more related things I want to point out. Outliers. Some clustering methods claim to be immune to outliers, but many of the SAS methods use squared Euclidean distance as their default, and that will be sensitive to outliers. If you toss out outliers and get very different clusters, then the outliers probably were messing you up. You mentioned that your data is not normally distributed; if the tails are fat, then again you need to be careful about the effect on distances. You may want to ditch the squared Euclidean distances. CLUSTER and MODECLUS are quite flexible in the kinds of distances they allow (you can generate all kinds of distance metrics with the %DISTANCE macro, or PROC DISTANCE in SAS 9). FASTCLUS is less flexible about distance, but you can play with the "homotopy parameter."
Good luck, I hope I haven't caused more confusion...