IBM SPSS Statistics for Mac offers in-depth analysis of data and provides suggestions and reports based on the given criteria. This premium product comes with a free trial and features extremely powerful data analysis tools. On the downside, the user interface is poorly designed, with an overabundance of options, which may overwhelm the casual user. Pros Powerful capabilities: After you input your variables into IBM SPSS Statistics for Mac, you can perform complex analysis, including actions such as changing all variables to meet a certain goal, or in the case of direct marketing, identify clients who are most likely to buy your product or the general geographic area that sees the most purchases. Other analysis types include correlation, forecast, regression, and classification in clusters. Add-ons support: The app's core functionality is enhanced by add-ons such as text analysis for surveys and data collection. Excellent output options: When it comes to saving your work, you can either export your entire database, or save the data, itself, as one of more than 30 formats, including Microsoft Excel.
Cons Lengthy setup: In order to install IBM SPSS Statistics for Mac, you need to register and create your own IBM ID and install Java and Python. Bottom Line If you run a company and need software to help you make the most informed decisions, a product like IBM SPSS Statistics for Mac can be worth its weight in gold. Although it may be overwhelming at first, the app easily handles multivariable decisions. If, on the other hand, you don't need this much analytical potential, a more basic offering like Excel's Solver might be a better fit. Editors' note: This is a review of the trial version of IBM SPSS Statistics for Mac 19. Solve business and research problems using SPSS for Mac OS X. Designed to run on your Mac, this data analysis and data management package enables you to uncover key facts, patterns, and trends.
With this release, you'll have access to a host of new capabilities while still enjoying the familiar features of the Mac platform. You can use SPSS for Mac OS X in a variety of areas, including:. Survey and market research and direct marketing. Academia and instruction. Medical, scientific, clinical, and social services research. Administrative research, human resources, and resource planning.
Planning and forecasting. Quality improvement. Reporting and ad hoc decision making In particular, you can use SPSS for Mac OS X to gain greater insight into the actions, attributes, and attitudes of your customers, employees, students, or citizens.
I will be analysing vast amount of network traffic related data shortly, and will pre-process the data in order to analyse it. I have found that R and SPSS are among the most popular tools for statistical analysis. I will also be generating quite a lot of graphs and charts. Therefore, I was wondering what is the basic difference between these two softwares. I am not asking which one is better, but just wanted to know what are the difference in terms of workflow between the two (besides the fact that SPSS has a GUI). I will be mostly working with scripts in either case anyway so I wanted to know about the other differences. I work at a company that uses SPSS for the majority of our data analysis, and for a variety of reasons - I have started trying to use R for more and more of my own analysis.
Some of the biggest differences I have run into include:. Output of tables - SPSS has basic tables, general tables, custom tables, etc that are all output to that nifty data viewer or whatever they call it. These can relatively easily be transported to Word Documents or Excel sheets for further analysis / presentation. The equivalent function in R involves learning LaTex or using a odfWeave or Lyx or something of that nature.
Labeling of data - SPSS does a pretty good job with the variable labels and value labels. I haven't found a robust solution for R to accomplish this same task. You mention that you are going to be scripting most of your work, and personally I find SPSS's scripting syntax absolutely horrendous, to the point that I've stopped working with SPSS whenever possible. R syntax seems much more logical and follows programming standards more closely AND there is a very active community to rely on should you run into trouble (SO for instance). I haven't found a good SPSS community to ask questions of when I run into problems. Others have pointed out some of the big differences in terms of cost and functionality of the programs.
If you have to collaborate with others, their comfort level with SPSS or R should play a factor as you don't want to be the only one in your group that can work on or edit a script that you wrote in the future. If you are going to be learning R, this post on the stats exchange website has a bunch of great resources for learning R. For small tables I usually just copy-paste the screen output in R directly into Excel, then call 'text-to-column', alternatively you might use write.csv (or csv2) on the table. (or perhaps you were referring to some automatic reporting?).
The Hmisc package has variable labels, but value labels is for factors. This is nicely done in the foreign package if you were to import a SPSS (or Stata) dataset, the resulting R data keeps the labeling information from the original. – Sep 24 '10 at 16:15. @eyjo - 'automatic reporting' is a relative term. Our current work flow entails: 1. Pull data from SQL into SPSS, 2. Use a VB script that goes through our surveys and pulls out variable and value labels automatically, edit them and apply to SPSS dataset.
Use another script that generates SPSS tables in the format we like. Export to Word & Excel for further post processing that SPSS can't handle. Make a 'client ready' appendix as.DOC or.PDF. I would LOVE for R to replace the SPSS - Word part of that. Ideally, workflow could be SQL - R / Sweave - Final product.
– Sep 24 '10 at 19:20. @Chase: I do not see why this could not be done only using R.
I have developed some custom tools for companies which does exactly the same: get the data from SPSS or automatically fetch data from MySQL and applies labels/variable names (from another MySQL table of from online HTML survey's body) to the columns, generates tables with required format and exports it as an odt file, which can be opened in any MsWord (2007+) or OOWriter. The output can be themed (header, colors, images, font, table's margins etc.) easily. It can be a lot of work (moreover with a GUI) but may worth in the long run. – Jan 17 '11 at 13:22.
Here is something that I posted to the R-help mailing list a while back, but I think that it gives a good high level overview of the general difference in R and SPSS: When talking about user friendlyness of computer software I like the analogy of cars vs. Busses: Busses are very easy to use, you just need to know which bus to get on, where to get on, and where to get off (and you need to pay your fare).
Cars on the other hand require much more work, you need to have some type of map or directions (even if the map is in your head), you need to put gas in every now and then, you need to know the rules of the road (have some type of drivers licence). The big advantage of the car is that it can take you a bunch of places that the bus does not go and it is quicker for some trips that would require transfering between busses. Using this analogy programs like SPSS are busses, easy to use for the standard things, but very frustrating if you want to do something that is not already preprogrammed.
R is a 4-wheel drive SUV (though environmentally friendly) with a bike on the back, a kayak on top, good walking and running shoes in the pasenger seat, and mountain climbing and spelunking gear in the back. R can take you anywhere you want to go if you take time to leard how to use the equipment, but that is going to take longer than learning where the bus stops are in SPSS. There are GUIs for R that make it a bit easier to use, but also limit the functionality that can be used that easily. SPSS does have scripting which takes it beyond being a mere bus, but the general phylosophy of SPSS steers people towards the GUI rather than the scripts. The initial workflow for SPSS involves justifying writing a big fat cheque.
R is freely available. R has a single language for 'scripting', but don't think of it like that, R is really a programming language with great data manipulation, statistics, and graphics functionality built in. SPSS has 'Syntax', 'Scripts' and is also scriptable in Python. Another biggie is that SPSS squeezes its data into a spreadsheety table structure.
Dealing with other data structures is probably very hard, but comes naturally to R. I wouldn't know where to start handling network graph type data in SPSS, but there's a package to do it for R. Also with R you can integrate your workflow with your reporting by using Sweave - you write a document with embedded bits of R code that generate plots or tables, run the file through the system and out comes the report as a PDF. Great for when you want to do a weekly report, or you do a body of work and then the boss gives you an updated data set.
Re-run, read it over, its done. But you know, your call. Well, are you a decent programmer?
If you are, then it's worthwhile to learn R. You can do more with your data, both in terms of manipulation and statistical modeling, than you can with SPSS, and your graphs will likely be better too. On the other hand, if you've never really programmed before, or find the idea of spending several months becoming a programmer intimidating, you'll probably get more value out of SPSS. The level of stuff that you can do with R without diving into its power as a full-fledged programming language probably doesn't justify the effort. There's another option - collaborate. Do you know someone you can work with on your project (you don't say whether it's academic or industry, but either way.), who knows R well?
I have not data for it, but from my experience I can tell you one thing: SPSS is a lot slower than R. (And with a lot, I really mean a lot) The magnitude of the difference is probably as big as the one between.
For example, I never have to wait longer than a couple of seconds in R. Using SPSS and similar data, I had calculations that took longer than 10 minutes. As an unrelated side note: In my eyes, in the recent discussion on the speed of R, this point was somehow overlooked (i.e., the comparison with SPSS). Furthermore, I am astonished how this discussion popped up for a while and silently disappeared again. There are some great responses above, but I will try to provide my 2 cents. My department completely relies on SPSS for our work, but in recent months, I have been making a conscious effort to learn R; in part, for some of the reasons itemized above (speed, vast data structures, available packages, etc.) That said, here are a few things I have picked up along the way:.
Unless you have some experience programming, I think creating summary tables in CTABLES destroys any available option in R. To date, I am unaware package that can replicate what can be created using Custom Tables. SPSS does appear to be slower when scripting, and yes, SPSS syntax is terrible. That said, I have found that scipts in SPSS can always be improved but using the EXECUTE command sparingly.
SPSS and R can interface with each other, although it appears that it's one way (only when using R inside of SPSS, not the other way around). That said, I have found this to be of little use other than if I want to use ggplot2 or for some other advanced data management techniques. (I despise SPSS macros). I have long felt that 'reporting' work created in SPSS is far inferior to other solutions.
As mentioned above, if you can leverage LaTex and Sweave, you will be very happy with your efficient workflows. I have been able to do some advanced analysis by leveraging OMS in SPSS. Almost everything can be routed to a new dataset, but I have found that most SPSS users don't use this functionality. Also, when looking at examples in R, it just feels 'easier' than using OMS. In short, I find myself using SPSS when I can't figure it out quickly in R, but I sincerely have every intention of getting away from SPSS and using R entirely at some point in the near future. I work with both in a company and can say the following:.
If you have a large team of different people (not all data scientists), SPSS is useful because it is plain (relatively) to understand. For example, if users are going to run a model to get an output (sales estimates, etc), SPSS is clear and easy to use. That said, I find R better in almost every other sense:. R is faster (although, sometimes debatable). As stated previously, the syntax in SPSS is aweful (I can't stress this enough).
On the other hand, R can be painful to learn, but there are tons of resources online and in the end it pays much more because of the different things you can do. Again, like everyone else says, the sky is the limit with R. Tons of packages, resources and more importantly: indepedence to do as you please. In my organization we have some very high level functions that get a lot done. The hard part is creating them once, but then they perform complicated tasks that SPSS would tangle in a never ending web of canvas. This is specially true for things like loops.
It is often overlooked, but R also has plenty of features to cooperate between teams (github integration with RStudio, and easy package building with devtools). Actually, if everyone in your organization knows R, all you need is to maintain a basic package on github to share everything. This of course is not the norm, which is why I think SPSS, although a worst product, still has a market.