The linear multilevel model can be written in the general form
(1)
The matrix V is constructed from the random parameters - the variances and covariances of the
random effects in E and an estimate of it used in the above expression. The estimates of the random
parameters are also functions of the data and V (Goldstein, 2003).
The RIGLS/IGLS algorithm iteratively estimates the random and fixed parameters. In some circumstances
it is possible to obtain inadmissible estimates for components of the model and MLwiN will issue one
of the following warnings. You may need to take action, to ignore the warning or to proceed. Note that
when carrying out a bootstrap any of these conditions will result in that particular bootstrap replicate
being ignored and an extra failed replicate will be registered.
This message is probably produced because your worksheet is not large enough. See the enquiry about increasing worksheet size for details of how to increase the worksheet size.
I (successfully) ran a multinomial model, then cleared the Equations window and set up a binomial model. When I pressed Start, I got an error message which said:
"error while obeying batch file C:\Program Files\MLwiN v2.10\discrete\pre at line number 11: name c1180 'H' C1181 'F~(H)' C1182 'H*' C1183 'P' c1184 'MASK' c1189 'Y-VAR'
Duplicate column name."
This error message occurs because when MLwiN estimates a binomial or multinomial model, it sets up some special columns in the worksheet. When it does this it gives them certain names. The names for the columns used when running a multinomial model are P, -P, H, F~(H), H*, Pi and Y-VAR, and these occupy c1178 to c1183 and c1189. When the model is cleared, these columns are not deleted. If another multinomial model is set up and run, c1178 to c1183 and c1189 get given these names again. This does not cause a problem because the same column is being given the same name. However, if instead a binomial model is set up and run, the names and locations of some of the special columns are different: H, F~(H), H*, P and Y-VAR are put in c1180 to c1183 and c1189. When MLwiN tries to give these names to these columns, it discovers that P is already the name of c1178. Since there can't be two columns with the same name, the error message is produced.
To solve the problem, after you clear the Equations window simply scroll down in the Names window to find c1178 to c1189, highlight these columns and select the Delete button at the top of the window. This removes the contents of the columns and renames them c1178, c1179, and so on. When you set up and run the binomial model MLwiN is then able to properly create the special columns it needs.
Note that the same problem does not occur if you try to run a multinomial model after having run a binomial model. This is because, although c1183 is already named P when MLwiN tries to name c1178 P, c1183 is being given a new name in the same command, and so c1178 is successfully named P without there being 2 columns with the same name.
Example question: I tried to set up and run a binomial model using a macro, but got the error message 'Variables random at bottom level should not be used in model'
It sounds as though you have included a command in your macro like this:
setv 1 'cons'
and this is causing the error message that you see. When fitting a binomial model, it is not necessary to specify an error term at level 1. This is because the level 1 error term is already included in the model, with a binomial distribution, when you specify a binomial response (It is attached to a specially created variable, which does not appear elsewhere in the model). When specifying models in the Equations window using the Graphical User Interface, you are not given the option to specify that any variable be random at level 1. Using commands or macros it is of course possible to issue a command that makes this specification, but this causes an error when MLwiN attempts to estimate the model. If you press the Start button after setting up a model like this using commands or a macro you will see the same error message. If you remove the command "setv 1 'cons' " from your macro the model should run as expected.
For more information about binomial response models and level 1 residuals, see section C6.4 of the module on binary response models in our online training materials.
A common situation where you may wish to allow negative variances is when carrying out an iterated bootstrap for bias correction. In some cases allowing a variance, especially a level 1 variance, to go negative during the iterative procedure, can prevent estimation failure. For example, if the level 1 variance is very small compared to that at higher levels, the IGLS algorithm can stick at a zero estimate of the level 1 variance if a negative level 1 variance is encountered at an iteration and reset to zero - and at the subsequent iteration also be reset to zero so that convergence appears to have been achieved. Allowing it to go negative can avoid this problem and result in a final positive converged value. The most important case is when modelling complex level 1 variation with the variance a quadratic function of an explanatory variable. Here it is perfectly legitimate to have a negative variance so long as the total variance is not negative.
Example question: I just tried to use MCMC methods in MLwiN but I always get error messages (wrong parameter...) even if I try to follow the tutorial in Chapter 10 of the MCMC manual.
This error may be caused by using a comma as the decimal separator. If you do not have the most recent version of MLwiN, you may find that upgrading to this version solves the problem. If you still experience the problem when using the most recent version, then try changing the decimal separator that your computer uses to a '.' (via Regional and Language Options in the Windows Control Panel).
Example question: I'm trying to run a two-level multinomial model using MCMC: the outcome variable has 4 categories (cat 1 n=6168, cat 2 n=1160, cat 3 n=1608, cat 4 n=1393). The dataset comprises 200 areas and 10,329 individuals. When switching the estimation control from mql/pql to MCMC I get the following error message: "MCMC error 0315: prior variance matrix is not positive definitive" followed by "Error in MCMC start-up - estimation halted". I checked HELP in MLwiN and it indicated I needed to manually change a setting at column 1096 but insufficient information is provided as to how to do this. I subsequently sent an email to the multilevel listserv and received the following helpful response from Bill Browne…
"the MQL method must have produced a variance matrix at a higher level that is non positive definite i.e. it estimates a correlation between random effects outside the range -1 to 1. In C1096 the numbers should correspond to the ones you see in the equations window and are starting from the highest level the lower diagonal elements of all the variance matrices. My advice is to run in MQL/PQL then before switching to MCMC go into c1096 and switch off diagonal elements to 0 - to check you are doing this correctly if you have the equations window up next to the data window then when you change the value in the data window the corresponding value in the equations window will change. Once you have changed all the off-diagonal elements to zero switch to MCMC and things should run".
… I followed Bill's advice and MCMC ran fine (which suggests I followed the advice correctly). However, I then added additional variables to my model and again went through the process of generating starting values using mql/pql and then switching to MCMC and the same two error messages returned, however, this time resetting the off-diagonal values to zero in c1096 did not fix the problem. And unfortunately my technical/programming knowledge of MCMC is very limited.
Bill's advice sounds spot on and I am surprised that it didn't solve the problem the second time round. Note, if you add any additional random effects then the rows of c1096 that you set to zero the second time will differ from the first time. Make sure that the second time you run the model the variance-covariance matrices look as Bill described before starting the MCMC chain. Also check whether any of the on-diagonal terms (i.e. the variances as opposed to covariances) are zero. If so, try changing them to 0.001.
Note that not positive definite does NOT mean the matrix contains some elements which are negative. It is perfectly possible for a matrix which contains only positive elements to not be positive definite. Therefore when following the advice above it is important to change ALL the off-diagonal elements to 0, not just any negative off-diagonal elements.
Example question: I am attempting to fit a 3-level proportional odds model with time (measured in months) as the first level, person (RID) as the second level, and institution (FID) as the third level. In setting up an ordinal model I get the error message
"Unexpanded level 2 id column (Month)must not contain consecutive repeated codes, in multivariate or multinomial models. Use GENErate command to generate unique ID codes".
I do not understand what is needed of the data and it isn't clear from the data manipulation menu how to use the GENE command. Please advise me on this.
When you fit a multinomial model (whether ordinal or unordered), MLwiN needs your level 1 ID codes to obey the condition that the same value does not appear on adjacent rows of the dataset. So if your level 1 is Month, you can't for example have values of 1,2,3 for Individual 1 and 3,4,5 for Individual 2 because then the level 1 ID column will read '1,2,3,3,4,5,…' and that repeated 3 is a problem. (This does not apply to Normal response models, where repeated values like this are not a problem). To get around this, the simplest thing is to create a new variable to be your level 1 ID which has values 1,2,3,4,...,n where n is the length of your dataset: in other words a variable which is just a seqence from 1 to the length of the dataset, and which takes no account of any information about individuals or time points. This is what the error message is asking you to do when it tells you to use the GENErate command (more on this in a moment). You might think using a sequence like this as your level 1 ID is a problem because you are discarding all your information about when people were measured. However this information can still go into this model if you put Month in as an explanatory variable (you will probably want to do this anyway). The explanatory variable is the proper place for this information; all that is needed for the level 1 ID is that it should change from row to row, to signify that each row is a different occasion, it doesn't matter for the level 1 ID how far apart the occasions are or which people are measured on the same occasion.
The reason that repeated values are a problem is that MLwiN needs to create an expanded dataset when you fit a multinomial model, as described in the User's Guide. (With a Normal response model, MLwiN does not expand the dataset). The process that it uses to expand the dataset requires that the level 1 IDs satisfy this condition. In the expanded dataset, a new level 1 is created and the level you specified as level 1 is now level 2, so the level 2 referred to in the error message is the level you specified as level 1.
The sequence to use for level 1 in place of Months can be created either using a command or via the GUI. To create using the GUI, select 'Generate vector' from the 'Data Manipulation' menu. In the window that appears, select 'Sequence' under 'Type of vector'. Select any free column next to 'Output column' and type 1 next to 'Start number', the length of your dataset (this would be for example the length of your Month variable) next to 'End number' and 1 next to 'Step value' and select 'Generate'. To create using a command, in the Command interface window type 'Gene 1 n 1 c100', replacing n by the length of your dataset and c100 by any free column; for example if your Months variable is of length 4587 and column 48 is free you could type 'Gene 1 4587 1 c48'.
You may sometimes obtain estimates for a model which seem to be incorrect. Possibilities are:
Example question: I have been using MLwiN to fit an ordered categorical model. In the first instance I have been doing this for a single level model and have been replicating the analysis in SPSS. When I fit a model including only the ordered response variable SPSS and MLwiN give the same answers more or less. When I add a categorical fixed effect explanatory variable, all but two of the standard errors of the parameter estimates are set to zero in MLWin. They are not in SPSS. Is there any explanation for this? I have suppressed the warning of negative variance in the iterative procedure to obtain convergence, is this the key? Is there any way round it? I am keen to pursue the use of MLWin because it is more powerful and I am particularly interested in the standard errors of the predictions. Thanks.
You can check whether these standard errors have really been set to zero or if they have been estimated but are very small. Check this by changing the display format for numbers from 3 decimal places to 3 significant figures. Go to:
options
Numbers (Display precision and missing value code)
and change the selection from 'decimal format' to 'signif digit format'; in the box '# significant digits' select 3. If standard errors are still zero then you have some kind of estimation problem. One reason may well be: MLwiN is more sensitive than SPSS and Stata and can fail to estimate models that SPSS and Stata provide numbers for. This can be a good thing as MLwiN will fail when a model is poorly specified whilst SPSS/Stata will throw out results and you may not realise you have a problem. So make sure that your model makes sense and that all the variables in the model are sensible and have sensible distributions. You could try to see if the problem applies with other explanatory variables and an alternative ordered response. Other possibilities are: You have done something wrong in setting up the model in MLwiN. Check very carefully that the summary statistics for your variables look the same as in SPSS. Make sure that you have read the relevant chapter in the User Guide (chapters 9 and 11).Then try repeating your MLwiN/SPSS comparison for the simpler case of a binary response (i.e. collapse your ordered outcome down to two categories). As for 'suppressing the warning of negative variance in the iteratice procedure', there is no variance (apart from the implicit probit or logit variance) in single level ordered models so you should not be doing this.
The most common reasons are:
Changing the windows decimal separator to a point (.) resolves this problem. You can do this via the regional settings screen available from the Windows control panel.
Default Worksheet size set too high: If you set the default worksheet size to too high a level for the amount of RAM on your computer, MLwiN will not be able to open on subsequent attempts to launch the software. If you run into this problem do the following:
Run)
worksheet)Example question: I have just successfully installed MLwiN. However upon initiation of MLwiN, I now receive the message "Pre or Post file does not exist (post)". Is this OK, or should it be repaired?
The pre- and postfiles should be located inside a directory called discrete in your MLwiN directory (in C:\Program Files). The first thing is to check whether they are actually there. The file names should be 'PRE' and 'POST'.
If the files do not seem to be there, then try downloading and installing again. If you can see the files, then check that MLwiN is looking in the right place for them: from the Options menu select dIrectories and check that in the Fpath box it gives the path of the discrete folder (e.g. 'C:\Program Files\MLwiN v2.10\discrete') and in the Pre file box it says 'pre' and in the Post file box it says 'post'. If any of this has been mis-specified you can change it by selecting User defined settings above the boxes and then changing the text in the boxes as appropriate and selecting Done.
If this problem has been encountered when running a macro that specifies a path for these files, check that the path is correctly specified in the macro.
It is possible to open, work with and save worksheets on a network drive. However, you will need to have the drive mapped to a drive letter (and to type the filepath in this form, if you are opening/ saving the worksheet via commands). This is because MLwiN does not recognise the more recent notation for accessing a network drive directly. If using the GUI to open and save files, then provided the drive you want to access has been mapped to a drive letter, the procedure is just the same as when opening files saved on your local computer: simply browse to find the network drive (which will probably have its name in the longer more recent notation but with the letter after it in brackets), double click on it just as with a folder, and continue browsing to find your worksheet, then click on the worksheet and select Open. If using commands to open and save files, then for example if you have a network drive mapped to the letter Q and want to open 'file1.ws' in the Data folder in this drive, you would type:
"RETR 'Q:\Data\file1.ws' "
… instead of (for example):
"RETR '\\computername\sharedfolder\Data\file1.ws' ".
If you have a worksheet saved to a network drive which is not mapped to a drive letter then the simplest thing to do is to copy the worksheet to your local computer disk (or a memory stick) and open it from there.
This applies to both MLwiN worksheets and to SPSS, Stata, and Minitab worksheets (if using MLwiN version 2.10).
Example question: I'm taking the LEMMA Course and until yesterday everything was working perfectly, but today I just couldn't open the MLwiN Datafiles. Every time I try I get this message: "Run-time error '5': Invalid procedure call or argument". I've already install the latest version (2.10 beta 6), but it didn't work. I tried several times uninstalling and reinstalling the software, but it didn't work… Any ideas of what can I do? Many thanks in advance.
Has anything on your system changed between when you were able to open the worksheets and now? Examples might be other software being installed or removed. It might be worth manually clearing the MLwiN installation directory after removing it, before attempting to reinstall it. This is the directory where you chose to install it (usually C:\Program Files\MLwiN v2.10). If you have an account with administrative privileges on that machine it might also be worth checking to see if you have the same problems when running as that user.
Enquirer's reply: I don't have administrator rights in my computer, it is a pain each time I need to install a software. Finally I could open the files. As I realized I can open the old example files, what I did was to save the files in my computer and then change the extension from '.wsz' to '.ws' I know it is not the best, but it worked.
Example question: When I select the icon for 5.1.wsz in Module 5.1 I seem to get a notepad file that has a lot of blank characters, not a data file.
First note that the datasets for the LEMMA course will not open with MLwiN version 2.02: you will need version
2.10 or the teaching version (which you can download for free here). Make sure that you have one of these versions of MLwiN. If you do have the appropriate version but are experiencing this problem, then first try reinstalling your copy of MLwiN. If this fails, you can try downloading again and then reinstalling. If that doesn't work or if you cannot do this, there are two options: a work around or a fix.
Option 1: Work around
When you select the link to the datafile, choose Save to computer instead of Open. Then start MLwiN and open the data file from MLwiN's File menu.
Option 2: Fix
The fix is to reassociate the data file's .wsz extension with MLwiN (so that Windows opens them in MLwiN if you double click them,
or if you choose Open instead of Save to computer when downloading), which is pretty easy and takes just a minute.
Again, when you select the link to the datafile in your browser choose Save to computer rather than Open, and save for example to My Documents. Then open up My Documents or whichever folder you have saved the file into in Windows Explorer - so you can see the data file's icon. If the cause of the problem with opening the files is the situation this fix is designed for, then it should not have the MLwiN logo on it. (If the data file does have the MLwiN logo then there is probably a different cause of the problem and this fix will probably not work).
The next part of these instructions is written correctly for Windows XP, but probably something similar will work for Windows Vista. Right click on the data file (i.e. select with the right mouse button, not the left one that people usually use), and choose the menu option Open With, then in the submenu select Choose Program. You should get a new window with a list of programs and a tick box labelled "Always use the selected program to open this kind of file" WHICH YOU
SHOULD TICK, then select MLwiN, then select OK. If MLwiN is not in the list of programs, select the Browse button and locate and open the "mlwin.exe" file. It would normally be in the folder C:\Program Files\MLwiN v2\ or C:\Program Files\MLwiN v2.10. (Note that the icon for the program should be pale blue and white, not yellow and dark blue: the program with the yellow and dark blue icon is version 2.02 which will not open the LEMMA datafiles).
After doing this, the data file on your computer should now have the MLwiN logo, and when you double click it should open up in MLwiN, and
when you open files directly from your browser, they should open in MLwiN too.
Enquirer's reply:I didn't reinstall, but I tried the work around (didn't work), and the fix (which did).
Example question: I have worked through Module 5 up to p. 8 (P5.1.2) but when I try to produce the caterpillar plot (selecting 'apply' in the Plots tab), I get the error message: 'Failed to open Graphics Server', selecting OK or closing produces the error message: 'The server is already running', Clicking or closing this message repeats the earlier message, and so on …
It is possible that data in the worksheet columns can become corrupted unintentionally for a number of reasons:
You should always make sure that you have a recent backed up worksheet copy which you can retrieve if you get into this situation
The command LOGO <filename> does not work in the current version of MLwiN. However the commands LOGO <filename> 1 to switch on logging and LOGO 0 to switch logging off should do exactly the same thing the LOGO <filename> command used to do. Note that the <filename> needs to be the full path of the file, unless you have set the working directory to the directory that the file is in using Options
dIrectories and then setting the directory using the current directory box.
To make things clearer, here is an example of using the LOGO command that should work:
ECHO 1
LOGO 'C:\Documents and Settings\All Users\Documents\logfile.txt' 1
SAY 'Hello World'
LOGO 0
(Note that 'ECHO 1' turns on echoing, which means that commands coded in macros will be displayed).
If this does not work an alternative way to preserve a record of what has been done in a session is to copy the commands from the Command interface window (see When I set up a model in the Equations window, is there any way to record the commands that will do the same thing?). Also note that you can save sequences of commands in a macro to execute again later (see How can I easily save and run commands?)
Example question: I fitted a model using version 2.02 and then upgraded to version 2.10
Beta and fitted the model again. All the estimates of the coefficients and
variance parameters were the same, but the -2*log(likelihood) value given
was different.
We have found in some rare cases that MLwiN 2.10 Beta (subversions 1-9) produced incorrect likelihoods for models estimated with (R)IGLS. The problem does not apply to models estimated in MCMC (since there is no likelihood calculated for these models; the problem does not apply to the DIC). The problem was not present in previous release versions (2.02 and earlier). The problem was fixed in MLwiN Beta 2.10.
Our research has found the problem to only occur in single level models with a large number of cases (>10,000). Furthermore, the problem does not occur if a second level is declared but no variance component is fitted at level 2.. If users have fitted a single level model in MLwiN 2.10 Beta (subversions 1-9), with only a single level hierarchy defined, the -2*log(likelihood) should be regarded as suspect and the model should be re-run in MLwiN 2.1 or subsequent version.