Scottish Study of Early Learning and Childcare: phase 4 report
This report outlines findings from the 4th phase of the Scottish Study of Early Learning and Childcare (SSELC), focusing on two-year-olds who are accessing funded ELC. The SSELC forms a major part of the strategy for the evaluation of the expansion of funded early learning and childcare in Scotland.
Appendix B – Methodology
Aims
Phase 4 of the SSELC had multiple aims. Firstly, it was designed to provide robust, nationally-representative data on a number of specific child and parent outcomes for those aged between two years and two years six months who were receiving 1140 hours of funded ELC. It was also intended to collect information about the household circumstances of these children, childcare use, the socio-economic characteristics of the family and a range of additional circumstances, experiences and behaviours known to be associated with child outcomes.
Secondly, it was intended to provide baseline data for these children and to allow for the same children to be followed up one year later. This will allow for the identification of how much progress these children have made following a year of funded ELC.
Thirdly, it was intended to ensure that data from a subsample of these children was comparable with data from children who took part at Phase 1 of the study, in order to provide a pre- and post-expansion comparison. Note that Phase 1 of the study, covering children aged between two years and two years six months who were receiving 600 hours of funded ELC for the year, was not nationally representative, being conducted in only 17 local authority areas.
Finally, it was intended to gather information from setting heads about their opinions of the expansion of ELC and how easy or difficult it had been to meet the requirements.
Sampling
A sample of children aged between two years and two years six months with a funded ELC place in Scotland was drawn via a two stage process. First, a stratified sample of ELC settings was drawn. For larger settings, a second stage was involved. If there were more than 10 eligible children at the setting, a sample of 10 children was drawn by setting staff from those eligible. The large majority of settings had no more than 10 eligible children. In these cases all eligible children at the setting were invited to take part.
Selection of settings
A systematic sample of 315 ELC settings was drawn, stratified by local authority (LA) and two deprivation groups (ELC settings in the most deprived 20% of areas, based on SIMD, vs the rest). Some local authorities were too small to allow for stratification by deprivation group, so settings were ordered by deprivation group and setting size before selection. To give all eligible children an equal chance of being selected, all settings were also given equal selection probabilities, except for those more than 10 eligible children. These were given a proportionately higher probability of selection. Settings that opted out were excluded.
A list of all ELC group settings in Scotland[36] with eligible two-year-olds was provided by the Scottish Government and local authorities, including various figures for the number of eligible two-year-olds at each setting. Some figures were for the current year, and some for the current term. Provisional figures were available for some settings, additional to confirmed attendance. Only capacity figures for 2023 were available for some settings, although 2022 enrolment figures were available for each of these. A model was used to estimate 2023 attendance in these settings based on 2023 capacity figures.
Figures provided were for all eligible two-year-olds, rather than just those aged two years to two years six months. The number of eligible children aged two years to two years 6 months at each setting was estimated combining all this information, including an adjustment to bring the total for each LA into better alignment with published local authority figures for the total number of eligible two-year-olds attending in September 2022.
Reserve samples
A reserve sample of 63 settings was also drawn using the same method, in case there were a larger number than expected of settings with no eligible children / otherwise unable to take part identified before the start of the fieldwork period. This was only used for three local authorities: Dundee, Orkney, and Shetland.
It was initially unknown whether East Dunbartonshire would participate in the survey, so another small reserve sample was selected to replace settings from that LA if need be, consisting of 7 settings drawn from all other local authorities. Ultimately East Dunbartonshire did not participate, so those settings were removed from the initial sample and replaced with this small reserve sample.
Local authority |
Deprivation group |
Settings with eligible |
Total estimated eligible |
Issued sample (number of settings)[38] |
---|---|---|---|---|
Aberdeen City Council |
Less deprived |
22 |
147.4 |
7 |
Aberdeen City Council |
Most deprived |
4 |
81.6 |
2 |
Aberdeenshire Council |
48 |
144.0 |
16 |
|
Angus Council |
Less deprived |
18 |
102.6 |
6 |
Angus Council |
Most deprived |
4 |
28.4 |
1 |
Argyll & Bute Council |
23 |
85.0 |
8 |
|
Clackmannanshire Council |
Less deprived |
8 |
78.8 |
3 |
Clackmannanshire Council |
Most deprived |
7 |
72.2 |
2 |
Dumfries & Galloway Council |
Less deprived |
36 |
145.2 |
12 |
Dumfries & Galloway Council |
Most deprived |
9 |
46.8 |
3 |
Dundee City Council |
Less deprived |
20 |
172.6 |
8 |
Dundee City Council |
Most deprived |
12 |
107.4 |
5 |
East Ayrshire Council |
Less deprived |
11 |
165.8 |
4 |
East Ayrshire Council |
Most deprived |
13 |
189.2 |
5 |
East Lothian Council |
9 |
50.0 |
3 |
|
East Renfrewshire Council |
6 |
62.0 |
2 |
|
Edinburgh City Council |
Less deprived |
27 |
263.7 |
10 |
Edinburgh City Council |
Most deprived |
11 |
296.3 |
6 |
Falkirk Council |
Less deprived |
7 |
149.7 |
3 |
Falkirk Council |
Most deprived |
4 |
59.7 |
2 |
Fife Council |
Less deprived |
34 |
307 |
12 |
Fife Council |
Most deprived |
21 |
259.2 |
7 |
Glasgow City Council |
Less deprived |
56 |
369.8 |
20 |
Glasgow City Council |
Most deprived |
67 |
594.2 |
22 |
Highland Council |
Less deprived |
36 |
134.4 |
12 |
Highland Council |
Most deprived |
4 |
69.6 |
2 |
Inverclyde Council |
Less deprived |
6 |
76.2 |
2 |
Inverclyde Council |
Most deprived |
5 |
74.8 |
2 |
Midlothian Council |
29 |
174 |
10 |
|
Moray Council |
17 |
87.0 |
6 |
|
Na h-Eileanan Siar Council |
4 |
29.0 |
2 |
|
North Ayrshire Council |
Less deprived |
8 |
59.5 |
3 |
North Ayrshire Council |
Most deprived |
12 |
123.5 |
4 |
North Lanarkshire Council |
Less deprived |
34 |
202.0 |
12 |
North Lanarkshire Council |
Most deprived |
22 |
192.0 |
7 |
Orkney Islands Council |
7 |
19.5 |
3 |
|
Perth & Kinross Council |
28 |
181.0 |
9 |
|
Renfrewshire Council |
Less deprived |
19 |
108.3 |
7 |
Renfrewshire Council |
Most deprived |
21 |
188.7 |
7 |
Scottish Borders Council |
Less deprived |
30 |
84.6 |
10 |
Scottish Borders Council |
Most deprived |
4 |
28.5 |
2 |
Shetland Islands Council |
7 |
13.0 |
3 |
|
South Ayrshire Council |
Less deprived |
21 |
89.2 |
7 |
South Ayrshire Council |
Most deprived |
6 |
68.8 |
2 |
South Lanarkshire Council |
Less deprived |
26 |
262.6 |
9 |
South Lanarkshire Council |
Most deprived |
14 |
193.4 |
5 |
Stirling Council |
Less deprived |
14 |
45.7 |
5 |
Stirling Council |
Most deprived |
6 |
57.4 |
2 |
West Dunbartonshire Council |
Less deprived |
6 |
68.1 |
2 |
West Dunbartonshire Council |
Most deprived |
6 |
118.9 |
3 |
West Lothian Council |
Less deprived |
25 |
143.6 |
9 |
West Lothian Council |
Most deprived |
9 |
68.4 |
3 |
All |
923 |
7019 |
319 |
Data collection
Data were gathered on children in the cohort via two methods: a survey of parents/carers; and a survey of the children's ELC keyworkers (primarily to measure child development). Data about the settings were also collected, by a short online questionnaire for setting heads and observations of ELC settings attended by sampled children carried out by Care Inspectorate inspectors.[39]
Parents were recruited by ELC staff and provided with information about the study before being asked to complete a paper self-administered questionnaire that collected a wide range of information about themselves, their child and their household. Parents were also asked for their permission for the child's keyworker to complete a questionnaire about the child's development. This largely consisted of the Ages and Stages (ASQ) and Strengths and Difficulties (SDQ)[40] questionnaires but also collected administrative information, including the number of hours the child attended the ELC setting in the previous week and whether the child had Additional Support Needs. Parents and setting heads were also informed of the intention to follow up on the children in 2024, and contact details were collected in the parent questionnaire for this purpose only.
The setting heads questionnaire was new for Phase 4 of the study. It asked about support provided by the setting to families of eligible two-year-olds, rather than about specific children. It also asked about food provision by the setting and about challenges faced in relation to the expansion.
Response rates to the surveys are not easy to estimate because information about the exact number of eligible children within each setting was not available (see section on Sampling above). Of the 315 settings sampled, 14 were withdrawn by the local authority before the start of fieldwork due to, for example, concerns about other pressures on the settings. Four of these were replaced with settings from a reserve sample to ensure reasonable coverage in all of the 31 local authorities which agreed to participate. Twenty five settings informed ScotCen that they were unable to take part, including six who refused, mainly because of the burden on staff, nine who had no eligible children, and 10 for whom no reason was recorded. A further 126 settings did not participate. The proportion of these that were eligible is not known. At least one questionnaire (keyworker or parent) was returned from 152 settings. A total of 341 completed questionnaires were received from parents and 495 from keyworkers. Nine keyworker questionnaires were removed from the data as it was not clear from the information provided by the settings that the parents were aware that the questionnaires were being completed. This gave a total of 328 paired questionnaires. As a rough estimate, the response rate for the parent questionnaire was 29%, for the keyworker questionnaire was 41% and for both was 27%.[41] The setting heads' questionnaire was completed by 157 setting heads.
Observations were conducted of 149 participating ELC settings using the Infant and Toddler Environment Rating Scale (ITERS-3). This is a widely recognised and highly regarded instrument designed for use in settings where most children are aged under 36 months. It provides an observational measure of the quality of ELC settings for pre-school children across six domains: space and furnishings, personal care routines, language and literacy, learning activities, interaction and programme structure, as well as other observations around numbers of children and staff and access to outdoor space.
Observations were conducted by Care Inspectorate staff seconded to the study and involved a single visit lasting between 2 and 3 hours. It was emphasised to ELC setting managers and staff before and during these observations that they were not formal inspections of the kind routinely undertaken by the Care Inspectorate.
Data analysis and statistical significance
Data analysis has been conducted using the complex samples package of SPSS version 29. Using this, the clustering of children within settings can be taken into account, without the need to use multi-level models. All analysis uses weighted data for Phase 4, except where discussing the characteristics of the cohort or the characteristics of the settings. Different weights were applied, depending on the variables included in the analysis (see section on Weighting below). Tests for statistical significance have been conducted through the use of regression analysis, and all differences between subgroups at Phase 4 discussed within the text are statistically significant unless otherwise stated.
For the significance tests, categorical outcome variables have been reduced to binary variables, so that logistic regression analysis can be used. For example, ASQ scores have been reduced to "on schedule" / "not on schedule", as whether the child is on schedule with their development is the outcome of interest. This allows us to say that girls were more likely to be on schedule for the communication domain. Conducting a chi-square test using all categories of the ASQ variable would only allow us to say that there was an association between ASQ score and sex, with girls tending to do better. It also allows multiple independent variables to be included in the modelling, so relationships between ASQ scores and sex / area deprivation can be tested. For continuous outcome variables, such as SWEMWBS score, linear regression was used.
Weighting
Weights are commonly applied to survey data so that the achieved sample better represents the population it was drawn from. Groups that are under-represented in the achieved sample are given higher weights than those that are over-represented, with the aim of weighted data matching the population distribution by key characteristics. Survey estimates produced using the weighted data should then be closer to estimates that would have been gained from the whole population of interest.
As Phase 4 included multiple questionnaires, three sets of weights have been produced. These are for analysis of: setting head responses, keyworker responses and parent responses. The same basic weighting approach was used for all three sets of weights, with specific modifications where required. The approach was consistent with the weighting of Phases 2 and 3 of the project.
The basic weighting approach consisted of two elements: selection weighting and non-response modelling. The first stage adjusted for differential probability of selection (for settings and children) resulting from the sample design. The second stage adjusted for differences in the profiles of sampled and responding settings, using logistic regression modelling. Calibration weighting, which adjusts the profile of the weights to match estimates of the population, could not be used due to the absence of detailed population estimates for eligible two-year-olds.
Further details of the methods used to produce each set of weights are provided in the subsections below.
Setting Head Weights
Setting head weights were created for the 157 settings where a head responded. Only one setting head response was allowed from each setting. First, selection weights for the settings were created from the inverse selection probability during sampling. Second, a setting-level logistic regression model was run, weighted by the selection weight. The outcome for this model was response from the setting head and the covariates included were SIMD quintiles, setting type (LA or private/voluntary/non-profit), and size band. Non-response weights were calculated as the reciprocal of the propensity to respond estimated from this model. Finally, the non-response weights were combined with the setting selection weights and checked for outliers. No outlying weights required trimming. The design effect of the final setting head weights is 1.17 and the efficiency 86%.
Keyworker Weights
Keyworker weights were created for 486 keyworker responses. Up to 10 keyworker responses from each setting were allowed. First, selection weights for the settings were creating from the inverse selection probability of each setting during sampling. Second, a setting-level logistic regression model was run, weighted by the selection weight. The outcome for this model was any keyworker responses from the setting and the covariates included were SIMD quintiles, setting type (LA or private/voluntary/non-profit), and size band. Non-response weights were calculated for the 146 settings with keyworker responses as the reciprocal of the propensity to respond estimated from this model. Third, the setting-level non-response weights were combined with the setting selection weights and matched onto the 486 keyworker responses.
As the final step, child selection weights were calculated to adjust for children's differential probability of selection between settings. These were calculated from the inverse of number of children selected per setting (if recorded on the response sheet) or number of children sampled (if not available from the response sheet) divided by estimated eligible children at the setting. The setting-level weights were combined with the child selection weights and checked for outliers. The two top weights were trimmed to improve efficiency. The design effect of the final keyworker weights is 1.46 and the efficiency 69%.
Parent Weights
Parent weights were created for 341 parent responses. Up to 10 parent responses from each setting were allowed. First, selection weights for the settings were creating from the inverse selection probability of each setting during sampling. Second, a setting-level logistic regression model was run, weighted by the selection weight. The outcome for this model was any parent responses from the setting and the covariates included were SIMD quintiles, setting type (LA or private/voluntary/non-profit), and size band. Non-response weights were calculated for the 137 settings with parent responses as the reciprocal of the propensity to respond estimated from this model. The non-response weights were checked for outliers and the top two cases trimmed.
As the third step, setting-level non-response weights were combined with the setting selection weights and matched onto the 341 parent responses. Finally, child selection weights were calculated to adjust for children's differential probability of selection between settings. These were calculated from the inverse of number of children selected per setting (if recorded on the response sheet) or number of children sampled (if not available from the response sheet) divided by estimated eligible children at the setting. The setting-level weights were combined with the child selection weights and checked for outliers. The top weight was trimmed to improve efficiency. The design effect of the final parent weights is 1.52 and the efficiency 66%.
Contact
Email: socialresearch@gov.scot
There is a problem
Thanks for your feedback