Voting Behavior in Naples, Italy: Statistical Correspondence Analysis

November 21, 2023
Dr. Nakamura
🇯🇵 Japan
Statistical Analysis
Dr. Nakamura holds a PhD in Statistical Engineering from the University of Tokyo. With over 350 homework assignments completed, he has extensive experience in statistical reliability analysis. His academic journey and practical work make him a seasoned expert in developing and applying advanced statistical models for engineering systems. Dr. Nakamura's insights help students grasp complex concepts effectively.

In this statistical data analysis homework, we study the intricacies of voting behavior within the Naples, Italy district. We analyze a dataset encompassing the number of valid votes for six political leaders across ten municipalities. Our aim is to uncover the patterns and relationships in how these municipalities cast their votes. Explore the results and interpretations below to gain valuable insights into this captivating electoral landscape.

Problem Description:

In this Data Analysis homework, we delve into election data from Naples, Italy. The dataset gives the number of valid votes in each of ten municipalities of the Naples district, broken down into six categories: five named political leaders (Berlusconi, Bersani, Grillo, Monti, and Ingroia) and an "Others" category. With a total of 450,372 voting observations, our objective is to conduct a Correspondence Analysis (CA) to explore the association between the two categorical variables: the municipality and the political leader voted for. The primary aim of this analysis is to gain a deeper understanding of the voting patterns across Naples, Italy.

Solution:

Results and Interpretations

Test of independence between the rows and the columns:

Chi-square (Observed value)   11925.220
Chi-square (Critical value)   61.656
DF                            45
p-value                       0.0001
alpha                         0.050

Since the p-value (0.0001) is smaller than alpha (0.05), the chi-square test is significant: we reject the hypothesis of independence and conclude that the two variables are associated.

Total inertia = 0.026

Interpretation: Total inertia is the chi-square statistic divided by the total number of observations (n). It indicates the total amount of information to be explained.

The total inertia, also known as the total weighted variance, is shared among the five components and is calculated to be 0.026, as shown above.
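The relation between the chi-square statistic, the degrees of freedom, and the total inertia can be checked with a few lines of Python. This is a minimal sketch that uses only the figures reported above; the variable names are illustrative and not part of the original analysis.

```python
# Values reported in the test of independence above.
chi_square = 11925.220   # observed chi-square statistic
n_votes = 450_372        # total number of observations (n)
n_rows, n_cols = 10, 6   # municipalities x political-leader categories

# Degrees of freedom of a two-way contingency table.
degrees_of_freedom = (n_rows - 1) * (n_cols - 1)

# Total inertia is the chi-square statistic divided by n.
total_inertia = chi_square / n_votes

print(degrees_of_freedom)        # 45
print(round(total_inertia, 3))   # 0.026
```

Both results agree with the reported DF of 45 and total inertia of 0.026.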

Eigenvalues and percentages of inertia:

               F1       F2       F3       F4       F5
Eigenvalue     0.017    0.008    0.002    0.000    0.000
Inertia %      63.852   29.301   5.834    0.674    0.340
Cumulative %   63.852   93.153   98.986   99.660   100.000

First, a single dimension explains 63.85% of the inertia; that is, the relative frequencies reconstructed from one dimension reproduce 63.85% of the total chi-square value for this two-way table. Two dimensions explain 93.15%.

[Figure: chi-square-value]

Interpretation: The percentages of inertia show that the first two factors account for 93.15% of the total inertia in the dataset. The analysis of voting behavior across the municipalities is therefore based on F1 and F2.

According to the graph above, only dimensions 1 and 2 should be used in the solution. Dimension 3 has an eigenvalue of only 0.002 and explains 5.83% of the total inertia, which is below the average of 20% per dimension (100% divided by 5 dimensions).
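The choice of two dimensions can be reproduced from the inertia percentages in the eigenvalue table. The sketch below assumes the common "above-average inertia" rule of thumb, which is one possible retention criterion rather than necessarily the one used by the software that produced the table.

```python
from itertools import accumulate

# Percentage of inertia per factor, copied from the eigenvalue table.
inertia_pct = {"F1": 63.852, "F2": 29.301, "F3": 5.834, "F4": 0.674, "F5": 0.340}

# Running total of explained inertia.
cumulative_pct = dict(zip(inertia_pct, accumulate(inertia_pct.values())))

# Average share per factor if inertia were spread evenly.
average_pct = 100.0 / len(inertia_pct)

# Keep the factors that explain more than the average share.
retained = [axis for axis, pct in inertia_pct.items() if pct > average_pct]

print(round(cumulative_pct["F2"], 3))  # 93.153
print(retained)                        # ['F1', 'F2']
```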

Profiles (rows):

       Berlusconi   Bersani   Grillo   Monti   Ingroia   Others   Sum
M01    0.325        0.282     0.194    0.152   0.022     0.025    1.000
M02    0.313        0.303     0.238    0.083   0.039     0.023    1.000
M03    0.310        0.296     0.253    0.081   0.038     0.021    1.000
M04    0.351        0.267     0.252    0.068   0.034     0.028    1.000
M05    0.224        0.368     0.232    0.115   0.041     0.020    1.000
M06    0.292        0.339     0.246    0.065   0.030     0.028    1.000
M07    0.406        0.219     0.241    0.079   0.028     0.026    1.000
M08    0.337        0.254     0.259    0.081   0.041     0.028    1.000
M09    0.329        0.259     0.270    0.079   0.043     0.022    1.000
M10    0.228        0.343     0.272    0.088   0.048     0.021    1.000
Mean   0.312        0.293     0.246    0.089   0.036     0.024    1.000

Interpretation: The above table gives, for each municipality, the proportion of votes cast for each political leader. These are the values that will be plotted on the row-oriented plot. CA investigates the differences between each individual row profile and the average row profile.

From the table above we can observe that, on average across municipalities, 31.2% of the votes went to Berlusconi, against 29.3% for Bersani, followed by Grillo at 24.6%. Even though Berlusconi received the largest share of votes, the municipalities take varied positions according to their political inclinations. For example, a larger proportion of Municipality 5 voted for Bersani (36.8%) than for Berlusconi (22.4%). On the other hand, a larger proportion of Municipality 1 voted for Berlusconi (32.5%) than for Bersani (28.2%). Further investigation is required to understand each municipality's political inclination and its voting behavior with respect to the political leaders.

The profiles of M01, M05 and M07 differ most visibly from the mean profile: M01 has an unusually high share for Monti (15.2%), M05 for Bersani (36.8%), and M07 for Berlusconi (40.6%).
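How far a row profile lies from the average profile can be quantified with the chi-square distance that CA uses. The sketch below is illustrative only: it takes the row profiles from the table above and, as an assumption, uses the relative weights of the political leaders reported later in the column contributions table as the average profile. Because all inputs are rounded to three decimals, the results are approximate.

```python
from math import sqrt

# Relative weights (masses) of the six columns, in the order
# Berlusconi, Bersani, Grillo, Monti, Ingroia, Others.
col_mass = [0.303, 0.300, 0.246, 0.091, 0.037, 0.024]

# Row profiles copied from the table above (same column order).
row_profiles = {
    "M01": [0.325, 0.282, 0.194, 0.152, 0.022, 0.025],
    "M02": [0.313, 0.303, 0.238, 0.083, 0.039, 0.023],
    "M03": [0.310, 0.296, 0.253, 0.081, 0.038, 0.021],
    "M04": [0.351, 0.267, 0.252, 0.068, 0.034, 0.028],
    "M05": [0.224, 0.368, 0.232, 0.115, 0.041, 0.020],
    "M06": [0.292, 0.339, 0.246, 0.065, 0.030, 0.028],
    "M07": [0.406, 0.219, 0.241, 0.079, 0.028, 0.026],
    "M08": [0.337, 0.254, 0.259, 0.081, 0.041, 0.028],
    "M09": [0.329, 0.259, 0.270, 0.079, 0.043, 0.022],
    "M10": [0.228, 0.343, 0.272, 0.088, 0.048, 0.021],
}


def chi_square_distance(profile, masses):
    """Chi-square distance between a profile and the average profile."""
    return sqrt(sum((p - m) ** 2 / m for p, m in zip(profile, masses)))


distances = {name: chi_square_distance(p, col_mass) for name, p in row_profiles.items()}

# Municipalities ordered from most to least distinctive.
ranked = sorted(distances, key=distances.get, reverse=True)
for name in ranked:
    print(name, round(distances[name], 3))
```

With these rounded inputs, M01, M05 and M07 come out farthest from the average profile and M02 and M03 closest to it.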

Principal coordinates (rows):

       F1       F2       F3       F4       F5
M01    -0.005   0.248    0.004    0.011    -0.005
M02    -0.012   -0.015   0.012    -0.021   0.019
M03    -0.020   -0.031   -0.002   -0.015   -0.010
M04    -0.124   -0.042   0.024    0.008    0.005
M05    0.208    0.036    -0.002   -0.010   0.004
M06    0.032    -0.072   0.090    0.010    -0.006
M07    -0.243   0.032    0.008    -0.010   -0.005
M08    -0.106   -0.023   -0.035   0.022    0.017
M09    -0.087   -0.047   -0.056   -0.006   -0.010
M10    0.153    -0.078   -0.040   0.015    -0.005

[Figure: symmetric row plot]

Interpretation: The symmetric row plot above shows the municipalities on Factor 1 and Factor 2, which together explain 93.15% of the total inertia. Municipalities 7, 5 and 1 lie farthest from the origin (the mean profile), indicating that they have the most distinctive voting patterns. M07 and M05 lie on opposite sides of F1, so they are opposite in their voting behavior and in the political side they lean towards. Two points that are close to each other share a similar profile, as M02 and M03 do.

In this correspondence analysis, five factors were considered in the row analysis, which covers the 10 municipalities across the political leaders. From the results presented, M01, M05, M07 and M10 have the largest coordinates in absolute value and therefore show the greatest variability among the municipalities. The sum of the absolute values of the coordinates on the first two factors is larger than that on the last three. Hence, the first two factors, F1 and F2, are sufficient to explain the variability and relationships among the municipalities.
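The comparison of absolute coordinate sums can be verified directly from the principal coordinates table. This is a small sketch using the published (rounded) values; the variable names are illustrative.

```python
# Principal coordinates of the rows (F1..F5), copied from the table above.
row_coords = {
    "M01": [-0.005, 0.248, 0.004, 0.011, -0.005],
    "M02": [-0.012, -0.015, 0.012, -0.021, 0.019],
    "M03": [-0.020, -0.031, -0.002, -0.015, -0.010],
    "M04": [-0.124, -0.042, 0.024, 0.008, 0.005],
    "M05": [0.208, 0.036, -0.002, -0.010, 0.004],
    "M06": [0.032, -0.072, 0.090, 0.010, -0.006],
    "M07": [-0.243, 0.032, 0.008, -0.010, -0.005],
    "M08": [-0.106, -0.023, -0.035, 0.022, 0.017],
    "M09": [-0.087, -0.047, -0.056, -0.006, -0.010],
    "M10": [0.153, -0.078, -0.040, 0.015, -0.005],
}

# Sum of absolute coordinates for each of the five factors.
abs_sum_by_factor = [
    sum(abs(coords[k]) for coords in row_coords.values()) for k in range(5)
]

first_two = sum(abs_sum_by_factor[:2])    # F1 + F2
last_three = sum(abs_sum_by_factor[2:])   # F3 + F4 + F5

print(round(first_two, 3), round(last_three, 3))  # 1.614 0.487
```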

Contributions (rows):

       Weight (relative)   F1      F2      F3      F4      F5
M01    0.093               0.000   0.740   0.001   0.060   0.021
M02    0.087               0.001   0.003   0.009   0.214   0.345
M03    0.099               0.002   0.012   0.000   0.120   0.113
M04    0.085               0.078   0.019   0.032   0.029   0.020
M05    0.149               0.383   0.025   0.000   0.080   0.027
M06    0.106               0.006   0.070   0.563   0.059   0.041
M07    0.079               0.275   0.011   0.003   0.047   0.022
M08    0.085               0.056   0.006   0.067   0.234   0.266
M09    0.106               0.047   0.030   0.211   0.025   0.108
M10    0.110               0.152   0.085   0.113   0.134   0.037

[Figure: asymmetric row plot]

Interpretation: The asymmetric row plot above shows both the municipalities and the political leaders on the two main factors, F1 and F2. This graph helps to visualize the relationship between municipalities and political leaders. For example, relative to the mean, a larger proportion of voters in M05, M06 and M10 chose Bersani, whereas M07, M08, M04 and M09 lean towards Berlusconi: their points are pulled towards Berlusconi and away from the origin. In M10, the proportion voting for Ingroia is greater than in the other municipalities.

Each municipality point is the barycenter (weighted average) of the red points, where the weights reflect how strongly that municipality voted for each candidate.
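The row contributions table above can be approximately reproduced from the relative weights and the principal coordinates, using the standard CA relation contribution = weight × coordinate² / eigenvalue. In the sketch below the eigenvalues are recomputed as total inertia times the inertia percentage, which is an assumption made to avoid the heavy rounding of the published eigenvalues (0.017 and 0.008); small discrepancies with the table are expected because the inputs are rounded.

```python
# Total inertia recomputed from the chi-square statistic and n.
total_inertia = 11925.220 / 450_372

# Eigenvalues of the first two factors (inertia share x total inertia).
eigenvalues = {"F1": total_inertia * 0.63852, "F2": total_inertia * 0.29301}

# name: (relative weight, F1 coordinate, F2 coordinate), from the tables above.
rows = {
    "M01": (0.093, -0.005, 0.248),
    "M02": (0.087, -0.012, -0.015),
    "M03": (0.099, -0.020, -0.031),
    "M04": (0.085, -0.124, -0.042),
    "M05": (0.149, 0.208, 0.036),
    "M06": (0.106, 0.032, -0.072),
    "M07": (0.079, -0.243, 0.032),
    "M08": (0.085, -0.106, -0.023),
    "M09": (0.106, -0.087, -0.047),
    "M10": (0.110, 0.153, -0.078),
}


def contribution(mass, coord, eigenvalue):
    """Share of a factor's inertia that is due to one point."""
    return mass * coord ** 2 / eigenvalue


contrib = {
    name: (
        contribution(mass, f1, eigenvalues["F1"]),
        contribution(mass, f2, eigenvalues["F2"]),
    )
    for name, (mass, f1, f2) in rows.items()
}

for name, (c1, c2) in contrib.items():
    print(name, round(c1, 3), round(c2, 3))
```

Read this way, F1 is built mostly by M05, M07 and M10, and F2 mostly by M01, which matches the published contributions.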

Squared Cosines (rows):

       F1      F2      F3      F4      F5      Sum of F1 and F2
M01    0.000   0.997   0.000   0.002   0.000   0.998
M02    0.105   0.172   0.117   0.335   0.272   0.276
M03    0.241   0.567   0.003   0.128   0.061   0.808
M04    0.865   0.098   0.033   0.003   0.001   0.963
M05    0.969   0.028   0.000   0.002   0.000   0.997
M06    0.071   0.354   0.566   0.007   0.002   0.425
M07    0.980   0.017   0.001   0.002   0.000   0.997
M08    0.814   0.040   0.089   0.036   0.021   0.854
M09    0.581   0.170   0.239   0.003   0.007   0.751
M10    0.748   0.193   0.051   0.007   0.001   0.941

Interpretation: The squared cosines measure how well each municipality is represented on each factor (the quality of representation), not statistical significance. Summing the squared cosines of F1 and F2 gives the quality of each municipality's representation in the two-dimensional plot. The sum exceeds 0.75 for eight of the ten municipalities, so factors 1 and 2 describe the voting behavior of most municipalities well; the exceptions are M02 (0.276) and M06 (0.425).

The result of the analysis shows that the contingency table has been successfully represented in a low-dimensional space using correspondence analysis. Factors 1 and 2 are sufficient to retain 93.15% of the total inertia (variation) contained in the data. However, not all the points are equally well displayed in the two dimensions. If a row item is well represented by two dimensions, the sum of the cos2 is close to one, as for M01, M07 and M05. For some of the row items, such as M02, more than two dimensions are required to represent the data well.
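The squared cosines follow from the principal coordinates: for each municipality, the squared coordinate on a factor is divided by the sum of its squared coordinates over all five factors. The sketch below applies this to a few municipalities using the published coordinates; since those are rounded to three decimals, the results only approximate the table, most visibly for points close to the origin such as M02.

```python
# Principal coordinates (F1..F5) for a few municipalities, from the table above.
row_coords = {
    "M01": [-0.005, 0.248, 0.004, 0.011, -0.005],
    "M02": [-0.012, -0.015, 0.012, -0.021, 0.019],
    "M06": [0.032, -0.072, 0.090, 0.010, -0.006],
    "M07": [-0.243, 0.032, 0.008, -0.010, -0.005],
}


def squared_cosines(coords):
    """Squared cosine of a point with each factor."""
    total = sum(c ** 2 for c in coords)
    return [c ** 2 / total for c in coords]


# Quality of representation in the F1-F2 plane.
quality_2d = {
    name: sum(squared_cosines(coords)[:2]) for name, coords in row_coords.items()
}

for name, quality in quality_2d.items():
    print(name, round(quality, 3))
```

M01 and M07 come out very close to 1, while M02 and M06 stay well below, in line with the last column of the table.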

Profiles (columns)

       Berlusconi   Bersani   Grillo   Monti   Ingroia   Others   Mean
M01    0.100        0.088     0.074    0.157   0.055     0.099    0.095
M02    0.090        0.088     0.085    0.080   0.093     0.085    0.087
M03    0.101        0.098     0.102    0.089   0.101     0.089    0.097
M04    0.099        0.076     0.088    0.064   0.078     0.100    0.084
M05    0.111        0.184     0.141    0.189   0.166     0.122    0.152
M06    0.102        0.120     0.106    0.077   0.087     0.123    0.103
M07    0.105        0.057     0.077    0.069   0.061     0.087    0.076
M08    0.094        0.072     0.090    0.076   0.094     0.101    0.088
M09    0.115        0.091     0.116    0.092   0.122     0.096    0.105
M10    0.083        0.126     0.122    0.107   0.142     0.098    0.113
Sum    1.000        1.000     1.000    1.000   1.000     1.000    1.000

The column profiles show how each political leader's votes are distributed across the municipalities. In the graph, Berlusconi and Bersani are opposite with respect to these profiles. Both are far from the origin, so both differ from the mean profile, and they differ in opposite directions: municipalities that supply a relatively large share of one leader's votes tend to supply a relatively small share of the other's. For example, M05 supplied 18.4% of Bersani's votes but only 11.1% of Berlusconi's, while M07 supplied 10.5% of Berlusconi's votes but only 5.7% of Bersani's. Likewise, the table shows that M01 accounts for 10.0% of Berlusconi's votes but only 8.8% of Bersani's.

Principal coordinates (columns):

             F1       F2       F3       F4       F5
Berlusconi   -0.172   0.025    0.013    -0.008   0.002
Bersani      0.146    -0.019   0.037    -0.005   0.001
Grillo       -0.009   -0.078   -0.032   0.009    -0.010
Monti        0.107    0.243    -0.049   0.005    -0.002
Ingroia      0.084    -0.129   -0.120   -0.017   0.034
Others       -0.088   0.003    0.064    0.071    0.029

The table above presents the five principal coordinates of the political leaders, with different magnitudes. The most important coordinates are the first two, F1 and F2. This implies that the conclusions drawn from all five coordinates would be almost the same as those drawn from the first two. Every factor has an equal number of positive and negative values except F5.

Going by the mean column profile, M05 contributed the largest share of valid votes (15.2%). Within M05, Monti drew the largest share of his own votes (18.9%), followed by Bersani (18.4%) and then Ingroia (16.6%). The remaining political leaders drew smaller shares of their votes from M05 than these three.

[Figure: principal coordinates]

Contributions (columns):

             Weight (relative)   F1      F2      F3      F4      F5
Berlusconi   0.303               0.533   0.024   0.035   0.096   0.009
Bersani      0.300               0.378   0.014   0.261   0.044   0.002
Grillo       0.246               0.001   0.193   0.160   0.115   0.285
Monti        0.091               0.062   0.690   0.139   0.015   0.004
Ingroia      0.037               0.015   0.079   0.342   0.058   0.470
Others       0.024               0.011   0.000   0.063   0.672   0.230

The table above gives the relative weight of each political leader together with each leader's contribution to the five factors. By weight, Berlusconi received the largest share of valid votes across municipalities (30.3%), followed by Bersani (30.0%) and Grillo (24.6%). The remaining political leaders account for only about 15% collectively. F1 is driven mainly by Berlusconi (53.3%) and Bersani (37.8%), while F2 is driven mainly by Monti (69.0%).

Conclusion

Overall, the data analysis above provides insight into the municipalities' voting behavior across the different candidates. The results and interpretations above support the following key conclusions:

  • Berlusconi was the lead contender in number of votes: on average across the municipalities of Naples he received 31.2% of the votes, followed by Bersani in second position with 29.3%.
  • The political leaders Berlusconi, Bersani, and Grillo together made up about 85% of the total votes.
  • Berlusconi and Bersani are positioned on opposite sides of the political spectrum, Berlusconi on the right wing and Bersani on the left wing, and they also lie on opposite sides of F1.
  • Municipality 7 had the largest proportion of its voters choosing Berlusconi (40.6%), while Municipality 5 had the largest proportion choosing Bersani (36.8%).
  • The share of votes for Grillo in Municipality 1 was particularly low (19.4% against a mean of 24.6%), which may reflect a divergence in political inclination and thinking.
  • The other candidates in the election tended to be more aligned with the right wing, potentially drawing additional votes away from Berlusconi.
  • Although Ingroia only succeeded in taking 3.6% of the total votes in Naples, his party affiliation was more left-wing, thus potentially taking votes away from Bersani.
  • Municipalities 2 and 3 were closest to the mean, indicating that their vote shares closely matched the average distribution across the political leaders.
