Ted Cruz’s trollbot army, looking conclusive edition

With deeper data analysis, we’re seeing still more worrisome signs that Ted Cruz’s recent Twitter surge has been driven by bots or other manipulative accounts.

The upshot here? In a comparison between the 200 most active posters in the #TXSen race and the 200 most active posters in all other competitive races, the #TXSen posters (almost all Cruz supporters) were about 40% more likely to be deemed “trollbots.” More than three-quarters of top #TXSen posters are rated “Problematic” or “Alarming.” 

Last week we looked at the top 20 most frequent users of Cruz’s #KeepTexasRed hashtag. Bot Sentinel’s “Trollbot”-spotting service rated nine of these top 20 accounts as “Alarming,” with an overall average (mean) Bot Sentinel score of 71.3% — thus, at the high end of that service’s “Problematic” category.

It seemed striking. And was. But there was a problem with this observation: it lacked context.

Understanding the scope of a problem requires a basis for comparison. Without a benchmark, data that seems relevant may simply lack sufficient context. For instance, given the slightly-less-than-transparent way that machine-learning tools function, it could be that Bot Sentinel’s ratings simply skew very high, and that 71% isn’t as terrible as it sounds.

Alternately, the service’s ratings could be very heavily biased against Tweets that support the GOP. This bias would be invisible if we only looked at a single group of similar posters, as we did. You see the point; without perspective, the numbers appeared alarming, but were in fact suspect.

For all these reasons, we’re expanding our analysis to include a comparison group this time, while also looking at a much larger sample of Cruz-supporting Tweets.

COMPARISON TO: EVERYTHING ELSE

This time around, we’re examining the top 200 posters in the entire TXSen (Texas Senate) district, over a one-week period from Sept. 8 – Sept. 15 (with about half of Sept. 16 thrown in for good measure). As a benchmark comparison, we’re using the top 200 posters in all other 125 competitive districts contained in the Pollchatter database, in the same time period.

Before going farther, we should note that this is not a random sample from either population. Thus, we are not arguing that our results can be extrapolated beyond these top-200 lists. For example, it’s quite conceivable (though we haven’t researched this yet) that bots or motivated trolls may post more often than regular civilian-type folks, and thus be over-represented at the high end of lists such as these.

What this comparison does do is examine the accounts that the average Twitter viewer interested in #TXSen was most likely to run across as she read or posted during the course of that week. That is, an account that posts 40 times (or 142 times) in one week is likely to make a larger splash, and be seen by more people, than an account that posts once or twice. Thus, this top-200 measure is relevant in terms of potential impact within the Twitter community.

But enough caveats. Here are the results.

#TXSEN POSTERS ARE HIGHLY SKEWED TOWARD “PROBLEMATIC” OR “ALARMING”

Within the sample week’s top-200 posters, the vast majority were Cruz supporters. Based on an examination of tweet content and hashtag use, just three were Beto O’Rourke supporters, while one was a reporter for the Texas Tribune.

The average (mean) Bot Sentinel score for the top 200 posters was 62.9%. That’s somewhat lower than the average within last week’s top 20 #KeepTexasRed posters, but still at the midrange of Bot Sentinel’s “Problematic” category (Definition: “Our analysis has concluded XXXX exhibits problematic tweet activity and patterns similar to a trollbot account.”). The median, which can attenuate the influence of high and low outliers, was a score of 66%.

Within this list, 68 of the 200 were deemed “Alarming” (Definition: “Our analysis has concluded XXXX exhibits alarming tweet activity and patterns that match a trollbot account. Please report this account to Twitter and avoid.”). A total of 156 of the 200 were either Problematic or Alarming – thus, showing a Bot Sentinel score of above 50%. Every one of these 156 were Cruz supporters.

BENCHMARK POPULATION INSTEAD LOOKS “NORMAL”

The stats look very different within the top 200 list of accounts posting in all other competitive districts within the same time frame.

The average (mean) score in this population is just 23.22%, at the high end of Bot Sentinel’s “Normal” category (Definition: “Our analysis has concluded XXXXX exhibits normal tweet activity and is not a trollbot account.”). The median in this group was just 18%.

Within this comparison group, 10 of the accounts were deemed “Alarming”, and another 18 were “Problematic”. The remainder were either “Tolerable” or “Normal.”

This population comparison also helps address, if not wholly eliminate, some of the possible suspicions about bias within the Bot Sentinel model. For example, @WontMarchforSoros, a high-frequency pro-GOP poster, falls into the “Tolerable” category, while @MissouriGOP (the state Republic Party’s official account) is viewed as “Normal.” By contrast, the Democrat-supporting @DemocraticBlue is deemed “Alarming.”

So: It now seems very clear that Cruz’s surge of Twitter support over the last week, at least, has been strongly driven by suspicious accounts, in patterns that fail to resemble the norms set by other districts. Who’s responsible? We can’t say.

However, it’s certainly a concern. The very significant differences between the highly active Cruz-supporting population and the highly active population at large seem to make it evident that the Texas Senate race is currently the target of social-media manipulation, whatever its source. We’re in an era where this ought to be the subject of electoral-law investigations.

A final note: As any number-cruncher will tell you, this is a pretty rough-and-ready statistical analysis. For this reason, and because Pollchatter’s Twitter feed is not complete (see “How to use this site“), these conclusions should be treated as a slightly dirty window rather than as scientific-level analysis.

However, the conclusions are all replicable. The charts below contain the screen names that are members of each top-200 group, and our figures can be checked against Bot Sentinel (though that service’s ratings may change over time, as new information accrues). The Pollchatter source code is also available for inspection on GitHub. If anyone has any objections or concerns, we’d love to hear them.

Table 1: Top 200 posts within #TXSen, 9/8-9/16

RankScreen name# of Pollchatter appearances, 9/8-9/16Bot Sentinel score (out of 100)Bot Sentinel category
1Croatansound14252Problematic
2maniseku9252Problematic
3Mariatweets39097Alarming
4smartgirls4gop8174Problematic
5raybae689790Normal
6KayQuirk17850Problematic
7MikeDiaz2856472Problematic
8BeverlyConserv16176Alarming
9pmbasse5867Problematic
10GernertLynn5739Tolerable
11ChgocadChic5782Alarming
12Dan334944835483Alarming
13deeg254769Problematic
14Texan2A4648Tolerable
15JoeBiala4460Problematic
16PatrioticCindy4466Problematic
17MalsLAW4379Alarming
18BolindoMiss19754289Alarming
19Momcat21114255Problematic
20BearingTruth4056Problematic
21R982507294042Tolerable
22BrandonJLandry4079Alarming
23thathad2hurt4066Problematic
24bcarter764065Problematic
25BevConservative3964.02Imputed score
26Epluribusunimus3962Problematic
27AnilNS4253967Problematic
28keramirez3976Alarming
29Jojoinguette3991Alarming
30TCruzCrew3864.02Imputed score
31BoiseMarineMom3777Alarming
32Dlw201619503726Tolerable
33bestnaniever3652Problematic
34skyjones55360Normal
35SueAyoub3680Alarming
36Lisa225451833575Alarming
37NatalieTexas3470Problematic
38AmericaFirst1503450Problematic
39RhondaHurley23486Alarming
40Saint_Obama3467Problematic
41bonniemurphy3481Alarming
42RobertS302899193342Tolerable
43D_Ericson3326Tolerable
44Gretche666940013379Alarming
45OrlandoandresM3266Problematic
46TRUMP4ISRAEL3250Problematic
47PaulRog528968283293Alarming
48rratkinson3250Problematic
49Patriot_Girl_TX3170Problematic
50neff6303174Problematic
51fitzie633159Problematic
52Craybo233130Tolerable
53PhillipLaird3164Problematic
54Ray_Nolen3081Alarming
55VetWithACause3090Alarming
56TrumpCat043078Alarming
57SBtx173092Alarming
58DebraMMason13071Problematic
59lisamichelle203085Alarming
60zopitedaniel3033Tolerable
61VNTEXAS3031Tolerable
62BubbaJoeBaker2988Alarming
63JTS_19572957Problematic
64mastrolyn2941Tolerable
65dianne1h2978Alarming
66DrSueDVM2954Problematic
67KTyrky12950Problematic
68avoiceofliberty2940Tolerable
69Sgroberts1216572861Problematic
70WantBigHammer2861Problematic
71Beaudancer12877Alarming
72Iloveperfumeso2861Problematic
73AggieDad20152848Tolerable
74SonsMary2887Alarming
75cinrui2879Alarming
76Dr_Nahil2841Tolerable
77lynn_weiser2880Alarming
78aranom122867Problematic
79DRADAIV2769Problematic
80JewelsJones12765Problematic
81cmdorsey2762Problematic
82texasaunt2758Problematic
83tracy_texas62778Alarming
84kasmouse2780Alarming
85ScribbledPages12772Problematic
86pamela_pmlpatt2786Alarming
87pattygilli2671Problematic
88tedcruz268Normal
89patriotmom29112662Problematic
90VeldaMcclung2690Alarming
91CraigLisathomas2659Problematic
92RLiberalskiddin2669Problematic
93HorseShort2690Alarming
94MBOKSR_MAGA2670Problematic
95pononzu2671Problematic
96Sethro_tull_5562639Tolerable
97oidcop2587Alarming
98CaprockDemocrat250Normal
99braggingdragon1250Normal
100jearuiz2596Alarming
101stringfellowfr42571Problematic
102lisamichelle122590Alarming
103TeamTrumpTeX2543Tolerable
104Woke20082569Problematic
105cs0058sc2564.02Imputed score
106BeeNewsDaily2517Normal
107RightAsRain72580Alarming
108Drbob4442561Problematic
109SnowLover8142565Problematic
110DestinyandBruce2571Problematic
111Sp9825Micah2564.02Imputed score
112DarleneHBrook2541Tolerable
113TXSVIKING2540Tolerable
11488_Charlotte_882491Alarming
115daliafierro2487Alarming
116Stargazer20202482Alarming
117Dondorey12475Alarming
118TriathleteUSA2475Alarming
119THETXEMBASSY2473Problematic
120usajustice4all2470Problematic
121RebelReedRight2464.02Imputed score
122StillKicking622461Problematic
123TexianRising2445Tolerable
124Angelemichelle2429Tolerable
125PatrickSvitek243Normal
126DelarioDeborah2392Alarming
127aktweet1232392Alarming
128SharNeal2388Alarming
129roses124112384Alarming
130tcarmistead2382Alarming
131SandraTXAS2377Alarming
132Vfigi12373Problematic
133let7362373Problematic
134AmericaFirst462372Problematic
135OhioWomen4Trump2364Problematic
136JasonKManners2364.02Imputed score
137FreedomFrayed2361Problematic
138KST24242350Problematic
139Tsand882344Tolerable
140BennytheKite2320Normal
141BlogUSA20182288Alarming
142Blackdi512642992286Alarming
143KeathGifts2285Alarming
144lopezginnyyaho12282Alarming
145saiwarlord2282Alarming
146HeyJudeOregon2278Alarming
147robinlecronesco2275Alarming
148Ronin10212263Problematic
149DrSue193802260Problematic
150210GOPTEJANA2256Problematic
151AdamlCitizen2251Problematic
152LuzAyala22249Tolerable
153marjo2192248Tolerable
154ruhoward42247Tolerable
155NancyLRogers12222Normal
156Len218788322186Alarming
157grainofsand20202182Alarming
158LibertysArmyJr2182Alarming
159stefilassiter2180Alarming
160Mr_Arizona922179Alarming
161usvetram2178Alarming
162CASuperrunner2177Alarming
163harleygrl34652176Alarming
164djdun32176Alarming
165LawsEris2172Problematic
166Childlikefaaith2172Problematic
167mimiinftworth2164Problematic
168TexasTrumpGirl2163Problematic
169Trumptrooper22162Problematic
170FuriousFarmBoy2161Problematic
171pwirish2160Problematic
172SusanABriggs62158Problematic
173treva214481502155Problematic
174SharonEmily2144Tolerable
175tonywesolick2143Tolerable
176voiceofstentor2127Tolerable
177CruzNewsLive2118Normal
178sheacamm210Normal
179Evenstar8882093Alarming
180trader_sam2088Alarming
181AndrewsHarley2081Alarming
182DonnaCastel2080Alarming
183mountainman842078Alarming
184ruskaj2073Problematic
185thomasj174318262073Problematic
186Lissening2u2070Problematic
187MsJonsie2you2069Problematic
188nancy73gg2066Problematic
189Freedomtoroam12064Problematic
190lorylovesanimal2057Problematic
191RachelSuperEli12054Problematic
192NanaOxford2053Problematic
193silverbird4582050Problematic
194deb4liberty2049Tolerable
195sisterwash652047Tolerable
196NascarChuck3362045Tolerable
197DavidNe994692232044Tolerable
198davealvord1642043Tolerable
199b_roelf2025Tolerable
200TexOnTheLege205Normal
Mean62.9
Median66

 

Table 2: Top 200 posts not in #TXSen (125 districts), 9/8-9/16

RankScreen name# of Pollchatter appearances, 9/8-9/16Bot Sentinel score (out of 100)Bot Sentinel category
1votepledges12260Normal
2SuMoh774415Normal
3jckthesword6500Normal
4MD_3866290Normal
5ybbkaren49232Tolerable
6CallMcclintock4924Normal
7OperationOppo46061Problematic
8IndivisibleIL143315Normal
9RavenRothisPink31123.22Imputed score
10smithbarbv30110Normal
11GailWaldby2941Normal
12StormResist28013Normal
13GoldieAZ27218Normal
14Susana_Tupper27018Normal
15Fawkes_2017_24945Tolerable
16InSpiteOfTrump2464Normal
17waydelsew24037Tolerable
18PhilAmmann22019Normal
19PillsMom1821911Normal
20trudygonzales21724Normal
21WontMarch4Soros20438Tolerable
22tsteph922024Normal
23Mean_Adam2019Normal
24VAPolitical1914Normal
25mersmom81900Normal
26MagaGoldHat18785Alarming
27renaitre818716Normal
28davealvord16418443Tolerable
29BearmanShannon17925Tolerable
30pml_tray1780Normal
31Sisterwriter1765Normal
32Croatansound17352Problematic
33rieth_damon1718Normal
34jgcOCANADA1716Normal
35Unconquerable1698Normal
36EllisBelfer16739Tolerable
37AUdebm1673Normal
38RuneK_1515919Normal
39R9825072915542Tolerable
40MentalNote20181543Normal
41EzraBridger201815323.22Imputed score
42RdrgzKr1492Normal
43dpmax10014525Tolerable
44JakeWilkns1444Normal
45CongressRTBot1430Normal
46Milvio_Diaz14176Alarming
47JOEROIREY13224Normal
48ArtistsPo13215Normal
49Joni_Looking12922Normal
50jwyattronan1293Normal
51HMeemaw12634Tolerable
52ellievan651255Normal
53MMScholes1254Normal
54mydogtristan12419Normal
55DemocraticBlue12379Alarming
56Tony_Glorioso12223.22Imputed score
574CommonSense111956Problematic
58Str8DonLemon11459Problematic
59TeamPappasNH1140Normal
60podod1211240Tolerable
61prettyangeljac111121Normal
62Indivisible19NY1112Normal
63MissouriGOP11012Normal
64realdebfarmer10915Normal
65mcy20001083Normal
66HDowning11310776Alarming
67muellerew10542Tolerable
68ObamasHope10523.22Imputed score
69GarrenShipley1053Normal
70ssmtx610415Normal
71SassyGma61045Normal
72TheTrumpParty10381Alarming
732017Gamora10329Tolerable
74PaulMic1954hael10132Tolerable
75MadelyneRants10116Normal
76misen__1011Normal
77w55unicorn10040Tolerable
78stlajn110013Normal
79gerfingerpoken29996Alarming
80KimTompkins149920Normal
81joncprevo992Normal
82CaroleChatelai19817Normal
83ProgressiveFor9813Normal
84Eugene_Begay459733Tolerable
85myserenity699634Tolerable
86HH2017079621Normal
87jacksnowknows9616Normal
88gerfingerpoken9597Alarming
89SallyDeal49553Problematic
90Feminizzzm954Normal
91ActorAaronBooth950Normal
92elitmik9445Tolerable
93BookerBenjaminA9255Problematic
94ChrisPappasNH920Normal
95TroyMGood19166Problematic
96igggie19125Normal
97Lacus099120Normal
98NashvilleResist915Normal
99JimMcNichols1910Normal
100knsh55a9026Tolerable
101timeindawater19022Normal
102CindyMundell9013Normal
103Vote_Faso_NY19899Normal
104jennobenno8829Tolerable
105N1RMFAN8749Tolerable
106Cassie_o_peia8730Tolerable
107freeandclear18713Normal
108FulcrumOp8711Normal
109johninphx8656Problematic
110NotJayWebber8644Tolerable
111MyVoteMyKids8625Tolerable
1125141quid8625Tolerable
113prairielive8613Normal
114henricoblue8610Normal
115ElliotMalin863Normal
116MagaCandidates28559Problematic
117Solutioneer728554Problematic
118Pie_SocialMedia8518Normal
119dadkind8515Normal
120immelza8512Normal
121JolynnMarie6128459Problematic
122DannyHortonMO8432Tolerable
123haine_xxx840Normal
124Bread_Maker188334Tolerable
125lbweaver8333Tolerable
126CaptainsLog2O18837Normal
127BlondeMamaUSA_8271Problematic
128gayleasher18242Tolerable
129smcenturff8235Tolerable
130RGreggKeller8210Normal
131CATargetBot821Normal
132511nyLongIsland820Normal
133get_out_andVOTE8160Problematic
134CarmoSteph8131Tolerable
135SueMayerNH8112Normal
136ShannonRitenour8112Normal
137dino4dino8042Tolerable
138guardingblue8031Tolerable
139MatthewWolfff8029Tolerable
140cmcdonald048809Normal
141carlyinNJ7933Tolerable
142ElliotKumpe7910Normal
143investtoelectil794Normal
144dearjudge790Normal
145BSmarty177825Tolerable
146RodneyClaeys7733Tolerable
147BigSkyDemocrats771Normal
148barricko_bama7626Tolerable
149BrianCollinsMO7554Problematic
150laureldavilacpa7518Normal
151Intl_Resistance7468Problematic
152Die__Trying_7434Tolerable
153kals557413Normal
154archeology_girl7413Normal
155progresivetrend7410Normal
156jistBhappy749Normal
157TeamHawley746Normal
158UniteBlueMN740Normal
159MisterAndyWong740Normal
160jimstev037593757350Problematic
161ehartman19757327Tolerable
162RichLeonard4NH7318Normal
163NDGOP735Normal
16445xiatai733Normal
165Kopser4Congress730Normal
166BeckyStroud507277Alarming
167bannerite7223Normal
168VoteHealth7221Normal
169Beverly21811568720Normal
170Susanv123457162Problematic
171DonaldsNeckVag7132Tolerable
172TonyContino7122Normal
173AndersonDrew712Normal
174Bellatrixx20187025Tolerable
175Nov2018election7023.22Imputed score
176AgitatorsNeeded7022Normal
177BlueWaveCS706Normal
178Jojoinguette6991Alarming
179ElizabethGrisc16966Problematic
180KikiAdine6922Normal
181WebOften6918Normal
182ChrisJZullo6917Normal
183GretchenPerki146913Normal
184pleasesaveour699Normal
185Its_Bhagyashree693Normal
186WMUR9691Normal
187TrumpWatchNews690Normal
188ReneeDahl56868Problematic
189GailHorn6847Tolerable
190Paulsen_Trump6831Tolerable
191cateady536810Normal
192AVoiceForAllGA688Normal
193emyryly684Normal
194TeamDean683Normal
195NaphiSoc6716Normal
196LambYoga677Normal
197tankcat673Normal
198DestructiveChem6621Normal
199NomdeB6610Normal
200PlyMNIndivisibl662Normal
Mean23.2255
Median18

NOTE: Within both populations, a small number of accounts could not be assigned scores by Bot Sentinel’s service. Within the benchmark population, we have given these accounts a score equivalent to the overall average (mean), a technique known as “imputation” in statistics. Within the Texas Senate population, each of the missing values was a Cruz supporter – thus, rather than assigning them to the overall mean, we have assigned them a value equivalent to the mean among all Cruz supporters.