Cecropin

Cecropins [PMID: 3318666, PMID: 2015623, PMID: 1915368] are potent antibacterial proteins that constitute a main part of the cell-free immunity of insects. Cecropins are small proteins of about 35 amino acid residues active against both Gram-positive and Gram-negative bacteria. They seem to exert a lytic action on bacterial membranes. Cecropins isolated from insects other than Hyalophora cecropia (Cecropia moth) have been given various names; bactericidin, lepidopteran, sarcotoxin, etc. All of these peptides are structurally related. Cecropin P1, an intestinal antibacterial peptide from Sus scrofa (Pig), also belongs to this family.

The below sequences were used to create Cecropin sequence signatures:

>CAMPSQ128
GWIRDFGKRIERVGQHTRDATIQTIAVAQQAANVAATLKG
>CAMPSQ887
GWLRKLGKKIERIGQHTRDASIQVLGIAQQAANVAATAR
>CAMPSQ125
GWLKKIGKKIERVGQHTRDATIQGLGIAQQAANVAATAR
>CAMPSQ886
GWLKKIGKKIERVGQHTRDATIQGLGIAQQAANVAATARG
>CAMPSQ127
GWLRKIGKKIERVGQHTRDATIQVLGIAQQAANVAATAR
>CAMPSQ126
GWLKKIGKKIERVGQHTRDATIQVIGVAQQAANVAATAR
>CAMPSQ513
GWLKKIGKKIERVGQHTRDATIQTIAVAQQAANVAATAR
>CAMPSQ473
GRSKKLGKKIEKAGKRVFNAAQKGLPVAAGVQAL
>CAMPSQ474
GRLKKLGKKIEKAGKRVFNAVQKGLPVAAGVQAL
>CAMPSQ2729
GGLKKLGKKLEGVGKRVFKASEKALPVAVGIKALGK
>CAMPSQ471
GRLKKLGKKIEGAGKRVFKAAEKALPVVAGVKALG
>CAMPSQ1107
GGLKKLGKKLEGAGKRVFNAAEKALPVVAGAKAL
>CAMPSQ285
WKPFKKIEKAVRRVRDGVAKAGPAVAVVGQAT
>CAMPSQ3704
KPFKKLEKVGRNIRNGIIRYNGPAVAVIGQA
>CAMPSQ3703
KPFKKLEKVGRNIRDGIIKAGPAVAVIGQATSIARPTGK
>CAMPSQ812
KWKIFKKIEHMGQNIRDGLIKAGPAVQVVGQAATIYKG
>CAMPSQ56
KWKLFKKIEKVGQNIRDGIIKAGPAVAVVGQATQIAK
>CAMPSQ287
KWKVFKKIEKVGRNIRDGIVKAGPAIAVLGQAN
>CAMPSQ57
KWKIFKKIEKVGRNIRNGIIKAGPAVAVLGEAKAL
>CAMPSQ58
KWKVFKKIEKMGRNIRNGIVKAGPAIAVLGEAKAL
>CAMPSQ2728
KWKVFKKIEKMGRNIRNGIVKAGPAIAVLGEAKAILS
>CAMPSQ252
RWKIFKKIEKMGRNIRDGIVKAGPAIEVLGSAKAI
>CAMPSQ1134
RWKVFKKIEKMGRNIRDGIVKAGPAIEVLGSAKALGK
>CAMPSQ251
RWKLFKKIEKVGRNVRDGLIKAGPAIAVIGQAKSL
>CAMPSQ1135
RWKVFKKIEKVGRNVRDGIIKAGPAIGVLGQAKALG
>CAMPSQ286
RWKVFKKIEKVGRNIRDGVIKAAPAIEVLGQAKAL
>CAMPSQ288
RWKVFKKIEKMGRNIRDGVIKAAPAIEVLGQAK
>CAMPSQ161
ENFFKEIERAGQRIRDAIISAAPAVETLAQAQKIIKGGD
>CAMPSQ59
WNPFKELERAGQRVRDAIISAGPAVATVAQATALAK
>CAMPSQ60
WNPFKELEKVGQRVRDAVISAGPAVATVAQATALAK
>CAMPSQ55
WNPFKELERAGQRVRDAVISAAAVATVGQAAAIARGG
>CAMPSQ53
WNPFKELERAGQRVRDAIISAGPAVATVGQAAAIARG
>CAMPSQ52
WNPFKELERAGQRVRDAVISAAPAVATVGQAAAIARG
>CAMPSQ54
WNPFKELERAGQRVRDAIISAAPAVATVGQAAAIARG
>CAMPSQ883
VFIDILDKVENAIHNAAQVGIGFAKPFEKLINPK
>CAMPSQ2594
WLSKTYKKLENSAKKRISEGVAIAILGGLR
>CAMPSQ3311
SWLSKTYKKLENSAKKRISEGVAIAILGGLR
>CAMPSQ2592
WLSKTAKKLENSAKKRISEGIAIAIKGGSR
>CAMPSQ3310
SWLSKTAKKLENSAKKRISEGIAIAIKGGSR
>CAMPSQ534
SWLSKTAKKLENSAKKRISEGIAIAIQGGPR
>CAMPSQ2593
WLSKTYKKLENSAKKRISEGIAIAIQGGPR
>CAMPSQ3309
SWLSKTYKKLENSAKKRISEGIAIAIQGGPR
>CAMPSQ172
KWKLFKKIPKFLHSAKKF
>CAMPSQ171
KWKLFKKIGIGKFLHSAKKF
>CAMPSQ173
KLKLFKKIGIGKFLHSAKKF
>CAMPSQ174
KAKLFKKIGIGKFLHSAKKF

The below sequences were retrieved using Cecropin sequence signatures:

> CAMPCecP37, CAMPCecH, CAMPCecH37 | CAMPSQ52 | P14662 | 116080
WNPFKELERAGQRVRDAVISAAPAVATVGQAAAIARG
> CAMPCecP37, CAMPCecH, CAMPCecH37 | CAMPSQ53 | P14663 | 116081
WNPFKELERAGQRVRDAIISAGPAVATVGQAAAIARG
> CAMPCecP37, CAMPCecH, CAMPCecH31 | CAMPSQ54 | P14664 | 116083
WNPFKELERAGQRVRDAIISAAPAVATVGQAAAIARG
> CAMPCecP37, CAMPCecH, CAMPCecH31 | CAMPSQ55 | P14665 | 116084
WNPFKELERAGQRVRDAVISAAAVATVGQAAAIARGG
> CAMPCecP37, CAMPCecH, CAMPCecH31 | CAMPSQ56 | P01507 | 116087, 223330
KWKLFKKIEKVGQNIRDGIIKAGPAVAVVGQATQIAK
> CAMPCecP35, CAMPCecH, CAMPCecH31 | CAMPSQ57 | P01509 | 116088
KWKIFKKIEKVGRNIRNGIIKAGPAVAVLGEAKAL
> CAMPCecP37, CAMPCecH, CAMPCecP35, CAMPCecH35 | CAMPSQ58 | P01508 | 116090
KWKVFKKIEKMGRNIRNGIVKAGPAIAVLGEAKAL
> CAMPCecH, CAMPCecH31 | CAMPSQ59 | P01511 | 116091
WNPFKELERAGQRVRDAIISAGPAVATVAQATALAK
> CAMPCecH31, CAMPCecH, CAMPCecH37 | CAMPSQ60 | P01510 | 116092
WNPFKELEKVGQRVRDAVISAGPAVATVAQATALAK
> CAMPCecH | CAMPSQ125 | P08375 | 134859
GWLKKIGKKIERVGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ126 | P08376 | 134861
GWLKKIGKKIERVGQHTRDATIQVIGVAQQAANVAATAR
> CAMPCecH | CAMPSQ127 | P08377 | 134862
GWLRKIGKKIERVGQHTRDATIQVLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ128 | P18312 | 134863
GWIRDFGKRIERVGQHTRDATIQTIAVAQQAANVAATLKG
> CAMPCecH, CAMPCecH31 | CAMPSQ161 | P85210 | 156630481
ENFFKEIERAGQRIRDAIISAAPAVETLAQAQKIIKGGD
> CAMPCecH, CAMPCecH31 | CAMPSQ251 | Q27239 | 227386, 225797
RWKLFKKIEKVGRNVRDGLIKAGPAIAVIGQAKSL
> CAMPCecP35, CAMPCecH31, CAMPCecH | CAMPSQ252 | P04142 | 1705754, 546998
RWKIFKKIEKMGRNIRDGIVKAGPAIEVLGSAKAI
> CAMPCecH, CAMPCecH37 | CAMPSQ285 | P83420 | 25089847
WKPFKKIEKAVRRVRDGVAKAGPAVAVVGQAT
> CAMPCecP37, CAMPCecH, CAMPCecP35, CAMPCecH31 | CAMPSQ286 | P83413 | 25089858
RWKVFKKIEKVGRNIRDGVIKAAPAIEVLGQAKAL
> CAMPCecP35, CAMPCecH, CAMPCecH31 | CAMPSQ287 | P83414 | 25089860
KWKVFKKIEKVGRNIRDGIVKAGPAIAVLGQAN
> CAMPCecP35, CAMPCecH, CAMPCecH31 | CAMPSQ288 | P83415 | 25089862
RWKVFKKIEKMGRNIRDGVIKAAPAIEVLGQAK
> CAMPCecP35, CAMPCecH, CAMPCecH35 | CAMPSQ471 | P82290 | 46395677
GRLKKLGKKIEGAGKRVFKAAEKALPVVAGVKALG
> CAMPCecH | CAMPSQ473 | Q86PR4 | 46395785
GRSKKLGKKIEKAGKRVFNAAQKGLPVAAGVQAL
> CAMPCecH | CAMPSQ474 | Q86PR5 | 46395786
GRLKKLGKKIEKAGKRVFNAVQKGLPVAAGVQAL
> CAMPCecH | CAMPSQ513 | Q06589 | 543976
GWLKKIGKKIERVGQHTRDATIQTIAVAQQAANVAATAR
> CAMPCecP31, CAMPCecH31 | CAMPSQ534 | P14661 | 58430587
SWLSKTAKKLENSAKKRISEGIAIAIQGGPR
> CAMPCecH | CAMPSQ625 | Q95VE8 | 74844670
GSPEFGWLKKIGKKIERVGQHTRDATIQTIGVAQQAANVAATLKG
> CAMPCecH, CAMPCecH31 | CAMPSQ812 | Q68KS5 | 51039035
KWKIFKKIEHMGQNIRDGLIKAGPAVQVVGQAATIYKG
> CAMPCecH | CAMPSQ883 | P21663 | 113829
VFIDILDKVENAIHNAAQVGIGFAKPFEKLINPK
> CAMPCecH | CAMPSQ886 | P14954 | 116086
GWLKKIGKKIERVGQHTRDATIQGLGIAQQAANVAATARG
> CAMPCecH | CAMPSQ887 | P14956 | 116089
GWLRKLGKKIERIGQHTRDASIQVLGIAQQAANVAATAR
> CAMPCecH, CAMPCecH35 | CAMPSQ1107 | P82592 | 205829605
GGLKKLGKKLEGAGKRVFNAAEKALPVVAGAKAL
> CAMPCecP37, CAMPCecH, CAMPCecP35, CAMPCecH31 | CAMPSQ1134 | Q9XZH0 | 46396048
RWKVFKKIEKMGRNIRDGIVKAGPAIEVLGSAKALGK
> CAMPCecP35, CAMPCecH, CAMPCecH31 | CAMPSQ1135 | Q9XZG9 | 46396047
RWKVFKKIEKVGRNVRDGIIKAGPAIGVLGQAKALG
> CAMPCecH, CAMPCecH35 | CAMPSQ1196 | Q963A8, Q9Y0X9 | 46395945
GGLKKLGKKLEGAGKRVFNAAEKALPVVAGAKALG
> CAMPCecH, CAMPCecH35 | CAMPSQ1197 | P81417 | 46397850
GGLKKLGKKLEGVGKRVFKASEKALPVAVGIKALG
> CAMPCecH | CAMPSQ1199 | P67792 | 54036841
AGWLRKLGKKIERIGQHTRDASIQVLGIAQQAANVAATAR
> CAMPCecH, CAMPCecH31, CAMPCecH35 | CAMPSQ1241 | P14666 | 116082
RWKIFKKIEKVGQNIRDGIVKAGPAVAVVGQAATI
> CAMPCecH | CAMPSQ1242 | O16829 | 55584020
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATARG
> CAMPCecH | CAMPSQ1243 | P84021 | 62901501
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1244 | Q06590 | 543977
GWLKKIGKKIERVGQHTRDATIQTIGVAQQAANVAATLK
> CAMPCecH | CAMPSQ1245 | O16825 | 25089574
VFIDILDKMENAIHKAAQAGIGIAKPIEKMILPK
> CAMPCecH, CAMPCecH31 | CAMPSQ1246 | P50720 | 1705741
RWKIFKKIERVGQNVRDGIIKAGPAIQVLGTAKAL
> CAMPCecH, CAMPCecH31 | CAMPSQ1247 | P50721 | 1705742
RWKFFKKIERVGQNVRDGLIKAGPAIQVLGAAKAL
> CAMPCecP37, CAMPCecH, CAMPCecP35, CAMPCecH31 | CAMPSQ1248 | P50722 | 1705743
RWKVFKKIEKVGRNIRDGVIKAGPAIAVVGQAKAL
> CAMPCecH, CAMPCecH31 | CAMPSQ1249 | P50723 | 1705744
RWKVFKKIEKVGRHIRDGVIKAGPAITVVGQATAL
> CAMPCecH, CAMPCecH37 | CAMPSQ1250 | P48821 | 1345724
PWNIFKEIERAVARTRDAVISAGPAVRTVAAATSVAS
> CAMPCecH, CAMPCecH35 | CAMPSQ1256 | Q9Y0Y0, Q963A9 | 15029359, 15029356
GGLKKLGKKLEGVGKRVFKASEKALPVLTGYKAIG
> CAMPCecH | CAMPSQ1259 | P83403 | 150421529
GWLKKIGKKIERVGQNTRDATVKGLEVAQQAANVAATVR
> CAMPCecH | CAMPSQ1260 | P84225 | 55584029
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1261 | P84224 | 55584028
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1262 | P84223 | 55584027
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1263 | P84222 | 55584026
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1264 | P67791 | 54036840
AGWLRKLGKKIERIGQHTRDASIQVLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1265 | P81688 | 10719933
GWLKKIGKKIERVGQHTRDATIQGLGVAQQAANVAATAR
> CAMPCecH, CAMPCecH31 | CAMPSQ1266 | O76146 | 46395627
GNFFKDLEKMGQRVRDAVISAAPAVDTLAKAKALGQ
> CAMPCecH | CAMPSQ1267 | O61281 | 25089852
VFVALILAIAIGQSEAGWLKKIGKKIERVGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1268 | O61272 | 25089850
QSEAGWLKKIGKKIERVGQHTRDATIQGLGVAQQAPNVAATAR
> CAMPCecH | CAMPSQ1269 | Q94990 | 2842738
GWLKKIGKKIERVGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ1271 | Q94557 | 2842734
GWLKKIGKKIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH, CAMPCecH35 | CAMPSQ1420 | A7L9C2 | 151574048
MNFTKLFIMVAIAVLLIAGIQPVEAAPRMEIGKRREKLGRNVFKAAKKALPVIAGYKALG
> CAMPCecH, CAMPCecH31 | CAMPSQ1432 | Q6QWJ2 | 41059818
MNFKKILFFVFACLVFTVTAAPEPRWKFFKKIEKVGQNIRDGIIKAGPAVAVVGQAAAIS
GK
> CAMPCecH31, CAMPCecH, CAMPCecH35 | CAMPSQ1465 | Q308S4 | 78093596
MNFSRALFYVFAVFLVCASVMAAPEPRWKIFKKIEKVGQNIRDGIIKAGPAVAVVGQAAT
IAHGK
> CAMPCecH, CAMPCecH35 | CAMPSQ1517 | Q963B0 | 46395947
GGLKKFGKKLEGVGKRVFKASEKALPVAVGIKALG
> CAMPCecH | CAMPSQ1554 | Q8WSV2 | 25089614
VFIDILDKMENAIHKAAQAGIGLAKPIENMILPK
> CAMPCecH | CAMPSQ1555 | Q8WSV4 | 25089619
VFIDILDKMENAIHKAAQAGIGIAKPIENMILPKLTK
> CAMPCecH | CAMPSQ1570 | P81686 | 10719932
AGWLRKLGKKIERIGQHTRDASIQVLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ2130 | Q8MUF3 | 161784301
RRFKKFLKKVEGAGRRVANAAQKGLPLAAGVKGLV
> CAMPCecH | CAMPSQ2131 | P84020 | 62901500
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ2132 | P84019 | 62901499
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH | CAMPSQ2133 | P84226 | 55584030
GWLKKLGKRIERIGQHTRDATIQGLGIAQQAANVAATAR
> CAMPCecH, CAMPCecH35 | CAMPSQ2134 | Q8MUF4 | 46395902
APRWKFGKRLEKLGRNVFRAAKKALPVIAGYKAL
> CAMPCecH, CAMPCecH35 | CAMPSQ2135 | Q86PR6 | 46395787
GGLKKFGKKLEGVGKRVFKASEKALPVVTGFKAL
> CAMPCecH | CAMPSQ2136 | P81685 | 10719931
GWLKKIGKKIERVGQHTRDATIQGLGVAQQAANVAATAR
> CAMPCecH, CAMPCecH31 | CAMPSQ2138 | P50724 | 1705753
RWKFFKKIEKVGQNIRDGIIKAGPAVAVVGQAASIT
> CAMPCecP31, CAMPCecH, CAMPCecH31 | CAMPSQ2191 | A6BMG0 | 260100777
MKLSNIFFFVFMAFFAVASVSAAPRWKPFKKLEKVGRNIRNGIIRYNGPAVAVIGQATSI
ARPTGK
> CAMPCecP37, CAMPCecH, CAMPCecH31 | CAMPSQ2214 | D0ETI6 | 260447210
NPFKELERAGQRVRDAIIS
> CAMPCecH, CAMPCecH31 | CAMPSQ2230 | D2KD89 | 281022082
MKLSNIFFFVFMAFFAVASVSAAPRWKPFKKLEKVGRNIRDGIIKAGPAVAVIGQATSIA
RPTGK
2015, © Biomedical Informatics Centre, NIRRH, Mumbai