Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4137 |
Symbol | |
ID | 6967744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3827232 |
End bp | 3828353 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643387889 |
Product | surface presentation of antigens protein SpaS |
Protein accession | YP_002272329 |
Protein GI | 209399671 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1377] Flagellar biosynthesis pathway, component FlhB |
TIGRFAM ID | [TIGR01404] type III secretion protein, YscU/HrpY family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.000465227 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAATA AAACTGAAAA GCCAACACAA AAGAAACTGC AGGATGCTTC TAAAAAAGGG CAAATTCTAA AAAGTAGAGA CTTAACCGTT TCTGTAATAA TGCTTGTGGG TACTTTATAT CTCGGATATG TTTTTGATGT GCATCACATT ATGTCGATTC TTGAATATAT CCTTGATCAT AACGCTAAAC CGGATATTTG GGACTATTTT AAAGCTATGG GGATTGGTTG GTTGAAAACG ATCATTCCTT TTTTGCTGGT TTGCATGTTC ACAACAATAC TTGTCTCCTG GTTTCAAAGT AAAATGCAAT TAGCAACTGA AGCTGTAAAA TTAAAGTTTG ATTCATTGAA TCCAGTAAAT GGTTTAAAGC GTATATTTGG CTTAAAAACC GTAAAAGAAT TTGTTAAAGC AATTCTTTAT ATTATTTTTT TTGCATTGGA GATCAAAGTA TTTTGGAGTA ATCATAAATC ACTGCTTTTT AAAACTCTTG ATGGAGATAT CATATCTTTA TTATCAGATT GGGGAGAGAT GCTATTCCTT CTCATACTGT ATTGTCTCGG CAGTATGATA ATTGTCTTAA TTTTTGATTT TATCGCTGAA TATTTTTTAT TTATGAAAGA TATGAAAATG GATAAACAAG AAGTTAAAAG AGAATACAAG GAACAAGAAG GAAATCCTGA AATTAAGTCT AAACGCAGAG AGCGCCATCA GGAAATTCTT TCTGAGCAAT TGAAATCTGA TGTCAGTAAT AGCCGTTTGA TGATTGCCAA CCCTACTCAC ATTGCAATAG GGATATACTT TAAGCCACAT CTGTCACCTA TTCCATTGAT TTCTGTAAGA GAAACTAATG AGGTAGCATT AGCTGTAAGG AAATATGCAA AGGAAATCGG GATACCAATT ATTACAGATA AAAAATTAGC ACGAAAAATT TATGCTACCC ATCGTCGCTA CGATTATGTT AGCTTCGAAA ATATAGATGA AATATTACGT CTTCTGCTGT GGCTTGAAGA TGTGGAGAAT GCTGGACAAC CTGTTCCAGA TGAACAGCTC TCTTCAGAAG ATAAATATAT TGAGGGTGAA GACACAAAAA GCGAGAATAA TGACAATAAT TTAAAAAATT AA
|
Protein sequence | MANKTEKPTQ KKLQDASKKG QILKSRDLTV SVIMLVGTLY LGYVFDVHHI MSILEYILDH NAKPDIWDYF KAMGIGWLKT IIPFLLVCMF TTILVSWFQS KMQLATEAVK LKFDSLNPVN GLKRIFGLKT VKEFVKAILY IIFFALEIKV FWSNHKSLLF KTLDGDIISL LSDWGEMLFL LILYCLGSMI IVLIFDFIAE YFLFMKDMKM DKQEVKREYK EQEGNPEIKS KRRERHQEIL SEQLKSDVSN SRLMIANPTH IAIGIYFKPH LSPIPLISVR ETNEVALAVR KYAKEIGIPI ITDKKLARKI YATHRRYDYV SFENIDEILR LLLWLEDVEN AGQPVPDEQL SSEDKYIEGE DTKSENNDNN LKN
|
| |