Gene ECH74115_1455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1455 
SymbolflgE 
ID6967515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1435304 
End bp1436509 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID643385428 
Productflagellar hook protein FlgE 
Protein accessionYP_002269922 
Protein GI209395987 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.00159718 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT 
GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACCGATG GCACGACCAC CAACACCGGG CGTGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA TAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAATCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACTAACAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCGTCGA TGCAGATCAA CCTGAATTCC
AGCGATCCGC TTCCCTCTGT TAACGCATTT GATGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT
GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAACAGGT
ACAGCCGAGC CTGCAATGAA GCTGGTGTTT AATGCCAATG GCGTTCTGAC CTCAAATCCA
ACAGAGAATA TTACCACCGG CGCAATTAAC GGCGCAGAAC CCGCCACGTT TAGCCTGAGC
TTCCTCAACT CCATGCAGCA AAATACCGGC GCTAACAACA TTGTGGCAAC CACCCAGAAT
GGCTACAAAC CGGGCGATCT GGTGAGTTAT CAAATCAATG ATGACGGTAC GGTTGTCGGC
AACTATTCCA ACGAACAAAC CCAACTGCTG GGGCAGATTG TACTGGCGAA CTTTGCCAAC
AACGAAGGTC TGGCATCCGA AGGCGACAAC GTCTGGTCTG CGACGCAATC TTCTGGCGTG
GCGCTGTTGG GGACAGCCGG GACGGGCAAC TTTGGCACCC TGACCAACGG TGCGTTGGAA
GCGTCCAACG TCGATCTCAG TAAAGAACTG GTCAATATGA TCGTTGCCCA GCGTAACTAT
CAGTCTAACG CCCAGACCAT CAAAACCCAG GACCAGATCC TCAACACGCT GGTTAACTTA
CGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVNAF DASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPTG TAEPAMKLVF NANGVLTSNP
TENITTGAIN GAEPATFSLS FLNSMQQNTG ANNIVATTQN GYKPGDLVSY QINDDGTVVG
NYSNEQTQLL GQIVLANFAN NEGLASEGDN VWSATQSSGV ALLGTAGTGN FGTLTNGALE
ASNVDLSKEL VNMIVAQRNY QSNAQTIKTQ DQILNTLVNL R