Gene EcSMS35_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2052 
SymbolflgE 
ID6144330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2073514 
End bp2074719 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID641616928 
Productflagellar hook protein FlgE 
Protein accessionYP_001744104 
Protein GI170683341 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.302696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT 
GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACCGATG GCACGACCAC CAACACCGGG CGAGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA CAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACCAATAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCGTCGA TGCAGATCAA CCTGAATTCC
AGTGATCCGC TTCCCTCTGT TAAAGCATTC GATGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGATAGTCAG GGTAATGCTC ATGATATGAG CGTTTATTTT
GTGAAGACCG GGGATAACAA CTGGGATGTT TACACCCTGG ATAGCAGTGA TCCAACAGGC
ACAGCTAACC CTGCAACGAC GCTGGTGTTT AATGCCAATG GCGTTCTGAC CTCAGATCCA
ACAAAGGATA TTACCACCGG CGCAATTAAC GGCGCAGATC CCGCCACGTT TAGCCTGAGC
TTCCTCAACT CCATGCAGCA AAATACCGGC GCGAACAACA TTGTGGCAAC CACCCAGAAC
GGCTACAAAC CGGGCGATCT GGTGAGTTAT CAAATCAATG ATGACGGTAC GGTTGTCGGC
AACTATTCCA ACGAACAAAC CCAACTGCTG GGGCAGATTG TACTGGCGAA CTTTGCCAAC
AACGAAGGTC TGGCATCCGA AGGCGACAAC GTCTGGTCTG CGACGCAATC TTCTGGCGTG
GCGCTGTTGG GGACAGCCGG GACGGGCAAC TTTGGCACCC TGACCAACGG TGCGCTGGAA
GCGTCCAACG TCGATCTCAG TAAAGAACTG GTCAATATGA TCGTTGCCCA GCGTAACTAT
CAGTCTAACG CCCAGACCAT CAAAACCCAG GACCAGATCC TCAACACGCT GGTTAACTTA
CGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVKAF DASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWDV YTLDSSDPTG TANPATTLVF NANGVLTSDP
TKDITTGAIN GADPATFSLS FLNSMQQNTG ANNIVATTQN GYKPGDLVSY QINDDGTVVG
NYSNEQTQLL GQIVLANFAN NEGLASEGDN VWSATQSSGV ALLGTAGTGN FGTLTNGALE
ASNVDLSKEL VNMIVAQRNY QSNAQTIKTQ DQILNTLVNL R