Gene EcSMS35_2045 details

Gene Information

Locus tag: EcSMS35_2045
Symbol: flgL
ID: 6142693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2066345
End bp: 2067298
Gene Length: 954 bp
Protein Length: 317 aa
Translation table: 11
GC content: 51%
IMG OID: 641616921
Product: flagellar hook-associated protein FlgL
Protein accession: YP_001744097
Protein GI: 170680479
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID: [TIGR02550] flagellar hook-associated protein 3
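
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2066345
END_BP = 2067298
GENE_LENGTH_BP = 954
PROTEIN_LENGTH_AA = 317


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 954
    print(expected_protein_length(GENE_LENGTH_BP))  # 317
```

Under these assumptions, 2067298 - 2066345 + 1 gives 954 bp, and 954 / 3 - 1 gives 317 aa, which agrees with the listed values.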


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0140515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0179401
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTCA GTACCCAGAT GATGTACCAG CAAAACATGC GCGGCATCAC CAATTCTCAG 
GCAGAATGGA TGAAGTACGG TGAACAGATG TCGACGGGTA AGCGAGTCGT TAACCCTTCA
GATGATCCCA TTGCTGCATC ACAAGCTGTT GTTCTCTCCC AGGCACAGGC GCAAAACAGC
CAGTACACGC TGGCGCGTAC TTTCGCCACT CAAAAAGTGT CACTGGAAGA GAGTGTACTT
AGCCAGGTCA CCACTGCTAT CCAGAATGCT CAGGAAAAAA TTGTCTACGC CAGCAATGGC
ACCTTGAGTG ACGATGACCG GGCCTCGCTG GCTACGGATA TTCAGGGGCT TCGTGACCAG
TTGCTGAATC TGGCGAACAC CACTGACGGT AACGGGCGCT ACATTTTTGC CGGTTATAAA
ACAGAATCCG CACCGTTTAG CGAGGTGGAT GGCAAATACG AAGGCGGAGC AGAAAGTATT
AAACAACAGG TCGATGCTTC GCGTTCGATG GTGATAGGGC ACACGGGGGA CAAAATTTTC
GACAGTATTA CCAGCAACGC GGTAGCGGAA CCAGACGGTA GCGCTTCTGA AACCAATCTT
TTTGCCATGC TGGATAGCGC CATCGCAGCC CTGAAAACGC CGGTCGCGGA TAGCGAAGCG
GATAAAGAAA CCGCCGCTGC GGCACTGGAT AAAACCAACC GCGGACTGAA AAACTCGCTG
AACAATGTGC TGACTGTTCG CGCGGAATTA GGCACGCAGC TGAACGAACT GGAGTCGCTG
GATTCATTAG GTAGCGATCG CGCTTTAGGG CAAACGCAGC AGATGAGCGA TCTGGTTGAT
GTGGACTGGA ATGCAACTAT TTCATCTTAC ATCATGCAGC AAACGGCATT GCAGGCGTCG
TATAAAGCAT TTACCGATAT GCAGGGATTG TCGCTCTTCC AGCTCAACAA ATAA
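
The GC content field can, in principle, be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence length; `gc_fraction` is a hypothetical helper name. Only the first 60 bases are used here as a short worked input, so the printed value is for that fragment and not for the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G/C bases; whitespace (the 10-base grouping) is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence in this record (60 bases).
FIRST_LINE = "ATGCGTTTCA GTACCCAGAT GATGTACCAG CAAAACATGC GCGGCATCAC CAATTCTCAG"

if __name__ == "__main__":
    # 29 of these 60 bases are G or C.
    print(f"{gc_fraction(FIRST_LINE):.3f}")
```

Passing the complete 954-base sequence to the same function would give the figure to compare with the listed GC content.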
 
Protein sequence
MRFSTQMMYQ QNMRGITNSQ AEWMKYGEQM STGKRVVNPS DDPIAASQAV VLSQAQAQNS 
QYTLARTFAT QKVSLEESVL SQVTTAIQNA QEKIVYASNG TLSDDDRASL ATDIQGLRDQ
LLNLANTTDG NGRYIFAGYK TESAPFSEVD GKYEGGAESI KQQVDASRSM VIGHTGDKIF
DSITSNAVAE PDGSASETNL FAMLDSAIAA LKTPVADSEA DKETAAAALD KTNRGLKNSL
NNVLTVRAEL GTQLNELESL DSLGSDRALG QTQQMSDLVD VDWNATISSY IMQQTALQAS
YKAFTDMQGL SLFQLNK
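
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration and not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as starts, and this gene begins with ATG). `translate` and `CODON_TABLE` are hypothetical names, and only the first and last lines of the gene sequence are used as input.

```python
BASES = "TCAG"
# Standard amino-acid assignments for the 64 codons in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
FIRST_LINE = "ATGCGTTTCA GTACCCAGAT GATGTACCAG CAAAACATGC GCGGCATCAC CAATTCTCAG"
LAST_LINE = "TATAAAGCAT TTACCGATAT GCAGGGATTG TCGCTCTTCC AGCTCAACAA ATAA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MRFSTQMMYQQNMRGITNSQ
    print(translate(LAST_LINE))   # YKAFTDMQGLSLFQLNK
```

The last line can be translated on its own because the 900 bases before it are a whole number of codons, so it starts in frame. Both outputs match the corresponding ends of the protein sequence listed above.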