Gene EcSMS35_0250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0250 
SymbolfliF 
ID6146331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp263135 
End bp264781 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content58% 
IMG OID641615148 
Productflagellar MS-ring protein 
Protein accessionYP_001742357 
Protein GI170682472 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1766] Flagellar biosynthesis/type III secretory pathway lipoprotein 
TIGRFAM ID[TIGR00206] flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAC AAATAAAGAA GTTAACCCAG GCGTTTCCGG CGTTTCGACT GCGTCTTGCA 
GATAACAAAC GCTGGGCATT GATGGCGGGA GTGGGGCTTG CCGTTGCGGC GACGGCAATT
ATCGTCAGCG TACTGTGGAC CGGTAATCGC GGCTACGTCT CGCTCTACGG ACGCCAGGAA
AATCTGCCCG TGTCGCAGAT TGTCACCGTG CTGGACGGCG AAAAGCTCAG CTACCGCATC
GACCCGCAAA GCGGGCAGAT TCTGGTGCCG GAAGATGAAC TGTCGAAAAC GCGTATGACC
CTTGCCGCGA AAGGCGTGCA GGCGATCTTG CCCAGCGGCT ACGAGCTGAT GGACAAAGAC
GAAGTGCTCG GCAGCAGCCA GTTCGTGCAG AACGTGCGCT ACAAACGCAG CCTCGAAGGG
GAGCTGGCGC AGAGCATTAT GTCGCTGGAC GCGGTGGAGA GCGCGCGGGT GCATCTGGCG
CTCAACGAAG AGAGCTCGTT TGTGGTCAGC GATGAGCCGC AAAACAGCGC CTCGGTGGTG
GTGCGTCTGC ATTACGGCGC GAAGCTGAAT ATGGACCAGG TGAACGCCAT TGTGCATCTG
GTTTCCGGCA GCATTCCGGG GCTGCACGCC AGCAAAGTTA GCGTCGTCGA TCAGGCGGGA
AATCTGCTGA CCGACGGCAT TGGCGCGGGC GAAGCGGTTT CAGCCGCTAC CCGTAAACGC
GATCAGATCC TCAAAGATAT TCAGGACAAA ACCCGCGCCA GCGTGGCGAA CGTGCTGGAT
TCGCTGGTCG GCAGCGGGAA TTACCGCGTC AGCGTCATGC CGGATCTCGA CCTCAGTAAT
ATCGATGAAA CTCAGGAACA CTACGGCGAC GCGCCGAAAA TCAACCGCGA AGAAAACGTG
CTGGACAGCG ATACCAATCA GGTGGCGATG GGTGTGCCCG GTTCGCTCAG TAACCGTCCG
CCGATTGCAG CGAATCAGAT GACCAACGGT ACGGAAGAAA ACCGCTCGCC GGAAGCCCTT
TCGAAACACA GCGAAAGCAA GCGCGATTAC TCTTACGACC GCAGCGTCCA GCATATTCAG
CATCCCGGCT TTGCGGTGAA ACGCCTTAAC GTGGCGGTGG TGCTCAATCA AAACGCTCCG
GCGCTGAAAA ACTGGAAGCC GGAGCAGACC ACGCAACTGA CCGCGTTGCT GAACAATGCC
GCCGGGATCG ACGTCCAGCG CGGTGACAAT CTCACCTTAT CGCTGCTTAA CTTTGTGCCG
CAGGCGGTGC CGGTCGAACC AATTATTCCG CTGTGGAAAG ACGACAGCGT GCTGGCCTGG
GTGCGGCTGA TTGGCTGCGG CCTGCTGGCG CTGTTGTTGC TGTTCTTTGT GGTGCGCCCG
GTAATGAAAC GGCTGACGGC GGTACGTGCG CCGGTTATCA CACCAGAACC GGAAGCTGTC
AGCGAACCGT GGATTGCCAT GCCGGAAGAG GAACGCAAAA ACGTCGATCT GCCGTCGCTG
CCCGGCGATG ACAGCCTGCC GTCCCAGAGT TCCGGCCTCG AAGTAAAACT CGAGTTCCTG
CAAAAACTGG CCATGAGCGA CACCGATCGC GTAGCCGAAG TTCTCAGACA ATGGATCACC
AGCAATGAGC GAATTGACAA CAAATAA
 
Protein sequence
MNAQIKKLTQ AFPAFRLRLA DNKRWALMAG VGLAVAATAI IVSVLWTGNR GYVSLYGRQE 
NLPVSQIVTV LDGEKLSYRI DPQSGQILVP EDELSKTRMT LAAKGVQAIL PSGYELMDKD
EVLGSSQFVQ NVRYKRSLEG ELAQSIMSLD AVESARVHLA LNEESSFVVS DEPQNSASVV
VRLHYGAKLN MDQVNAIVHL VSGSIPGLHA SKVSVVDQAG NLLTDGIGAG EAVSAATRKR
DQILKDIQDK TRASVANVLD SLVGSGNYRV SVMPDLDLSN IDETQEHYGD APKINREENV
LDSDTNQVAM GVPGSLSNRP PIAANQMTNG TEENRSPEAL SKHSESKRDY SYDRSVQHIQ
HPGFAVKRLN VAVVLNQNAP ALKNWKPEQT TQLTALLNNA AGIDVQRGDN LTLSLLNFVP
QAVPVEPIIP LWKDDSVLAW VRLIGCGLLA LLLLFFVVRP VMKRLTAVRA PVITPEPEAV
SEPWIAMPEE ERKNVDLPSL PGDDSLPSQS SGLEVKLEFL QKLAMSDTDR VAEVLRQWIT
SNERIDNK