Gene EcolC_3382 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3382 
SymbolfliF 
ID6067566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3701393 
End bp3703039 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content59% 
IMG OID641602796 
Productflagellar MS-ring protein 
Protein accessionYP_001726328 
Protein GI170021374 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1766] Flagellar biosynthesis/type III secretory pathway lipoprotein 
TIGRFAM ID[TIGR00206] flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.275908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAC AAATAAAGAA GTTAACCCAG GCGTTTCCGG CGTTTCGACT GCGTCTTGCA 
GATAACAAAC GCTGGGCGCT GATGGCGGGA GTGGGGCTTG CCGTTGCGGC AACGGCGATT
ATCGTCAGCG TGCTGTGGAC CGGCAATCGC GGCTACGTCT CGCTCTACGG CCGCCAGGAA
AACCTGCCCG TTTCGCAGAT TGTCACCGTG CTGGACGGCG AAAAGCTCAG CTACCGCATC
GACCCGCAGA GCGGGCAGAT TCTGGTGCCG GAAGATGAAC TGTCGAAAAC GCGTATGACC
CTTGCCGCGA AAGGCGTGCA GGCGATATTG CCCAGCGGCT ACGAGCTGAT GGACAAAGAC
GAAGTGCTCG GCAGCAGCCA GTTCATGCAG AACGTGCGCT ACAAACGTAG TCTCGAAGGG
GAGCTGGCGC AGAGCATTAT GTCGCTGGAC GCGGTGGAGA GCGCGCGGGT GCATCTGGCG
CTCAACGAAG AAAGCTCGTT TGTGGTCAGC GATGAGCCGC AAAACAGCGC CTCGGTGGTG
GTGCGTCTGC ACTACGGCGC GAAGCTGAAT ATGGACCAGG TGAACGCCAT CGTGCATCTG
GTTTCCGGCA GTATTCCGGG GCTGCAGGCC AGCAAAGTCA GCGTCGTCGA TCAGGCGGGA
AATCTGCTAA CCGACGGCAT TGGCGCGGGC GAGGCGGTCT CTGCCGCTAC CCGTAAACGC
GATCAGATCC TCAAAGACAT TCAGGACAAA ACCCGCGCCA GCGTGGCGAA CGTGCTGGAT
TCGCTGGTCG GCAGCAGGAA TTACCGTGTC AGCGTGATGC CGGATCTCGA CCTCAGCACC
ATCGACGAAA CTCAGGAACA CTACGGAGAC GCGCCGAAAA TAAACCGCGA AGAGAGCGTG
CTGGACAGCG ACACCAATCA GGTGGCGATG GGTGTGCCCG GCTCTCTCAG CAACCGTCCG
CCGGTTGCGG CGAATCAGAT GACCAACGGC ACGGAAGAAA ATCGCTCGCC GGAAGCGTTA
TCCAAACACA GCGAAAGCAA GCGCGATTAC TCTTACGACC GCAGCGTCCA GCATATTCAG
CATCCCGGCT TTGCGGTGAA ACGCCTCAAC GTGGCGGTGG TGCTCAATCA AAACGCCCCG
GCGCTGAAAA ACTGGAAGCC GGAACAGACC ACGCAGCTTA CCGCGTTGCT GAACAATGCC
GCCGGGATCG ACGCGCAACG TGGAGATAAC CTGACGCTTT CACTGCTTAA CTTTGTCCCG
CAGGTGGTCC CGGTCGAACC GGTGATCCCG CTGTGGAAAG ATGACAGCGT GCTGGCCTGG
GTGCGGCTGA TTGGCTGCGG CCTGCTGGCG CTGTTGTTGC TGTTCTTTGT GGTGCGCCCG
GTAATGAAAC GCCTGACGGC GGTGCGTGCG CCGGTTATCA CACCAGAACC GGAAGCTGTC
AGCGAACCGT GGATCGCTAT GCCGGAAGAG GAGCGCAAAA ACGTCGATCT GCCGTCGCTG
CCCGGCGATG ACAGCCTGCC TTCGCAGAGT TCCGGCCTCG AAGTGAAACT CGAGTTCCTG
CAAAAACTGG CCATGAGCGA CACCGATCGC GTAGCCGAAG TTCTCAGACA ATGGATCACC
AGCAATGAGC GAATTGACAA CAAATAA
 
Protein sequence
MNAQIKKLTQ AFPAFRLRLA DNKRWALMAG VGLAVAATAI IVSVLWTGNR GYVSLYGRQE 
NLPVSQIVTV LDGEKLSYRI DPQSGQILVP EDELSKTRMT LAAKGVQAIL PSGYELMDKD
EVLGSSQFMQ NVRYKRSLEG ELAQSIMSLD AVESARVHLA LNEESSFVVS DEPQNSASVV
VRLHYGAKLN MDQVNAIVHL VSGSIPGLQA SKVSVVDQAG NLLTDGIGAG EAVSAATRKR
DQILKDIQDK TRASVANVLD SLVGSRNYRV SVMPDLDLST IDETQEHYGD APKINREESV
LDSDTNQVAM GVPGSLSNRP PVAANQMTNG TEENRSPEAL SKHSESKRDY SYDRSVQHIQ
HPGFAVKRLN VAVVLNQNAP ALKNWKPEQT TQLTALLNNA AGIDAQRGDN LTLSLLNFVP
QVVPVEPVIP LWKDDSVLAW VRLIGCGLLA LLLLFFVVRP VMKRLTAVRA PVITPEPEAV
SEPWIAMPEE ERKNVDLPSL PGDDSLPSQS SGLEVKLEFL QKLAMSDTDR VAEVLRQWIT
SNERIDNK