Gene EcSMS35_1241 details


Gene Information

Locus tag: EcSMS35_1241
Symbol: fliK
ID: 6146390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1239857
End bp: 1240984
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 55%
IMG OID: 641616119
Product: flagellar hook-length control protein
Protein accession: YP_001743302
Protein GI: 170682141
COG category: [N] Cell motility
COG ID: [COG3144] Flagellar hook-length control protein
TIGRFAM ID:
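
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted as a residue; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1239857
end_bp = 1240984
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, span_bp == gene_length_bp)
print(residues, residues == protein_length_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.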


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.896281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGCT TAGCGCCCTT AATTACCGCC GACGTTGACA CCACCACATT GCCTGGCGGC 
AAAGCCAGCG ATGCTGCACA AGATTTTCTC GCGTTGTTGA GCGAAGCATT AGCAGGCGAG
ACAACTACCG ACAAAGCGGC CCCCCAGTTG CTGGTGGCAA CAGATAAGCC CACGACAAAA
GGCGAGCCGC TGGTCAGCGA GATTCTTGCC GATGCGCAAC AAGCGGATTT ACTGATCCCT
GTGGATGAAA CACCGCCTGT CATCAACGAC GAACAATCCA CATCAACACC ATTAACCACC
GCTCAAACGA TGACGTTGGC TGCGGTGGCT GGCAACAATA CGGCAAAAGA CGAAAAAGCG
GATGATCTGA ATGAAGACGT CACCGCAAGC CTGAGCGCCC TTTTTGCGAT GTTGCCGGGT
TTTGACAATA CGCCCAAAGT GACTGATGCG CCGTCAACCG TGTTACCGAC AGAGAAACCA
ACGCTCTTCA CAAAACTGAC TTCTGAGCAA CTCACAACAG CACAGCCTGA TGACGCCCCC
GGCACACCAG CTCAGCCATT AACACCGCTG GTAGCAGAAG CCCAGAGTAA AGCGGAAGTC
ATCAGCACAC CTTCACCGGT GACCGCTGCC GCCAGCCCGC TAATCACTCC ACACCAGACA
CAGCCACTGC CCACCGTCGC CGCACCTGTT TTGAGTGCAC CGCTGGGTTC TCACGAATGG
CAACAATCAT TAAGCCAGCA TATTTCGCTG TTCACCCGCC AGGGGCAACA AAGTGCAGAG
TTGCGACTGC ACCCACAGGA TTTAGGTGAA GTGCAAATCT CCCTCAAAGT GGATGATAAC
CAGGCTCAAA TCCAGATGGT TTCACCGCAT CAGCATGTAC GCGCCGCCCT GGAAGCAGCG
CTGCCGGTAC TGCGTACGCA GCTGGCCGAA AGTGGCATTC AGTTAGGGCA AAGCAACATC
AGTGGCGAAA GCTTTAGTGG TCAGCAGCAG GCCGCTTCCC AACAACAGCA AAGCCAACGC
ACAGTAAACC ATGAACCTCT GGCGGGGGAA GACGACGATA CGCTTCCGGT TCCCGTCTCT
TTACAAGGGC GCGTAACAGG CAACAGCGGC GTTGATATTT TCGCCTAA
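
The GC content field can be recomputed from a sequence listed in this blocked layout. The following is a minimal sketch, assuming the sequence is pasted as text with spaces and line breaks between the 10-base blocks; `gc_percent` is a hypothetical helper name, not part of any database tooling.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as a small example input.
first_line = "ATGATTCGCT TAGCGCCCTT AATTACCGCC GACGTTGACA CCACCACATT GCCTGGCGGC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1128 bp sequence instead of the first line gives the value to compare against the GC content field.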
 
Protein sequence
MIRLAPLITA DVDTTTLPGG KASDAAQDFL ALLSEALAGE TTTDKAAPQL LVATDKPTTK 
GEPLVSEILA DAQQADLLIP VDETPPVIND EQSTSTPLTT AQTMTLAAVA GNNTAKDEKA
DDLNEDVTAS LSALFAMLPG FDNTPKVTDA PSTVLPTEKP TLFTKLTSEQ LTTAQPDDAP
GTPAQPLTPL VAEAQSKAEV ISTPSPVTAA ASPLITPHQT QPLPTVAAPV LSAPLGSHEW
QQSLSQHISL FTRQGQQSAE LRLHPQDLGE VQISLKVDDN QAQIQMVSPH QHVRAALEAA
LPVLRTQLAE SGIQLGQSNI SGESFSGQQQ AASQQQQSQR TVNHEPLAGE DDDTLPVPVS
LQGRVTGNSG VDIFA
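
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI amino-acid string; translation table 11 uses the same codon-to-residue assignments as the standard code and differs in which codons may act as starts. It assumes the reading frame begins at the first base and does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residue assignments in NCBI order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    # Step through complete codons; a trailing partial codon is ignored.
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGATTCGCT TAGCGCCCTT AATTACCGCC GACGTTGACA CCACCACATT GCCTGGCGGC"
print(translate(first_line))  # MIRLAPLITADVDTTTLPGG
```

The output matches the first 20 residues of the protein sequence above.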