Gene EcHS_A2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2043 
SymbolfliK 
ID5593033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2038126 
End bp2039253 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content55% 
IMG OID640921187 
Productflagellar hook-length control protein 
Protein accessionYP_001458732 
Protein GI157161414 
COG category[N] Cell motility 
COG ID[COG3144] Flagellar hook-length control protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value0.436184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGCT TAGCGCCCTT GATTACCGCC GACGTTGACA CCACCACATT GTCTGGCGGC 
AAAGCCAGCG ATGCTGCACA AGATTTTCTC GCGTTGTTGA GCGAAGCATT AACAGGCGAG
GCAACAACCG ACAAAGCGAC TCCCCAGTTG CTGGTGGCAA CAGATAAGCC CACGACAAAA
GGCGAGCCGC TGGTCAGCGA TATTGTTTCC GACGCGCAAC AAGCGGATTT ACTGATCCCT
GTGGATGAAA CACCGCCTGT CATCAACGAC GAACAATCCA CATCAACACC GTTAAACACC
GCTCAGACGA TAACGTTGGC TGCGGCGGCT GACAACAATA CGGCAAAAGA CGAAAAAGCG
GATGATCTGA ATGAAGACGT CACCGCCAGC CTGAGTGCCC TTTTTGCGAT GTTGCCGGGT
TTTGACAATA CGCCCAAAGT GACTGATGCA CCGTCAACCG TATTACCGAC AGAGAAACCA
ACGCTCTTCA CAAAACTGAC TTCTGCGCAA CTCACAACAG CACAGCCTGA TGACGCCCCC
GGCACGCCAG CTCAGCCATT AACACCGCTG ATAGCAGAAG CCCAGAGTAA AGCGGAAATC
ATCAGCACGC CTTCGCCGGT GACCGCTGCC GCCAGCCCGC TAATCACTCC ACACCAGACA
CAGCCACTGC CCACCGTCGC CGCGCCTGTT TTGAGTGCAC CGCTGGGTTC TCACGAATGG
CAACAATCAT TAAGCCAGCA TATTTCGCTG TTCACCCGCC AGGGGCAACA AAGTGCAGAG
TTGCGTCTGC ACCCGCAGGA TTTAGGTGAG GTGCAAATCT CCCTCAAAGT GGATGATAAC
CAGGCGCAAA TCCAGATGGT TTCACCGCAT CAGCATGTAC GCGCCGCCCT GGAAGCAGCG
CTGCCGGTAC TGCGTACGCA GCTGGCCGAA AGTGGCATTC AGTTAGGGCA AAGCAACATC
AGTGGCGAAA GCTTTAGTGG TCAGCAGCAG GCCGCTTCCC AACAACAGCA AAGCCAACGC
ACAGTAAACC ATGAACCTCT GGCGGGGGAA GAAGACGATA CGCTTCCGGT TCCCGTCTCT
TTACAAGGGC GCGTAACAGG CAACAGCGGC GTTGATATTT TCGCCTAA
 
Protein sequence
MIRLAPLITA DVDTTTLSGG KASDAAQDFL ALLSEALTGE ATTDKATPQL LVATDKPTTK 
GEPLVSDIVS DAQQADLLIP VDETPPVIND EQSTSTPLNT AQTITLAAAA DNNTAKDEKA
DDLNEDVTAS LSALFAMLPG FDNTPKVTDA PSTVLPTEKP TLFTKLTSAQ LTTAQPDDAP
GTPAQPLTPL IAEAQSKAEI ISTPSPVTAA ASPLITPHQT QPLPTVAAPV LSAPLGSHEW
QQSLSQHISL FTRQGQQSAE LRLHPQDLGE VQISLKVDDN QAQIQMVSPH QHVRAALEAA
LPVLRTQLAE SGIQLGQSNI SGESFSGQQQ AASQQQQSQR TVNHEPLAGE EDDTLPVPVS
LQGRVTGNSG VDIFA