Gene EcolC_1699 details

Gene Information

Locus tag: EcolC_1699
Symbol:
ID: 6066712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1897606
End bp: 1898733
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 55%
IMG OID: 641601113
Product: flagellar hook-length control protein
Protein accession: YP_001724678
Protein GI: 170019724
COG category: [N] Cell motility
COG ID: [COG3144] Flagellar hook-length control protein
TIGRFAM ID:
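
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1897606, 1898733)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both results match the Gene Length and Protein Length fields listed for EcolC_1699.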


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0151155
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGCT TAGCGCCCTT AATTACCGCC GACGTTGACA CCACCACATT GCCTGGCGGC 
AAAGCCAGCG ATGCTGCACA AGATTTTCTC GCGTTGTTGA GCGAAGCATT AGCAGGCGAG
ACAACTACCG ACAAAGCGGC CCCCCAGTTG CTGGTGGCAA CAGATAAGCC CACGACAAAA
GGCGAGCCGC TGGTCAGCGA GATTCTTGCC GATGCGCAAC AAGCGGATTT ACTGATCCCT
GTGGATGAAA CACCGCCTGT CATCAACGAC GAACAATCCA CATCAACACC ATTAACCACC
GCTCAAACGA TGACGATGGC TGCGGTGGCT GGCAACAATA CGGCAAAAGA CGAAAAAGCG
GATGATCTGA ATGAAGACGT CACCGCAAGC CTGAGCGCCC TTTTTGCGAT GTTGCCGGGT
TTTGACAATA CGCCCAAAGT GACTGATGTG CCGTCAACCG TGTTACCGGC AGAGAAACCA
ACGCTATTCA CAAAACTGAC TTCTGCGCAA CTCACAACAG CACAGCCTGA TGATGCCCCC
GGCACGCCAG CTCAGCCATT AACACCGCTG GTAGCAGAAG CCCAGAGTAA AGCGGAAGTC
ATCAGCACAC CTTCACCGGT GACCGCTGCC GCCAGCCCGC TAATCACTCC ACACCAGACA
CAGCCACTGC CCACCGTCGC CGCGCCTGTG TTGAGTGCAC CGCTGGGTTC TCACGAATGG
CAACAATCAT TAAGCCAGCA TATTTCGCTG TTCACCCGCC AGGGGCAACA AAGTGCAGAG
TTGCGTCTGC ACCCGCAGGA TTTAGGTGAA GTGCAAATCT CCCTCAAAGT GGATGATAAC
CAGGCGCAAA TCCAGATGGT TTCACCGCAT CAACACGTAC GCGCCGCCCT GGAAGCAGCG
CTTCCGGTAC TGCGAACGCA GCTGGCCGAA AGTGGCATTC AGTTAGGGCA AAGCAACATC
AGTGGCGAAA GCTTTAGTGG TCAGCAGCAG GCCGCTTCCC AACAACAGCA AAGCCAACGC
ACAGTAAACC ATGAACCTCT GGCGGGGGAA GACGACGATA CGCTTCCGGT TCCCGTCTCT
TTACAAGGGC GTGTAACAGG CAACAGCGGC GTTGATATTT TCGCCTAA
 
Protein sequence
MIRLAPLITA DVDTTTLPGG KASDAAQDFL ALLSEALAGE TTTDKAAPQL LVATDKPTTK 
GEPLVSEILA DAQQADLLIP VDETPPVIND EQSTSTPLTT AQTMTMAAVA GNNTAKDEKA
DDLNEDVTAS LSALFAMLPG FDNTPKVTDV PSTVLPAEKP TLFTKLTSAQ LTTAQPDDAP
GTPAQPLTPL VAEAQSKAEV ISTPSPVTAA ASPLITPHQT QPLPTVAAPV LSAPLGSHEW
QQSLSQHISL FTRQGQQSAE LRLHPQDLGE VQISLKVDDN QAQIQMVSPH QHVRAALEAA
LPVLRTQLAE SGIQLGQSNI SGESFSGQQQ AASQQQQSQR TVNHEPLAGE DDDTLPVPVS
LQGRVTGNSG VDIFA
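
As a rough way to check the sequence blocks against the GC content and Translation table fields, the sketch below strips the 10-character grouping, computes GC percentage, and translates codons. It is an illustration under stated assumptions, not the database's own pipeline: the helper names are hypothetical, the codon table uses the standard amino-acid assignments (which translation table 11 shares), and alternative start codons permitted by table 11 are not handled, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in TCAG order, one row per first base (T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base groups of the gene sequence listed above.
print(translate("ATGATTCGCT TAGCGCCCTT"))  # MIRLAP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_percent` can be compared with the GC content field after rounding.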