Gene EcSMS35_1301 details


Gene Information

Locus tag: EcSMS35_1301
Symbol: cheB
ID: 6142801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1290239
End bp: 1291288
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 54%
IMG OID: 641616179
Product: chemotaxis-specific methylesterase
Protein accession: YP_001743359
Protein GI: 170683043
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG2201] Chemotaxis response regulator containing a CheY-like receiver domain and a methylesterase domain
TIGRFAM ID:
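
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers, not stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 1290239
end_bp = 1291288
gene_length_bp = 1050
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon,
# which does not contribute a residue to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match the gene length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match the protein length
```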


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.00345426
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAAAA TCAGGGTGTT ATCTGTCGAT GATTCGGCAC TGATGCGCCA GATCATGACA 
GAAATCATCA ACAGCCATAG CGACATGGAA ATGGTGGCGA CCGCCCCTGA TCCGCTGGTT
GCGCGTGATT TGATTAAAAA ATTCAATCCC GATGTATTGA CGCTGGATGT TGAAATGCCG
CGGATGGACG GACTGGATTT TCTCGAAAAA TTAATGCGTT TGCGTCCAAT GCCCGTTGTG
ATGGTTTCTT CCCTGACCGG CAAAGGGTCA GAAGTCACGC TGCGCGCGCT GGAGCTGGGG
GCGATAGATT TTGTCACCAA ACCGCAACTG GGTATTCGCG AAGGAATGCT GGCGTATAGC
GAAATGATTG CTGAAAAGGT GCGTACAGCA GCAAAGGCGA GCCTTGCAGC ACATAAGCCA
TTGTCGGCAC CGACAACGCT GAAGGCAGGG CCGTTGTTGA GTTCTGAAAA ACTGATTGCG
ATTGGTGCTT CAACGGGGGG AACTGAGGCA ATTCGTCACG TGCTGCAACC GTTGCCGCTT
TCCAGCCCGG CACTGTTAAT TACCCAGCAT ATGCCGCCCG GTTTCACCCG CTCTTTTGCC
GACAGACTTA ATAAGCTTTG CCAGATCGGG GTTAAAGAAG CCGAAGACGG AGAACGTGTC
TTACCGGGGC ATGCCTATAT TGCGCCGGGC GATCGGCATA TGGAGCTGGC GCGTAGTGGC
GCAAATTACC AAATCAAAAT TCACGATGGC CCGGCGGTTA ACCGTCATCG GCCTTCGGTA
GATGTGTTGT TCCATTCTGT CGCCAAACAG GCGGGGCGTA ATGCGGTTGG GGTGATCCTG
ACCGGTATGG GCAACGACGG TGCGGCGGGA ATGTTGGCGA TGCGTCAGGC GGGGGCATGG
ACCCTTGCGC AAAACGAAGC AAGTTGCGTG GTGTTCGGCA TGCCGCGCGA GGCCATCAAT
ATGGGTGGTG TCTGCGAAGT GATCGATCTT AGCCAGGTAA GCCAGCAAAT GTTGGCAAAA
ATTAGTGCCG GACAGGCGAT ACGTATTTAA
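
A GC content figure such as the one in the Gene Information fields can be recomputed directly from a sequence. The following is a minimal sketch, not the method used by the source database; the function name `gc_percent` is hypothetical, and the function assumes the input contains only base letters and whitespace.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence (60 bases), used here as a small example.
first_line = "ATGAGCAAAA TCAGGGTGTT ATCTGTCGAT GATTCGGCAC TGATGCGCCA GATCATGACA"
print(round(gc_percent(first_line), 2))
```

Passing the full 1050-base gene sequence to the same function gives a value that can be compared with the GC content reported in the record.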
 
Protein sequence
MSKIRVLSVD DSALMRQIMT EIINSHSDME MVATAPDPLV ARDLIKKFNP DVLTLDVEMP 
RMDGLDFLEK LMRLRPMPVV MVSSLTGKGS EVTLRALELG AIDFVTKPQL GIREGMLAYS
EMIAEKVRTA AKASLAAHKP LSAPTTLKAG PLLSSEKLIA IGASTGGTEA IRHVLQPLPL
SSPALLITQH MPPGFTRSFA DRLNKLCQIG VKEAEDGERV LPGHAYIAPG DRHMELARSG
ANYQIKIHDG PAVNRHRPSV DVLFHSVAKQ AGRNAVGVIL TGMGNDGAAG MLAMRQAGAW
TLAQNEASCV VFGMPREAIN MGGVCEVIDL SQVSQQMLAK ISAGQAIRI