Gene EcSMS35_2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2369 
SymbolatoC 
ID6144692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2405463 
End bp2406848 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content49% 
IMG OID641617242 
Productacetoacetate metabolism regulatory protein AtoC 
Protein accessionYP_001744414 
Protein GI170681202 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0790211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCTA TTAATCGCAT CCTTATTGTG GATGATGAAG ATAATGTTCG CCGTATGCTG 
AGCACCGCTT TTGCACTACA AGGATTCGAA ACACATTGTG CGAACAACGG GCGCACAGCA
TTACACCTGT TTGCCGATAT TCACCCTGAC GTGGTGTTGA TGGATATCCG CATGCCAGAG
ATGGACGGCA TCAAGGCACT CAAGGAGATG CGCAGCCATG AGACCCGGAC ACCCGTTATT
CTGATGACGG CCTATGCTGA AGTGGAAACC GCCGTCGAAG CACTACGCTG CGGAGCCTTC
GACTATGTTA TCAAACCGTT TGATCTCGAT GAGTTGAATT TAATCGTTCA GCGCACTTTA
CAACTCCAGT CAATGAAAAA AGAGATCCGT CATCTGCACC AGGCACTGAG CACCAGCTGG
CAATGGGGGC ACATTCTCAC CAACAGCCCG GCGATGATGG ACATCTGCAA AGACACCGCC
AAAATTGCCC TTTCTCAGGC CAGCGTCTTG ATTAGCGGTG AAAGCGGCAC CGGGAAAGAG
TTGATTGCCA GAGCGATTCA CTACAATTCG CGGAGGGCAA AGGGGCCGTT CATTAAAGTC
AACTGCGCGG CACTGCCGGA ATCGTTGCTC GAAAGTGAAC TGTTTGGTCA TGAAAAAGGC
GCATTTACTG GTGCACAAAC CTTACGTCAG GGATTATTTG AACGTGCCAA CGAAGGTACT
CTGCTCCTCG ACGAAATTGG CGAAATGCCG CTGGTACTGC AAGCCAAATT ACTACGCATT
CTGCAGGAAC GGGAATTTGA ACGGATTGGC GGTCATCAGA CCATAAAAGT TGATATCCGC
ATCATTGCTG CCACCAACCG CGACTTGCAG GCAATGGTGA AAGAAGGCAC CTTCCGTGAA
GATCTCTTTT ATCGCCTTAA CGTTATTCAT TTAATACTAC CGCCTCTGCG CGATCGCCGG
GAAGATATTT CCCTGTTAGC TAATCACTTT TTGCAAAAAT TCAGTAGTGA GAATCAGCGC
GATATTATCG ACATCGATCC GATGGCAATG TCGCTGCTTA CCGCCTGGTC ATGGCCGGGT
AATATTCGAG AGCTTTCCAA CGTCATTGAA CGCGCCGTCG TGATGAACTC AGGCCCGATC
ATTTTCTCTG AGGATCTTCC GCCGCAGATT CGTCAGCCAG TCTGTAATGC TGGTGAGGCA
AAAACAGCCC CTGTCGGTGA GCGTAATTTA AAAGAGGAAA TTAAACGCGT CGAAAAACGC
ATCATTATGG AAGTGCTGGA ACAACAAGAA GGAAACCGAA CCCGCACTGC GTTAATGCTG
GGCATCAGTC GCCGTGCATT GATGTATAAA CTCCAGGAAT ACGGTATCGA TCCGGCGGAT
GTATAA
 
Protein sequence
MTAINRILIV DDEDNVRRML STAFALQGFE THCANNGRTA LHLFADIHPD VVLMDIRMPE 
MDGIKALKEM RSHETRTPVI LMTAYAEVET AVEALRCGAF DYVIKPFDLD ELNLIVQRTL
QLQSMKKEIR HLHQALSTSW QWGHILTNSP AMMDICKDTA KIALSQASVL ISGESGTGKE
LIARAIHYNS RRAKGPFIKV NCAALPESLL ESELFGHEKG AFTGAQTLRQ GLFERANEGT
LLLDEIGEMP LVLQAKLLRI LQEREFERIG GHQTIKVDIR IIAATNRDLQ AMVKEGTFRE
DLFYRLNVIH LILPPLRDRR EDISLLANHF LQKFSSENQR DIIDIDPMAM SLLTAWSWPG
NIRELSNVIE RAVVMNSGPI IFSEDLPPQI RQPVCNAGEA KTAPVGERNL KEEIKRVEKR
IIMEVLEQQE GNRTRTALML GISRRALMYK LQEYGIDPAD V