Gene ECH74115_2526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2526 
SymbolttuC 
ID6966852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2389810 
End bp2390895 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content54% 
IMG OID643386395 
Producttartrate dehydrogenase/decarboxylase 
Protein accessionYP_002270877 
Protein GI209399002 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR02089] tartrate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA CGATGCGTAT TGCTGCGATC CCGGGAGACG GGATTGGCAA AGAAGTCCTT 
CCTGAAGGGA TTCGCGTGTT ACAGGCTGCC GCTGAGCGCT GGGGCTTCGC CTTGAGTTTT
GAGCAAATGG AGTGGGCGAG CTGCGAGTAT TACAGCCATC ACGGTAAAAT GATGCCGGAC
GACTGGCATG AGCAACTTAG CCGTTTCGAC GCCATCTATT TTGGTGCCGT CGGCTGGCCG
GACACCGTTC CGGACCATAT TTCGTTGTGG GGTTCGCTGC TGAAATTTCG TCGTGAATTC
GACCAGTACG TCAACCTGCG CCCGGTTCGT CTCTTTCCTG GCGTTCCCTG CCCGCTGGCG
GGAAAACAGC CTGGCGACAT CGATTTTTAC GTGGTCAGGG AAAACACCGA AGGCGAATAT
TCCTCGCTCG GCGGTAGAGT GAATGAAGGT ACAGAGCATG AAGTCGTCAT TCAGGAATCG
GTATTTACGC GTCGTGGTGT CGATCGCATT TTGCGTTATG CCTTCGAACT TGCGCAAAGC
CGCCCACGTA AGACGCTAAC TTCTGCCACT AAATCAAACG GTTTAGCCAT CAGCATGCCG
TACTGGGATG AGCGAGTGGA AGCAATGGCC GAGAATTACC CGGAGATCCG CTGGGACAAG
CAGCATATTG ATATTCTCTG CGCGCGTTTT GTGATGCAGC CGGAACGCTT CGATGTGGTG
GTGGCGTCCA ATTTGTTTGG CGATATCCTT TCCGATCTTG GCCCGGCCTG CACCGGCACC
ATTGGCATTG CCCCATCCGC CAACCTGAAT CCGGAACGCA CTTTCCCCTC GCTCTTCGAG
CCTGTCCACG GTTCCGCGCC GGATATCTAC GGGAAAAATA TTGCTAACCC TATCGCCACA
ATTTGGGCCG GGGCAATGAT GCTCGATTTT CTCGGCAATG GCGATGAGCG TTTCCAGCAA
GCGCATAACG GTATTCTGGC AGCGATTGAA GAAGTGATTG CTCACGGGCC GAAAACACCG
GATATGAAAG GCAGTGCCAC CACGCCACAG GTTGCCGACG CGATTTGCAA AATTATTTTG
CGTTAA
 
Protein sequence
MMKTMRIAAI PGDGIGKEVL PEGIRVLQAA AERWGFALSF EQMEWASCEY YSHHGKMMPD 
DWHEQLSRFD AIYFGAVGWP DTVPDHISLW GSLLKFRREF DQYVNLRPVR LFPGVPCPLA
GKQPGDIDFY VVRENTEGEY SSLGGRVNEG TEHEVVIQES VFTRRGVDRI LRYAFELAQS
RPRKTLTSAT KSNGLAISMP YWDERVEAMA ENYPEIRWDK QHIDILCARF VMQPERFDVV
VASNLFGDIL SDLGPACTGT IGIAPSANLN PERTFPSLFE PVHGSAPDIY GKNIANPIAT
IWAGAMMLDF LGNGDERFQQ AHNGILAAIE EVIAHGPKTP DMKGSATTPQ VADAICKIIL
R