Gene EcSMS35_1388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1388 
SymbolttuC 
ID6145597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1377313 
End bp1378398 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content53% 
IMG OID641616266 
Producttartrate dehydrogenase/decarboxylase 
Protein accessionYP_001743446 
Protein GI170679663 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR02089] tartrate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.253968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.832846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA CGATCCGTAT TGCTGCGATC CCGGGAGACG GGATTGGCAA AGAAGTCCTT 
CCTGAAGGGA TTCGCGTGTT AAAGGCTGCC GCTGAGCGCT GGGGCTTCGC CTTGAGTTTT
GAGCAAATGG AGTGGGCGAG CTGCGAGTAT TACAGCCATC ACGGTAAAAT GATGCCGGAC
GACTGGCATG AGCAACTTAG CCGTTTCGAC GCCATCTATT TTGGTGCCGT CGGCTGGCCG
GATACCGTTC CGGACCATAT TTCGTTGTGG GGTTCGCTGC TGAAATTTCG TCGTGAGTTC
GACCAGTACG TCAACCTGCG CCCGGTTCGT CTCTTTCCTG GCGTTCCCTG CCCGCTGGCG
GGAAAACAGC CTGGCGACAT CGATTTTTAC GTGGTCAGGG AAAACACCGA AGGAGAATAT
TCCTCGCTCG GCGGTAGAGT GAATGAAGGT ACAGAGCATG AAGTCGTCAT TCAGGAATCG
GTATTTACGC GTCGTGGTGT CGATCGCATT TTGCGTTATG CCTTTGAACT TGCACAAAGC
CGCCCACGTA AGACGCTAAC TTCTGCAACT AAATCGAACG GTTTAGCCAT CAGCATGCCG
TACTGGGATG AGCGAGTGGA AGCAATGGCC GAGAATTACC CGGAGATCTG CTGGGACAAG
CAGCATATTG ATATTCTCTG CGCGCGTTTT GTGATGCAGC CGGAACGCTT CGATGTGGTG
GTGGCGTCCA ATTTGTTTGG CGATATCCTT TCCGATCTTG GCCCGGCCTG CACCGGCACC
ATTGGCATTG CCCCATCCGC CAACCTGAAT CCGGAACGCA CTTTCCCGTC GCTCTTCGAG
CCAGTCCACG GTTCCGCGCC GGATATCTAC GGGAAAAACA TTGCTAACCC TATCGCCACA
ATTTGGGCCG GTGCAATGAT GCTCGATTTT CTCGGCAATG GCGATGAGCG TTTCCAGCAA
GCGCATAACG GTATTCTGGC CGCGATTGAA GAAGTGATTG CTCACGGGCC GAAAACACCT
GATATGAAAG GCAATGCCAC CACGCCACAG GTTGCCGACG CGATTTGCAA AATTATTTTG
CGTTAA
 
Protein sequence
MMKTIRIAAI PGDGIGKEVL PEGIRVLKAA AERWGFALSF EQMEWASCEY YSHHGKMMPD 
DWHEQLSRFD AIYFGAVGWP DTVPDHISLW GSLLKFRREF DQYVNLRPVR LFPGVPCPLA
GKQPGDIDFY VVRENTEGEY SSLGGRVNEG TEHEVVIQES VFTRRGVDRI LRYAFELAQS
RPRKTLTSAT KSNGLAISMP YWDERVEAMA ENYPEICWDK QHIDILCARF VMQPERFDVV
VASNLFGDIL SDLGPACTGT IGIAPSANLN PERTFPSLFE PVHGSAPDIY GKNIANPIAT
IWAGAMMLDF LGNGDERFQQ AHNGILAAIE EVIAHGPKTP DMKGNATTPQ VADAICKIIL
R