Gene EcSMS35_0562 details


Gene Information

Locus tag: EcSMS35_0562
Symbol: allD
ID: 6145275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 569182
End bp: 570231
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 52%
IMG OID: 641615454
Product: ureidoglycolate dehydrogenase
Protein accession: YP_001742661
Protein GI: 170683300
COG category: [C] Energy production and conversion
COG ID: [COG2055] Malate/L-lactate dehydrogenases
TIGRFAM ID: [TIGR03175] ureidoglycolate dehydrogenase
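
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the gene record.
start_bp = 569182
end_bp = 570231

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```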


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA GTCGGGAAAC ACTCCACCAG CTAATTGAGA ATAAACTCTG CCAGGCTGGG 
TTAAAACGTG AGCACGCTGC AACCGTGGCT GAAGTATTGG TTTACGCCGA TGCCAGAGGG
ATCCACTCTC ATGGCGCGGT GCGCGTGGAA TACTACGCCG AACGCATTTC AAAAGGCGGC
ACCAACCGTG AACCGGAATT TCGTCTTGAA GAAACCGGAC CGTGCTCGGC AATTTTACAT
GCCGACAATG CCGCCGGACA GGTCGCGGCG AAAATGGGTA TGGAACATGC CATCAAAACC
GCCCAGCAAA ATGGCGTTGC GGTGGTCGGT ATCAGCCGGA TGGGTCACAG CGGCGCAATC
TCTTATTTTG TACAGCAGGC AGCTCGCGCC GGGTTAATTG GCATTTCGAT GTGCCAGTCC
GATCCAATGG TGGTGCCGTT TGGCGGCGCG GAAATTTACT ACGGTACTAA CCCACTGGCC
TTTGCCGCGC CGGGAGAAGG CGACGAGATC CTTACCTTTG ATATGGCGAC TACCGTACAG
GCATGGGGAA AAGTGCTCGA CGCCCGTTCG CGTAATATGT CTATCCCGGA TACCTGGGCG
GTCGATAAAA ACGGTGCACC AACAACCGAT CCGTTCGCGG TACATGCTCT GCTCCCCGCC
GCTGGGCCGA AAGGGTATGG CCTGATGATG ATGATTGACG TCCTCTCAGG CGTCTTACTC
GGCTTACCGT TTGGGCGACA GGTTAGTTCG ATGTATGACG ATTTACACGC CGGGCGTAAT
TTGGGGCAAT TACATATCGT TATTAACCCG AACTTTTTCT CCTCCAGCGA ATTATTCCGT
CAACATCTTA GCCAGACCAT GCGCGAATTA AATGCCATTA CCCCCGCGCC CGGTTTTAAT
CAGGTTTATT ATCCCGGACA GGATCAGGAT ATTAAACAAC GCAAAGCCGC CGTCGAAGGC
ATCGAAATTG TTGATGATAT TTACCAGTAT TTGATTTCCG ACGCGCTTTA TAACACGTCA
TACGAAACGA AAAATCCCTT TGCGCAATAA
 
Protein sequence
MKISRETLHQ LIENKLCQAG LKREHAATVA EVLVYADARG IHSHGAVRVE YYAERISKGG 
TNREPEFRLE ETGPCSAILH ADNAAGQVAA KMGMEHAIKT AQQNGVAVVG ISRMGHSGAI
SYFVQQAARA GLIGISMCQS DPMVVPFGGA EIYYGTNPLA FAAPGEGDEI LTFDMATTVQ
AWGKVLDARS RNMSIPDTWA VDKNGAPTTD PFAVHALLPA AGPKGYGLMM MIDVLSGVLL
GLPFGRQVSS MYDDLHAGRN LGQLHIVINP NFFSSSELFR QHLSQTMREL NAITPAPGFN
QVYYPGQDQD IKQRKAAVEG IEIVDDIYQY LISDALYNTS YETKNPFAQ
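
Both sequences are printed in blocks of 10 residues, 6 blocks per line, so the spaces and line breaks have to be removed before the sequences can be used elsewhere. A minimal sketch of that cleanup follows; `join_sequence_blocks` is a hypothetical helper name, and only the last line of the gene sequence is used here as sample input.

```python
def join_sequence_blocks(lines):
    """Join space-separated sequence blocks from several lines into one string."""
    # str.split() with no arguments drops all whitespace, including trailing spaces.
    return "".join("".join(line.split()) for line in lines).upper()


# Last line of the gene sequence as displayed in the record.
last_gene_line = "TACGAAACGA AAAATCCCTT TGCGCAATAA"
tail = join_sequence_blocks([last_gene_line])

print(len(tail), tail[-3:])
```

Passing all 18 gene-sequence lines to the same helper should give a string whose length can be compared against the Gene Length field.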