Gene ECH74115_0618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0618 
SymbolallD 
ID6968967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp641089 
End bp642138 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content53% 
IMG OID643384656 
Productureidoglycolate dehydrogenase 
Protein accessionYP_002269170 
Protein GI209398912 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID[TIGR03175] ureidoglycolate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA GTCGGGAAAC ACTCCACCAG CTAATTGAGA ATAAACTCTG CCAGGCTGGG 
TTAAAACGTG AGCACGCTGC AACCGTGGCT GAAGTATTGG TTTATGCCGA TGCCAGAGGG
ATCCACTCTC ATGGCGCGGT GCGCGTGGAA TACTACGCGG AACGCATTTC AAAAGGCGGC
ACCAACCGCG AACCGGAGTT TCGTCTTGAG GAAACCGGGC CGTGCTCGGC AATTTTACAT
GCCGACAATG CCGCCGGACA GGTCGCGGCG AAAATGGGTA TGGAACATGC CATCAAAACC
GCCCAGCAAA ATGGCGTTGC GGTGGTCGGT ATCAGCCGGA TGGGTCACAG CGGCGCAATC
TCTTATTTTG TGCAGCAGGC AGCCCGCGCC GGATTAATTG GCATTTCGAT GTGCCAGTCC
GATCCGATGG TGGTGCCGTT TGGCGGCGCG GAAATTTACT ACGGTACTAA CCCACTGGCC
TTTGCCGCGC CGGGAGAAGG CGACGAGATC CTTACCTTTG ATATGGCGAC TACCGTACAG
GCATGGGGAA AAGTCCTCGA CGCCCGCTCG CGTAATATGT CTATCCCGGA TACCTGGGCG
GTCGATAAAA ACGGTGCACC AACAACCGAT CCGTTCGCGG TACATGCTCT GCTCCCCGCA
GCCGGGCCGA AAGGGTATGG CCTGATGATG ATGATTGACG TCCTCTCAGG CGTCTTACTC
GGCTTACCGT TCGGGCGACA GGTTAGTTCG ATGTATGACG ATTTACACGC CGGGCGTAAT
TTGGGGCAAT TACATGTAGT TATTAACCCG AACTTTTTCT CCTCCAGCGA ATTATTCCGT
CAACATCTTA GCCAGACCAT GCGCGAATTA AATGCCATTA CCCCCGCGCC CGGTTTTAAT
CAGGTTTATT ATCCCGGACA GGATCAGGAT ATTAAACAAC GCCAAGCCGC CGTCGAAGGC
ATCGAAATTG TTGATGATAT TTACCAGTAT TTAATTTCCG ACGCGCTTTA TAACACGTCA
TACGAAACGA AAAATCCCTT TGCGCAATAA
 
Protein sequence
MKISRETLHQ LIENKLCQAG LKREHAATVA EVLVYADARG IHSHGAVRVE YYAERISKGG 
TNREPEFRLE ETGPCSAILH ADNAAGQVAA KMGMEHAIKT AQQNGVAVVG ISRMGHSGAI
SYFVQQAARA GLIGISMCQS DPMVVPFGGA EIYYGTNPLA FAAPGEGDEI LTFDMATTVQ
AWGKVLDARS RNMSIPDTWA VDKNGAPTTD PFAVHALLPA AGPKGYGLMM MIDVLSGVLL
GLPFGRQVSS MYDDLHAGRN LGQLHVVINP NFFSSSELFR QHLSQTMREL NAITPAPGFN
QVYYPGQDQD IKQRQAAVEG IEIVDDIYQY LISDALYNTS YETKNPFAQ