Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0618 |
Symbol | allD |
ID | 6968967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 641089 |
End bp | 642138 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384656 |
Product | ureidoglycolate dehydrogenase |
Protein accession | YP_002269170 |
Protein GI | 209398912 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | [TIGR03175] ureidoglycolate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA GTCGGGAAAC ACTCCACCAG CTAATTGAGA ATAAACTCTG CCAGGCTGGG TTAAAACGTG AGCACGCTGC AACCGTGGCT GAAGTATTGG TTTATGCCGA TGCCAGAGGG ATCCACTCTC ATGGCGCGGT GCGCGTGGAA TACTACGCGG AACGCATTTC AAAAGGCGGC ACCAACCGCG AACCGGAGTT TCGTCTTGAG GAAACCGGGC CGTGCTCGGC AATTTTACAT GCCGACAATG CCGCCGGACA GGTCGCGGCG AAAATGGGTA TGGAACATGC CATCAAAACC GCCCAGCAAA ATGGCGTTGC GGTGGTCGGT ATCAGCCGGA TGGGTCACAG CGGCGCAATC TCTTATTTTG TGCAGCAGGC AGCCCGCGCC GGATTAATTG GCATTTCGAT GTGCCAGTCC GATCCGATGG TGGTGCCGTT TGGCGGCGCG GAAATTTACT ACGGTACTAA CCCACTGGCC TTTGCCGCGC CGGGAGAAGG CGACGAGATC CTTACCTTTG ATATGGCGAC TACCGTACAG GCATGGGGAA AAGTCCTCGA CGCCCGCTCG CGTAATATGT CTATCCCGGA TACCTGGGCG GTCGATAAAA ACGGTGCACC AACAACCGAT CCGTTCGCGG TACATGCTCT GCTCCCCGCA GCCGGGCCGA AAGGGTATGG CCTGATGATG ATGATTGACG TCCTCTCAGG CGTCTTACTC GGCTTACCGT TCGGGCGACA GGTTAGTTCG ATGTATGACG ATTTACACGC CGGGCGTAAT TTGGGGCAAT TACATGTAGT TATTAACCCG AACTTTTTCT CCTCCAGCGA ATTATTCCGT CAACATCTTA GCCAGACCAT GCGCGAATTA AATGCCATTA CCCCCGCGCC CGGTTTTAAT CAGGTTTATT ATCCCGGACA GGATCAGGAT ATTAAACAAC GCCAAGCCGC CGTCGAAGGC ATCGAAATTG TTGATGATAT TTACCAGTAT TTAATTTCCG ACGCGCTTTA TAACACGTCA TACGAAACGA AAAATCCCTT TGCGCAATAA
|
Protein sequence | MKISRETLHQ LIENKLCQAG LKREHAATVA EVLVYADARG IHSHGAVRVE YYAERISKGG TNREPEFRLE ETGPCSAILH ADNAAGQVAA KMGMEHAIKT AQQNGVAVVG ISRMGHSGAI SYFVQQAARA GLIGISMCQS DPMVVPFGGA EIYYGTNPLA FAAPGEGDEI LTFDMATTVQ AWGKVLDARS RNMSIPDTWA VDKNGAPTTD PFAVHALLPA AGPKGYGLMM MIDVLSGVLL GLPFGRQVSS MYDDLHAGRN LGQLHVVINP NFFSSSELFR QHLSQTMREL NAITPAPGFN QVYYPGQDQD IKQRQAAVEG IEIVDDIYQY LISDALYNTS YETKNPFAQ
|
| |