Gene Information |
Locus tag | EcSMS35_0562 |
Symbol | allD |
ID | 6145275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 569182 |
End bp | 570231 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615454 |
Product | ureidoglycolate dehydrogenase |
Protein accession | YP_001742661 |
Protein GI | 170683300 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | [TIGR03175] ureidoglycolate dehydrogenase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA GTCGGGAAAC ACTCCACCAG CTAATTGAGA ATAAACTCTG CCAGGCTGGG TTAAAACGTG AGCACGCTGC AACCGTGGCT GAAGTATTGG TTTACGCCGA TGCCAGAGGG ATCCACTCTC ATGGCGCGGT GCGCGTGGAA TACTACGCCG AACGCATTTC AAAAGGCGGC ACCAACCGTG AACCGGAATT TCGTCTTGAA GAAACCGGAC CGTGCTCGGC AATTTTACAT GCCGACAATG CCGCCGGACA GGTCGCGGCG AAAATGGGTA TGGAACATGC CATCAAAACC GCCCAGCAAA ATGGCGTTGC GGTGGTCGGT ATCAGCCGGA TGGGTCACAG CGGCGCAATC TCTTATTTTG TACAGCAGGC AGCTCGCGCC GGGTTAATTG GCATTTCGAT GTGCCAGTCC GATCCAATGG TGGTGCCGTT TGGCGGCGCG GAAATTTACT ACGGTACTAA CCCACTGGCC TTTGCCGCGC CGGGAGAAGG CGACGAGATC CTTACCTTTG ATATGGCGAC TACCGTACAG GCATGGGGAA AAGTGCTCGA CGCCCGTTCG CGTAATATGT CTATCCCGGA TACCTGGGCG GTCGATAAAA ACGGTGCACC AACAACCGAT CCGTTCGCGG TACATGCTCT GCTCCCCGCC GCTGGGCCGA AAGGGTATGG CCTGATGATG ATGATTGACG TCCTCTCAGG CGTCTTACTC GGCTTACCGT TTGGGCGACA GGTTAGTTCG ATGTATGACG ATTTACACGC CGGGCGTAAT TTGGGGCAAT TACATATCGT TATTAACCCG AACTTTTTCT CCTCCAGCGA ATTATTCCGT CAACATCTTA GCCAGACCAT GCGCGAATTA AATGCCATTA CCCCCGCGCC CGGTTTTAAT CAGGTTTATT ATCCCGGACA GGATCAGGAT ATTAAACAAC GCAAAGCCGC CGTCGAAGGC ATCGAAATTG TTGATGATAT TTACCAGTAT TTGATTTCCG ACGCGCTTTA TAACACGTCA TACGAAACGA AAAATCCCTT TGCGCAATAA
Protein sequence | MKISRETLHQ LIENKLCQAG LKREHAATVA EVLVYADARG IHSHGAVRVE YYAERISKGG TNREPEFRLE ETGPCSAILH ADNAAGQVAA KMGMEHAIKT AQQNGVAVVG ISRMGHSGAI SYFVQQAARA GLIGISMCQS DPMVVPFGGA EIYYGTNPLA FAAPGEGDEI LTFDMATTVQ AWGKVLDARS RNMSIPDTWA VDKNGAPTTD PFAVHALLPA AGPKGYGLMM MIDVLSGVLL GLPFGRQVSS MYDDLHAGRN LGQLHIVINP NFFSSSELFR QHLSQTMREL NAITPAPGFN QVYYPGQDQD IKQRKAAVEG IEIVDDIYQY LISDALYNTS YETKNPFAQ
| |
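The length fields in this record can be cross-checked against each other. The sketch below is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon; both assumptions are consistent with the values shown but are not stated by the record. The function names (`span_length`, `coding_length_aa`, `clean`, `gc_fraction`) are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 569182
END_BP = 570231
GENE_LENGTH_BP = 1050
PROTEIN_LENGTH_AA = 349

# First and last 10-base blocks of the gene sequence as printed in the record.
FIRST_BLOCK = "ATGAAAATCA"
LAST_BLOCK = "TGCGCAATAA"


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean(seq: str) -> str:
    """Remove the spaces that split the printed sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    # Coordinates and gene length agree under the inclusive-interval assumption.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene length and protein length agree once the stop codon is excluded.
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # The printed sequence starts with ATG and ends with the stop codon TAA.
    print(clean(FIRST_BLOCK)[:3], clean(LAST_BLOCK)[-3:])
```

To compare against the GC content field, pass the full Gene sequence value to `gc_fraction`; the spaces between blocks are stripped by `clean` before counting.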