Gene Information |
Locus tag | Dgeo_2731 |
Symbol | |
ID | 4073962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008010 |
Strand | + |
Start bp | 295489 |
End bp | 296724 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641228745 |
Product | imidazolonepropionase |
Protein accession | YP_594238 |
Protein GI | 94972198 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
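
The coordinates and lengths reported above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates (as the reported 1236 bp length implies) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(295489, 296724)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both values match the Gene Length and Protein Length rows of the table.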
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.626121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAC TGCTCTTGAC TGGCATCACC CAGCTCGTGA CGCCCCCGCC GGGACCGCAA AGGGGCGCGG CCATGCGGAA GCTGACGGTG CTGCAGGACG CGGCGCTCCT CATGCGTGAC GGGATGATCG CGTGGGTGGG ATCACGGCAG GAGGCGCCCG CCGCCGCTCA GATCCGCGAC TTGGGCGGCG TCGCCGTCGT TCCCGGCCTG GTCGACCCTC ACACCCATGC GGTCTGGGCC GGGGACCGCC TGGCCGATTT CGAGGCACGG GTGGAAGGCG TGCCCTACGA GGAGCTGCTG GCGCGGGGGG GCGGCATCCG CTCCACCATG CAGGCGACGG CAACGGCAGG TGTGGAGGAA CTTGCCCAGC TCGCCCACCC CCGCCTAGCG GCCCTGCTCC ATTCCGGCGC GACCACCATC GAGGTCAAGA GCGGCTACGG GCTGGACTTT GGGGCCGAGT TGAGGATGCT GAAGGCGGTG CGTGCGTTGC AGGAGAGCTT GCCGGCCACG CTCGTGCCCA CGCTGCTGAT TCACGTCCCG CCCACCGAAA GCCGCGCGGC GTACGTCCGG GCGGTCTGTG AGGCCCTCAT TCCCGAGGTG GCGCGCAAGC GCCTGGCTGC CGCTGTGGAC GTGTTCTGCG AGCGCGAAGC CTTCACGGTG GAGGAAACGC GTGCCCTCTT CGCGGCGGCC CGGTCGAATG GCCTGCAGGT CAAGCTGCAC GCCGACCAGT TCCACGCCCT CGGCGGCACC GAACTTGCCT GCGCGGTGGA GGCGCTCAGC GTGGACCACC TGGAAGCCAG CGGCGAGGCG CAGATCGAGG CGCTGGCCGC GTCGGAGACG GTGGCGACGG TCCTGCCCGG CGTCACGCTG CACCTGGGGC TGAGGGCAGC CCCGGCCCGC CGCCTCGTGG ACGCGGGCGC CTGCGTGGCG GTCGGTACGG ACCTGAACCC CGGCAGCTCT CCCCTCTTCA GCGCCCAGCT CGCGCTGGCC CTCGCGGTGC GGCTGAACGG CCTCACGCCC GCCGAGGCCC TCACCGCTTG CACCGTGAAC GCCGCCGCCG CACTGGGGCT GAGGGACCGG GGGGCACTGG TGGCTGGGCA GCGGGCCGAC TTGCTCGCCC TGCATGCCTC CGACTGGCGC GACCTGGCCT ACACGCTGGG CGCAAACCCT GTCCGCGACG TGTTCGTGGG CGGGCAAAAC ATCAAGGAGA CTCTGAGCAA GGAGAAGGCC CTGTGA
|
Protein sequence | MAELLLTGIT QLVTPPPGPQ RGAAMRKLTV LQDAALLMRD GMIAWVGSRQ EAPAAAQIRD LGGVAVVPGL VDPHTHAVWA GDRLADFEAR VEGVPYEELL ARGGGIRSTM QATATAGVEE LAQLAHPRLA ALLHSGATTI EVKSGYGLDF GAELRMLKAV RALQESLPAT LVPTLLIHVP PTESRAAYVR AVCEALIPEV ARKRLAAAVD VFCEREAFTV EETRALFAAA RSNGLQVKLH ADQFHALGGT ELACAVEALS VDHLEASGEA QIEALAASET VATVLPGVTL HLGLRAAPAR RLVDAGACVA VGTDLNPGSS PLFSAQLALA LAVRLNGLTP AEALTACTVN AAAALGLRDR GALVAGQRAD LLALHASDWR DLAYTLGANP VRDVFVGGQN IKETLSKEKA L
|
| |
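
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. It runs on only the first two blocks of the gene sequence to keep the example short, and the helper names are hypothetical. Applying it to the full gene sequence is how one would compare against the GC content row, though that result is not asserted here.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
excerpt = clean_sequence("ATGGCTGAAC TGCTCTTGAC")
print(len(excerpt))               # 20
print(gc_fraction(excerpt))       # 0.5
print(excerpt.startswith("ATG"))  # True
```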