Gene Dgeo_2731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2731 
Symbol 
ID4073962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp295489 
End bp296724 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content71% 
IMG OID641228745 
Productimidazolonepropionase 
Protein accessionYP_594238 
Protein GI94972198 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.626121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAC TGCTCTTGAC TGGCATCACC CAGCTCGTGA CGCCCCCGCC GGGACCGCAA 
AGGGGCGCGG CCATGCGGAA GCTGACGGTG CTGCAGGACG CGGCGCTCCT CATGCGTGAC
GGGATGATCG CGTGGGTGGG ATCACGGCAG GAGGCGCCCG CCGCCGCTCA GATCCGCGAC
TTGGGCGGCG TCGCCGTCGT TCCCGGCCTG GTCGACCCTC ACACCCATGC GGTCTGGGCC
GGGGACCGCC TGGCCGATTT CGAGGCACGG GTGGAAGGCG TGCCCTACGA GGAGCTGCTG
GCGCGGGGGG GCGGCATCCG CTCCACCATG CAGGCGACGG CAACGGCAGG TGTGGAGGAA
CTTGCCCAGC TCGCCCACCC CCGCCTAGCG GCCCTGCTCC ATTCCGGCGC GACCACCATC
GAGGTCAAGA GCGGCTACGG GCTGGACTTT GGGGCCGAGT TGAGGATGCT GAAGGCGGTG
CGTGCGTTGC AGGAGAGCTT GCCGGCCACG CTCGTGCCCA CGCTGCTGAT TCACGTCCCG
CCCACCGAAA GCCGCGCGGC GTACGTCCGG GCGGTCTGTG AGGCCCTCAT TCCCGAGGTG
GCGCGCAAGC GCCTGGCTGC CGCTGTGGAC GTGTTCTGCG AGCGCGAAGC CTTCACGGTG
GAGGAAACGC GTGCCCTCTT CGCGGCGGCC CGGTCGAATG GCCTGCAGGT CAAGCTGCAC
GCCGACCAGT TCCACGCCCT CGGCGGCACC GAACTTGCCT GCGCGGTGGA GGCGCTCAGC
GTGGACCACC TGGAAGCCAG CGGCGAGGCG CAGATCGAGG CGCTGGCCGC GTCGGAGACG
GTGGCGACGG TCCTGCCCGG CGTCACGCTG CACCTGGGGC TGAGGGCAGC CCCGGCCCGC
CGCCTCGTGG ACGCGGGCGC CTGCGTGGCG GTCGGTACGG ACCTGAACCC CGGCAGCTCT
CCCCTCTTCA GCGCCCAGCT CGCGCTGGCC CTCGCGGTGC GGCTGAACGG CCTCACGCCC
GCCGAGGCCC TCACCGCTTG CACCGTGAAC GCCGCCGCCG CACTGGGGCT GAGGGACCGG
GGGGCACTGG TGGCTGGGCA GCGGGCCGAC TTGCTCGCCC TGCATGCCTC CGACTGGCGC
GACCTGGCCT ACACGCTGGG CGCAAACCCT GTCCGCGACG TGTTCGTGGG CGGGCAAAAC
ATCAAGGAGA CTCTGAGCAA GGAGAAGGCC CTGTGA
 
Protein sequence
MAELLLTGIT QLVTPPPGPQ RGAAMRKLTV LQDAALLMRD GMIAWVGSRQ EAPAAAQIRD 
LGGVAVVPGL VDPHTHAVWA GDRLADFEAR VEGVPYEELL ARGGGIRSTM QATATAGVEE
LAQLAHPRLA ALLHSGATTI EVKSGYGLDF GAELRMLKAV RALQESLPAT LVPTLLIHVP
PTESRAAYVR AVCEALIPEV ARKRLAAAVD VFCEREAFTV EETRALFAAA RSNGLQVKLH
ADQFHALGGT ELACAVEALS VDHLEASGEA QIEALAASET VATVLPGVTL HLGLRAAPAR
RLVDAGACVA VGTDLNPGSS PLFSAQLALA LAVRLNGLTP AEALTACTVN AAAALGLRDR
GALVAGQRAD LLALHASDWR DLAYTLGANP VRDVFVGGQN IKETLSKEKA L