Gene Dgeo_2419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2419 
Symbol 
ID4073647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp67111 
End bp68088 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content69% 
IMG OID641228534 
Product3,4-dihydroxyphenylacetate 2,3-dioxygenase HpaD 
Protein accessionYP_593927 
Protein GI94971887 
COG category[R] General function prediction only 
COG ID[COG2514] Predicted ring-cleavage extradiol dioxygenase 
TIGRFAM ID[TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCCC CCGCCCATCC CGATATCGTT CGCGTCGCCC ACGCCGTCTT CACCGTCACG 
GACCTGGAAG CTTCGCGCGA GTTCTATGTG AACCTGCTCG GTCTGAACGT GCTGCACGAG
GAGCCGGGCG CCCTCTACCT GCGCGGGGTG GAGGACCGCG AGTGGACCCT CAAGCTGGAA
GAGAACCCGG AAGCCGGGGT GCGGCACATC GCCTACCGGG TGCGGACGTA CGCCGACCTG
GACGGCCTGG TCGCGCTGGC GGAGCGTGAG GGCCTCCCCT CCCGCTGGGA AGAGGAACTC
GACCGGCCCC GCATGCTGCG CATGCAAGAC CCCTTTGGCG TCCCGGTCGC CTTTTACCGC
GAGAGCCGCA CCCACCCCTG GTTCTTGCAG GACTACCACC TGCACCGCGG GCCGGGTTTG
CAACGGGTGG ACCACGTGAA CGTGATGACG CCGGACGTGG AAGGCATGCT GGGCTGGTAC
ACGCGCGAAC TGGGGTTCCG CGTCTCCGAG TACACCGAGG ACGAGGCGGG GCGCATCTGG
GCGGCCTGGA TTCAGCGGCG GGGCGGCGTG CATGACCTCG CCCTGACGAA TGGCGCGGGG
CCGCGGCTGC ACCACTGGGC CTACTGGATG CCCGACGCCA TGAGCATCAT CCGCGCCTGC
GACATCCTGG CGGGGGCGCG GCAGCCCGAG CGCATCGAGC GCGGGCCGGG GCGGCACGGC
ATCTCCAACG CCTTTTTCCT GTATATCCGC GACCCAGACG GCCACCGCAT CGAGCTGTAC
ACCTCTGACT ACCTCACGGT GGACCCCGAC TTCCAGCCCA TCCGCTGGCA GCTCAACGAC
CCGCGGCGCC AGACGCTGTG GGGGGCCAAG ACGCCGCGGA GCTGGTTTGA GGAAGGCTCG
CGGCTGGAAG CTTTCGGCGG GGGCTGGGTC ACGCCGGCGG AGGGGCAGCT GAAGGGGCTA
CCGGTTCATG TCATCTGA
 
Protein sequence
MTAPAHPDIV RVAHAVFTVT DLEASREFYV NLLGLNVLHE EPGALYLRGV EDREWTLKLE 
ENPEAGVRHI AYRVRTYADL DGLVALAERE GLPSRWEEEL DRPRMLRMQD PFGVPVAFYR
ESRTHPWFLQ DYHLHRGPGL QRVDHVNVMT PDVEGMLGWY TRELGFRVSE YTEDEAGRIW
AAWIQRRGGV HDLALTNGAG PRLHHWAYWM PDAMSIIRAC DILAGARQPE RIERGPGRHG
ISNAFFLYIR DPDGHRIELY TSDYLTVDPD FQPIRWQLND PRRQTLWGAK TPRSWFEEGS
RLEAFGGGWV TPAEGQLKGL PVHVI