Gene Dgeo_2240 details


Gene Information

Locus tag: Dgeo_2240
Symbol:
ID: 4057208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 2362180
End bp: 2364045
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 65%
IMG OID: 641231287
Product: carboxylyase-like protein
Protein accession: YP_605703
Protein GI: 94986339
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
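
The start, end, gene length, and protein length fields above are mutually consistent, which the following minimal Python sketch illustrates. It assumes the coordinates are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2362180, 2364045)
print(length)                  # 1866
print(protein_length(length))  # 621
```

Both printed values match the Gene Length and Protein Length fields reported for Dgeo_2240.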


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00980247
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCTGC CAGCCAGGCC AGCTTCGGCC CCACGCAGCC CTGATCTGCA GAGTTTTCTC 
CGCCTCCTGG AGGAACGCGG CGAACTCGTG CGCGTGTCCA TTCCGGTTGA CCGCGAACTC
GAAATCACCG AGATTGCCGA CCGTCTCGTC AAGCAAGGCG GCCCGGCCGT CTTGTTCGAG
CACGTGAAGG GGAGCCCCTT TCCCCTGGTG ATCGGCCTGC TGGGCACCCG TGAGCGCACG
GCCCGGGCCC TGGGTGTGGC TGACCTCGAC GACCTCGCCC GGAAGGTCCG GCACCTGATG
GATCTCAAAG GGGGTGGCGG TCTGAGCGGT CTGCTGGGCA ACGTGGGCAA GTTGCGCGAC
GCGCTCCATC TGCCCCCGCG GCGTGTTCGG TCCGGTCCAG CGCAGGAGAT CATATGGACG
GGAGATGAGG TGGACCTTTC CAAACTCCCG GTCCTGAAGT GCTGGCCGCT CGACGGCGGG
CCGTTCATCA CGCTGCCTCT GGTGATGACC CGCGACCCCG AGACGGGCGA ACGCAACATG
GGGATGTACC GCATGCAGGT GATGGGTCGG AATGTCACGG GGATGCACTG GCAGCGCCAC
AAGACCGGGA CCCGGCACCT GGAAAAGGCG CGCCGCCTAG GGCAAAAGTT GCCCGTTGCG
GTGGCTCTTG GAGGAGACCC AGCCCTGATC TACGCGGCGA CCGCTCCGCT GCCTCCCATT
CCCGGCCTCG ACGAGTACGC TCTGGCGGGC TACCTACGCG GCGAGCGCTA CCCCGTGATG
AGGGGCGTGA CGGTCGACCT GGACGTTCCG GCCAACGCTG AATTTATCCT CGAAGGCTAC
GTGGATCCGC AAGAAGAGTG GGCGGTAGAA GGACCCTTTG GGGACCATAC CGGCTTCTAC
ACCCTCCCCG ACCGCTACCC GCGCTTTCAC GTCACGGCGA TCACCATGCG CCGCAACCCG
GTTTATCCGG CCACCATCGT GGGCCGTCCG CCGATGGAGG ATGCCTATCT GATCGAGGCC
AGCGAGCGGC TCTTCCTGCC CGCCGCGCAG ATGATCTTGC CGGAAATTGT GGACTACCAT
ATGCCGCCCG CCGGAGTCGC CCACAACCTT GTCGTGGTCA GCATCAAGAA GAGTTATCCC
GGCCAGGCCT ACAAGGTCGC CCAGGGGCTT CTCGGTCTGG GCCAGATGAT GTTTGCCAAG
GTGGTTGTCG TCGTGGACGA GGACGTGCAG GTGAATGACT TTGCTGCAGT CTGGCGCGAG
GTGACAGCCA GGGCCGTGCC AGGCCGCGAT ACCCTGATTA CCCGTGGTCC GGTCGACGTG
CTCGACCACT CCAGCCGCGG GTGGGGCTAC GGCGGCAAGC TGATCATTGA CGCCACCACC
AAACTTCCTG AGGAGATCGG CAGCGCCGTC AGCAGCCGAG AGGAGCTGGG CAGGGAAGGC
AGGGTAGAGC CGCTTTTTGT GCCGCGTGTT GCCGTAGACC TTCCCAACTA TGAGGGTGTC
TTGGCCCAGC GGCAGACCCC GGATGGCTAC TGGTCTGTGG CCCTGCACAA GACCCGCGCT
GGCCAGTCCC AGGCCCTCGC GCAAGCCTTT GCCGCGCACC CCGCCGCCTC GGGAGTGCGC
CACCTCCTGA TCGCGGACGA ACAGACTGAC GTGCACAACA CGCAGGATGT GTGGTGGACC
GTTCTCAACA ACATTGATCC TGAGCGCGAC GTGCGTCAGC TCGGGGGACT GCTCGTTTGG
GACGGCTCGC GCAAGCTGTC TGAGGAGGGC TTCGTCCGCA CGTGGCCTCC CAAGATCGAG
ATGACACCTG AGGTGCGGCG CCGGGTGGAC GCCCGCTGGC ACCTGTACGG CCTGCCCGAG
CTGTAG
 
Protein sequence
MFLPARPASA PRSPDLQSFL RLLEERGELV RVSIPVDREL EITEIADRLV KQGGPAVLFE 
HVKGSPFPLV IGLLGTRERT ARALGVADLD DLARKVRHLM DLKGGGGLSG LLGNVGKLRD
ALHLPPRRVR SGPAQEIIWT GDEVDLSKLP VLKCWPLDGG PFITLPLVMT RDPETGERNM
GMYRMQVMGR NVTGMHWQRH KTGTRHLEKA RRLGQKLPVA VALGGDPALI YAATAPLPPI
PGLDEYALAG YLRGERYPVM RGVTVDLDVP ANAEFILEGY VDPQEEWAVE GPFGDHTGFY
TLPDRYPRFH VTAITMRRNP VYPATIVGRP PMEDAYLIEA SERLFLPAAQ MILPEIVDYH
MPPAGVAHNL VVVSIKKSYP GQAYKVAQGL LGLGQMMFAK VVVVVDEDVQ VNDFAAVWRE
VTARAVPGRD TLITRGPVDV LDHSSRGWGY GGKLIIDATT KLPEEIGSAV SSREELGREG
RVEPLFVPRV AVDLPNYEGV LAQRQTPDGY WSVALHKTRA GQSQALAQAF AAHPAASGVR
HLLIADEQTD VHNTQDVWWT VLNNIDPERD VRQLGGLLVW DGSRKLSEEG FVRTWPPKIE
MTPEVRRRVD ARWHLYGLPE L
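
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any length or GC calculation. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input, so the printed GC value applies to that 60 bp fragment and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record (illustrative input only).
first_line = "ATGTTCCTGC CAGCCAGGCC AGCTTCGGCC CCACGCAGCC CTGATCTGCA GAGTTTTCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene and protein text should give lengths that can be compared with the Gene Length and Protein Length fields.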