Gene Dole_2106 details

Gene Information

Locus tag: Dole_2106
Symbol:
ID: 5694949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2558731
End bp: 2560452
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 63%
IMG OID: 641264707
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_001529987
Protein GI: 158522117
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
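
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2558731
end_bp = 2560452
gene_length_bp = 1722
protein_length_aa = 573

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS is read in triplets: one codon per residue plus one stop codon.
codons, remainder = divmod(span_bp, 3)

print(span_bp == gene_length_bp)        # coordinates agree with gene length
print(remainder == 0)                   # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop = residues
```

Under those assumptions all three checks print True: 1722 bp is 574 codons, and dropping the stop codon leaves 573 residues.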


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00306237
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCGGA ATATAAACAC CAGACAACCC GATCCCTCGG CCGTTACCGC CATTGCAAAG 
GCCCTGCCCT GCTCGGACAT TGCCGCGTCC CTGCTGGTCA ATCGGGGCAT TCTATCGGTA
AAACAGGCCC GGCACTTCCT GTCCGCCACC GTGGCCGACC TGCCCTCTCC CTTTTCCATG
AAAGGTATAG ACACGGCCGT GGCCCGCATT CACTCGGCCC TGATCAACCA TGAAACCATC
CTGGTGTTCG GCGATTATGA TGCTGACGGC ATCACCGCCA CCGCTCTTCT GGTGGGGTTC
CTCACACAGG CCGGTGCAAC GGTCCGGCAT CACCTGCCCC ACCGGATTCT GGAGGGGTAC
GGTCTCAAGC CCCGGCACAT CACCGAGGTT GCCGCCACGC ACAAGGCCTC GCTGCTGATC
ACCGTGGATT GCGGCATCAC CAGCCATGCC GCGGCCCTGG CCTGCCGGGA GGCGGGTATT
GACCTGATCA TCACCGACCA TCACCAGGCA CCGGCCGAAC CGCCTACGGC CCTGGCCGTG
ATCGACCCGG CCCAGGCCGG CTGCCCGTCG GGCCTGGGCG ATCTGGCCGG CGTGGGCGTG
GCCTTCTACC TGCTGATCGC CCTGCGGGCC TTTCTGCGAG AAAAGGGGTT CTGGAAAACC
CGGCCCGAGC CCAACCTCAA GGCCTGCTGT GACCTGGTGG CCATCGGCGC CATCGCCGAT
ATCGTTCCCC TGACCGGCGA AAACCGGATT CTGGTCAGAA CCGGCCTGGA ACTGCTCCGC
ACCCCCTCCC GGCCGGGTAT AGAGGCATTG ATTGCCGGCG CAAAGATCAA AAAACCGGCC
CTTTCCGCCG AAGACGTGGC CTTTCGGCTG GCTCCCCGGC TTAACGCGCC GGGCCGGATC
GACCATGCCG CAACAGCCCT GGAACTGCTG CTGGCCGAAA CAAAGGAGGT CGCCGTAAAA
ATAGCGGGCC GGCTCAACCG GCTCAACACC AAACGGCAGA TGATTGAAGA GGAGATTTCG
GGTGAGATTC AGCAGATTGT CAATGATCGC ATCAACACCA TCAAGGAACG GCGTTCCCTG
GTACTGGCCC ATCGCCAGTG GCACCAGGGT GTGATCGGCA TCGCCGCGTC ACGGGCCGCG
CGGCGCTACA GCCTGCCCGT GGCCCTGATC ACCATTTCCG GAGACATGGG CATCGGATCG
GCCCGCAGCA TCCCCGGCCT TCACCTTTAC GACGCCCTCA AGGGATGCGC CCGGCTCTTC
GAGGACTTCG GGGGCCACGC CCAGGCCGCG GGTTTCCGCA TTCGCATGGA GAACCTGGCC
GCGTTTGAGA CACAGTTTGA GTATATTGTG CGGCAGCAGA CCACACCCGG CGACTTTGTC
CCGGCCGTTG AGGTAGACTG CGAGCTGGCC ATGGACCGCA TCACTAACAC CCTGGTCCGG
GAGATCGAAG GGCTTTCCCC CTTTGGCGCC GGTAACCCGG AGCCGCTGTT TTCAGCCCGC
AATGTACGGG CCAGCGACAC CTTCATGATC AACGGCCGGC ACCGGAAAAT GACCCTGGCC
CACCCCGGAA CGGACAGGGG TATTGCCGCC ATCTGGTTCA ACACCCCGGC ATCACATCAG
GAGTCCTCTT TTTTCAGACG AATCCTCTTC CGGCTGCGGC AAGACACCTG GAACGGCTCC
GGCGCCCCGC AAATCATCAT TGAAGACGCC TTTGCCGAGT AG
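
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the spaces and line breaks used in the 10-base layout above and counts G and C bases. Only the first row of the sequence is used here as sample input, so its result is not expected to match the whole-gene figure; paste the full sequence to compare.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and newlines from the 10-base block layout.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGACCCGGA ATATAAACAC CAGACAACCC GATCCCTCGG CCGTTACCGC CATTGCAAAG"
print(f"{gc_content(first_row):.1f}% GC over the first 60 bases")
```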
 
Protein sequence
MTRNINTRQP DPSAVTAIAK ALPCSDIAAS LLVNRGILSV KQARHFLSAT VADLPSPFSM 
KGIDTAVARI HSALINHETI LVFGDYDADG ITATALLVGF LTQAGATVRH HLPHRILEGY
GLKPRHITEV AATHKASLLI TVDCGITSHA AALACREAGI DLIITDHHQA PAEPPTALAV
IDPAQAGCPS GLGDLAGVGV AFYLLIALRA FLREKGFWKT RPEPNLKACC DLVAIGAIAD
IVPLTGENRI LVRTGLELLR TPSRPGIEAL IAGAKIKKPA LSAEDVAFRL APRLNAPGRI
DHAATALELL LAETKEVAVK IAGRLNRLNT KRQMIEEEIS GEIQQIVNDR INTIKERRSL
VLAHRQWHQG VIGIAASRAA RRYSLPVALI TISGDMGIGS ARSIPGLHLY DALKGCARLF
EDFGGHAQAA GFRIRMENLA AFETQFEYIV RQQTTPGDFV PAVEVDCELA MDRITNTLVR
EIEGLSPFGA GNPEPLFSAR NVRASDTFMI NGRHRKMTLA HPGTDRGIAA IWFNTPASHQ
ESSFFRRILF RLRQDTWNGS GAPQIIIEDA FAE
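
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first 30 bases and on the last row of the gene sequence. It is a simplified, hand-rolled translator: `translate` and the table constants are illustrative names, and the amino-acid assignments are those of the standard code, which table 11 shares (the two differ in which codons may act as starts; this gene begins with ATG, so that difference does not matter here).

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCCGGA ATATAAACAC CAGACAACCC"))  # MTRNINTRQP
# Last row of the gene sequence, which is in frame and ends with TAG.
print(translate("GGCGCCCCGC AAATCATCAT TGAAGACGCC TTTGCCGAGT AG"))  # GAPQIIIEDAFAE
```

Both outputs match the start and end of the protein sequence listed above.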