Gene Dole_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2239 
Symbol 
ID5695087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2712243 
End bp2715383 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content57% 
IMG OID641264845 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001530120 
Protein GI158522250 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCTG CCGGATACAC CGAAGACGCC CTGATCGAAC AGCCGGCCAT CGCCCTGTTT 
AAAGAACTGG GATGGGAAAC GGTCAATGCC TATCACGAGT TTGAACAGGG CGGGAGCACG
CTGGGCCGGG AGACAAAAGG CGAGGTCATC CTTGCTCTCC GGCTGCGGAG TGCCCTGCTG
CGCCTTAACC CGGACGCGCC GACCGAGGCC ATTGACCAGA CCATTGAAGA CCTGACCCGG
GACCGTTCGC GCATGAGCCC GGCGGCCGCC AACCGGGAGG TCTATCATCT GCTTAAACAC
GGCGTGCGCG TTCCCGTGCC CGACCCCGAA GGCGACGGTG AAACGGTAGA AGTGGTGCGA
ATCATCGACT GGGAGAACCC GGGCAAAAAC GACTTTCTGC TCTGCTCGCA GTTCTGGGTG
ACCGGCGAGA TGTATACCCG CCGGGCCGAC CTGGTGGGTT TTGTCAACGG CCTGCCCTTG
CTGTTTATCG AACTTAAAGC AACACACCGC CGCCTGGAAA CCGCCTTTAC CGGCAACCTG
AACGATTATA AAGATACCAT TCCTCAGATG TTCTGGCCGA ATGGCCTGGT CATACTTTCC
AACGGCAGCC AGAGCCGGGT GGGCAGTATC ACCGCCGGAT GGGAACACTT TTCCGAGTGG
AAGAAAATCG GCAGCGAGGA TGAGGCGGGC AAAATCTCGC TGGAGACCAT GCTGCGCGGC
GTGTGCGAAC CGGCCAGGCT GCTGGACCTG GTAGAGAACT TCACCCTGTT TCAGGAAGCG
CCCGGCGGGC TGATCAAGCT GACGGCCAAG AATCATCAGT ACCTTGGGGT CAACAACGCG
CTGGCCGCCC TGACCGACAT TCGGCAACGG GAAGGCAAAC TGGGCGTGTT CTGGCATACC
CAGGGCAGCG GCAAGAGCGT TTCCATGATC TTTTTTGCCC AGAAGGTGAT GCGCAAGCAG
CCGGGCAACT GGACCTTTGT GATTGTCACC GACCGCCAGG AACTGGACGG ACAGATTTAC
AAAAATTTCG CTTCGGCCGG GGTTGTCACC GAAGGCCGCG CCCAGGCCGA AAGCAGCAAG
CACCTGCGGC AGTTGCTGGG TGAGGATCAC CGCTATATTT TTACCCTGAT CCACAAATTC
CGTGCGCCGG AGATGATCTC CACCCGCGAC GACATCATCG TCATCACTGA CGAAGCCCAC
CGCAGCCAGT ATGATACTCT GGCGCTGAAC ATGCGCACGG CTTTGCCGAA CGCGGCTTTT
CTGGCCTTTA CCGGCACGCC GCTGATCGTA GGCGAGGAGA AGACGCGGCA GGTATTCGGT
GATTACATCA GTGTTTATGA TTTCCGTCAG TCGGTGATTG ACGGCGCCAC AGTGCCCCTC
TATTACGAAA ACCGCATTCC TGAACTGCAA CTGGTCAACG ACAACCTCAA CGAGGAGATG
GCGCGCCTGC TGGAAGAGGC GGAACTGGAT GAGGCCCAGG AGAAAAAACT GGAACGGGAA
TTTTCCCGGG AGTACCACCT GATCACCCGC GATGACCGGC TGGAGGCTGT GGCCAAGGAC
ATTGTCGCGC ATTTCACCGG TCGGGGCTTT TCCGGCAAGG GTATGGTGAT CTGCATTGAC
AAGGCCACGG CCATCCGCAT GTACGATAAA GTGAAAAAAC ACTGGGCGGC AAAGATCGCC
ACCCTGGAGG CGGAAAAGGA TACGGCGCCG ATTGAAGCCC TGCCTGAAAC CGAAGATCGC
CTTGCCTGGA TGAAAGAGAC CGATATGGCA GTGGTGGTTT CGCAGGGGCA GAACGAAATC
GCCGAAATGG CGAACAAGGG GCTGGACATC CGCCCGCACC GCAAGCGCGT GGTCGAAGAA
GACCTGGATA CCAAGTTCAA AAAACCGGAC GATCCGTTCC GGCTGGTATT TGTCTGCGCC
ATGTGGATGA CCGGCTTTGA CGTGCCAAGC TGTTCCACCC TCTATCTTGA CAAGCCCATG
CGCAATCACA CGCTGATGCA GACCATTGCC CGAGCCAACC GGGTCTATCC CGGCAAGGTC
AGCGGCCTGA TCGTGGATTA TGCCGGGGTG TTCCGAAACC TTGAAAAGGC GCTGGCCATT
TATGGCGCCG GCGAGGGCGG CGACAAGCCG ATTGAAGATA AGGCCGCGCT GGTGGCGGCG
CTGCGGCAGG TATTGGAAGA AACCGGGGCC TTGTGTCAGG AACAGGGGGT GGGTCTTGCG
GCCATTCGAT CTGCTGATGG GTTTGCCCGC GTGGGCCTGC TTGATGATGC CGTGGAGGCG
CTGGTTGTTT CAGAAGAGAT CAAGCGCCGT TATCTTGACC TGGCCAATGC CGTGCAGCGG
CTGTATAAGG CTGTGCTGCC CGACCCGGCC GCGCATGCGT TTGTCGCCGA GGTGACGCCC
GTCAAGGTGA TTGCGGATAA AATCCGCGTT TTGGCACCGC CGGCCGACAT CTCCCAGGTC
ATGCAACAGG TGGAGGAACT GCTTGACCGC TCCATTGCCA CTGAGGGTTA CGTCTTTCGC
GAGATTGTTC CGCCTTTTGG CCACGAGCAC CAGATTGACT TGAGTGATAT TGACTTTGAA
AAGCTGGCTG AGAAATTTAA GACCGGCCGC AAGCGCACCA TCAATGAAAA GCTCAAAGGG
GCTGTGGCGA GGAAACTCAT GATCATGGTG CGGCTCAACC GCACTCGCAT GGATTATCTG
GAAAAGTTCC AGGCGATGAT CGACGCCTAC AACGCCGGCA GCCTCAATGC CGAGGCGTTC
TTTGAACAAC TGATGGCTTT TGCCGGGAGC CTGAATGAAG AAGAAAAACG CGGCGTAAGT
GAACAATTAA ATGAGGAAGA GCTGGTACTG TTTGACCTTT TAACCAAACC ACGGATTGAA
ATGAGCGAGG CGGATCGCAA CAAGGTGAAA GTCACTGCCC GGGAACTGCT GGCCACACTC
AAAACTGGCA AACTGGTGCT GGACTGGCGC AAGCGGCAAC AGGCGCGGGC TGAAGTCCGA
GTCCCCATTG AAAAGGTGCT TGATAAGGGG CTGCCAAGGG CCTACACACC GGAACTGTTT
GAGCAAAAAA CAGCGGCTGT TTTCCAGCAT GTCTATGAGG CTTACTGTGG TGCGGGACAA
AGTATTTACG CGGCGGCATA A
 
Protein sequence
MTPAGYTEDA LIEQPAIALF KELGWETVNA YHEFEQGGST LGRETKGEVI LALRLRSALL 
RLNPDAPTEA IDQTIEDLTR DRSRMSPAAA NREVYHLLKH GVRVPVPDPE GDGETVEVVR
IIDWENPGKN DFLLCSQFWV TGEMYTRRAD LVGFVNGLPL LFIELKATHR RLETAFTGNL
NDYKDTIPQM FWPNGLVILS NGSQSRVGSI TAGWEHFSEW KKIGSEDEAG KISLETMLRG
VCEPARLLDL VENFTLFQEA PGGLIKLTAK NHQYLGVNNA LAALTDIRQR EGKLGVFWHT
QGSGKSVSMI FFAQKVMRKQ PGNWTFVIVT DRQELDGQIY KNFASAGVVT EGRAQAESSK
HLRQLLGEDH RYIFTLIHKF RAPEMISTRD DIIVITDEAH RSQYDTLALN MRTALPNAAF
LAFTGTPLIV GEEKTRQVFG DYISVYDFRQ SVIDGATVPL YYENRIPELQ LVNDNLNEEM
ARLLEEAELD EAQEKKLERE FSREYHLITR DDRLEAVAKD IVAHFTGRGF SGKGMVICID
KATAIRMYDK VKKHWAAKIA TLEAEKDTAP IEALPETEDR LAWMKETDMA VVVSQGQNEI
AEMANKGLDI RPHRKRVVEE DLDTKFKKPD DPFRLVFVCA MWMTGFDVPS CSTLYLDKPM
RNHTLMQTIA RANRVYPGKV SGLIVDYAGV FRNLEKALAI YGAGEGGDKP IEDKAALVAA
LRQVLEETGA LCQEQGVGLA AIRSADGFAR VGLLDDAVEA LVVSEEIKRR YLDLANAVQR
LYKAVLPDPA AHAFVAEVTP VKVIADKIRV LAPPADISQV MQQVEELLDR SIATEGYVFR
EIVPPFGHEH QIDLSDIDFE KLAEKFKTGR KRTINEKLKG AVARKLMIMV RLNRTRMDYL
EKFQAMIDAY NAGSLNAEAF FEQLMAFAGS LNEEEKRGVS EQLNEEELVL FDLLTKPRIE
MSEADRNKVK VTARELLATL KTGKLVLDWR KRQQARAEVR VPIEKVLDKG LPRAYTPELF
EQKTAAVFQH VYEAYCGAGQ SIYAAA