Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_0245 |
Symbol | |
ID | 5693063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 274010 |
End bp | 277126 |
Gene Length | 3117 bp |
Protein Length | 1038 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641262825 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001528132 |
Protein GI | 158520262 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAA AACCCCGCCC TGAGCGGATT ACCCAGAATC GCGTGATTGC CCTGTTTACC GACAAATCCC GGCCGGATTG TCTTGGGTAT CAATACCTGG GTGACTGGAG CAAGCGCGAC AACAACCGGC CCGTTGAGGC GGAGTATCTC CGGGCGAACC TGAAAAAGCG CGGCTATTCG GACGCCCATA TTTCAGCCGC TCTGCTGCAA CTGGAAATCG CCGCCGGCAC CACCGGGGTG ACGCTTTACC AGGCCAATCT GCGCACCTAC AATCTGCTGC GCTACCCGGT GAAGGTGCTG CTGGCGCCGG GCCAGCCGCA CGAGGATGTG CATTTGATCG ACTGGGAACA TCCTGAAAAC AATGATTTTG CCCTGGCCGA GGAAGTGACC CTTAAAGGTG GCTACGAGCG GCGGCCCGAC CTGGTATTGT ACATCAACGG CATGGCCATT GGCGTGATTG AACTCAAGCG CAGCTCGGTG GAGGTGGCCG ACGGGGTTCG CCAGCTCATC ACCAACCAGG AAAAGATATT CAACGAGGGT TTCTTTTCCA CGGTGCAGCT TGTGTTTGCG GGCAGCGATT CTCAAGGCTT GCGTTACGGC ACCACCGGCA CGCCGGAGCA GTTCTTTGTT CAATGGAAAG ACGAGGAAGG CGACAGTGCG CTTGCGCCCG GCACCCTGCT TGACAAGCCC CTGGCCCAGA TGTGCAACAA AAAGCGGTTG CTGGACCTGA TTCGCAACTG CATTATTTTC GATGCCGGGC AGAAAAAGGT GCCCCGGCCC CATCAGTATT TCGGTTTTAA GGCTGCCCAG GAGAGAATCC GCCGGCGCGA GGGCGGCGTT ATCTGGCACA CCCAGGGCAG CGGCAAAAGC ATCCTCATGG TGTTGATCGC CAAATGGCTG ATGGAACACG ACCCTGACGC GCGTATTCTG GTCATCACGG ACCGGGACGA ACTGGACAAA CAGATTGTGG ATGTAATGCG AAACGCCGGG GTGGTCGGCG AGGACGCACC TTCGCCGCGC ATTACCTCGC GGGCGCAGTT TGTCGAAAAG ATCGGTGCGA CAACGCCGCG CCTGCTCTGC GCTCTGATTC ACAAGTTCGA GACGACCGAC CTGAAAGGCA ACCCGCCGCC GATTCACGGG CGCTTCTACA TTTTTGTGGA TGAATGTCAC CGCACCCAGG GCGGGGACAT GAACCGGCAG ATGAAACGGT GGATGCAGGA CGCGATTTTT ATCGGCTTTA CCGGCACCCC GCTTTTGCGC CGGGACAAAT TGATGACCCG GGATGTTTTT GGAACCTACA TTCACACCTA TAAATTCCAC CAGGGCGTGG CCGACAAGGT CATTCTGGAC CTCAAGTATG AGGCCCGCAA TGTGCCGCAG CGCCTCACTT CCCAAAAAGC CATTGACGCC TGGTTTGAGC AGAAGACAAA AAACCTGAAC AATTTTCAGA AGGCCATTGT AAGAAAAACC TGGGCCACCA TGGAAAAGCT GATGAGCGCG GGCGAGCGCA AGCAACGCAT TATTGCCGAT ATCATCCAGG ATTTCAGCCT TAAACCGCGC CTGAACAATG ATCGCGGCAC CGCCATTCTG GTGACGGCCT CGATTTACGA CGCCTGTCAC TATTTCCGGC TGTTTCAAAA CACCGGCTTT GGCGCGTACT GCGGCATCAT CACCTCTTAC GAGCCGAACG CCAACGCCAT TTCCAAAGAA CCGGCCAACA GCAATGAGCG CTACAAGTTT GACACCTATA AGCAGTGTGT TTTGGATGCG TTTACCACAA CGGAAAAATA CGAGGCGGAA ACCAGGCGGC GCTTTATTGA AGAGCCGGCC AACTGCAAGC TGCTGATCGT GGTCAGCAAA CTGCTGACCG GGTTTGACGC CCCCTCCTGC ACCTATATCT ATCTTGATAA TGAGATGCAT GACCACACCC TGTTTCAGGC TATCTGCCGC ACCAACCGCC TGGATGGTGA CGACAAGGAG TATGGCCACA TTGTCGACTT CAAGGAGCTG TTCGGCGATG TGCAGCAGGC CATTGCCGTT TACAATTCCG ATGAGCTGGA TATAGACGAG GGCGGCGGGG GGGAGAACAA CATCCACCTG AAGGACTGGC TGAAAGAGGG CAAAAAGAAA CTTGATGACA CCCGCGAGGC CCTGAAGTAT CTCTGCGCGC CGGTTCCCGA GCCGCGTGAA ATGGAGCAGT ATCTTTTCTA TTTCTGCGGC AACGCCGATA ATCCGAACGC GCTTACCGAC ACAGAGGCCC TGCGCGTATC GTTCTACAAG TCAGTGGCCA CGTTTGTGCG GGCGTTTGCC GCTGTTTCCC AATACCTGGC CGAGGCGGGG TATTCGGCCG CTGAAATTGC CACGCTGAAC AACGAGGTCA AATTCTTCAG TGATACGCGG GCGGCCATTA AAAAACATTC CGGCGAAGAG TTGGACATCA AGCCCTATGA AGCGGACATG CGTCATCTGC TCAACACCTA TATTCAGGCT GATCCGGCCG ATCCACTGGG CGAAATGGAC CGGTACTCGC TGGTGGAACT TATCATCAAA ACCGGCATTC ATGACGCCAT TGCAAAAAAA CTCAACGAAA AGGGAAAGCT GTCAAAGACC GCGGTTGCCG AGGGCATTAT CCACAACATC CGCAAGACCA TCATTCGGGA GCAGTTGGCC GATCCCCGAT TCTACGAAGA GATGTCCAAG CTGTTGGATG ATCTGATCAA ACAGCGGCAG GATGATACAA AGTCCTATGA AGAATTTCTC AAACAGGCCG AGGCCCTGGT TAAAAAAATG TCCAAGGGGC AGCCTGGGAA AAATATTCCC CAGGAACTTC ACGGAAGGCC CGAAGCAACC GTGATTTACA ATAACCTGCC GGATATTCTG ATGCACCTTG CGTCCGAAGA CATGGTTCAG GATCCGGTAA CCGGCTTCGG CGCTACCCGG CTCGGGCTTG CCCTGGAAAT CGACCGCGCC ATGCGGGAAC ACGCACCGGC CGGATGGAAG GGGGATGACA CCCGGGAAAA ACAGGTTTTA AATGCGTTGT TTCCCCTTTT AAAACGGAAT CGCAAGGCAA CGATGGCGGC GTTTGAACTC ATAAAAAACA TGTCGGGTTA CGCATGA
|
Protein sequence | MPEKPRPERI TQNRVIALFT DKSRPDCLGY QYLGDWSKRD NNRPVEAEYL RANLKKRGYS DAHISAALLQ LEIAAGTTGV TLYQANLRTY NLLRYPVKVL LAPGQPHEDV HLIDWEHPEN NDFALAEEVT LKGGYERRPD LVLYINGMAI GVIELKRSSV EVADGVRQLI TNQEKIFNEG FFSTVQLVFA GSDSQGLRYG TTGTPEQFFV QWKDEEGDSA LAPGTLLDKP LAQMCNKKRL LDLIRNCIIF DAGQKKVPRP HQYFGFKAAQ ERIRRREGGV IWHTQGSGKS ILMVLIAKWL MEHDPDARIL VITDRDELDK QIVDVMRNAG VVGEDAPSPR ITSRAQFVEK IGATTPRLLC ALIHKFETTD LKGNPPPIHG RFYIFVDECH RTQGGDMNRQ MKRWMQDAIF IGFTGTPLLR RDKLMTRDVF GTYIHTYKFH QGVADKVILD LKYEARNVPQ RLTSQKAIDA WFEQKTKNLN NFQKAIVRKT WATMEKLMSA GERKQRIIAD IIQDFSLKPR LNNDRGTAIL VTASIYDACH YFRLFQNTGF GAYCGIITSY EPNANAISKE PANSNERYKF DTYKQCVLDA FTTTEKYEAE TRRRFIEEPA NCKLLIVVSK LLTGFDAPSC TYIYLDNEMH DHTLFQAICR TNRLDGDDKE YGHIVDFKEL FGDVQQAIAV YNSDELDIDE GGGGENNIHL KDWLKEGKKK LDDTREALKY LCAPVPEPRE MEQYLFYFCG NADNPNALTD TEALRVSFYK SVATFVRAFA AVSQYLAEAG YSAAEIATLN NEVKFFSDTR AAIKKHSGEE LDIKPYEADM RHLLNTYIQA DPADPLGEMD RYSLVELIIK TGIHDAIAKK LNEKGKLSKT AVAEGIIHNI RKTIIREQLA DPRFYEEMSK LLDDLIKQRQ DDTKSYEEFL KQAEALVKKM SKGQPGKNIP QELHGRPEAT VIYNNLPDIL MHLASEDMVQ DPVTGFGATR LGLALEIDRA MREHAPAGWK GDDTREKQVL NALFPLLKRN RKATMAAFEL IKNMSGYA
|
| |