Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2239 |
Symbol | |
ID | 5695087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2712243 |
End bp | 2715383 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641264845 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001530120 |
Protein GI | 158522250 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCTG CCGGATACAC CGAAGACGCC CTGATCGAAC AGCCGGCCAT CGCCCTGTTT AAAGAACTGG GATGGGAAAC GGTCAATGCC TATCACGAGT TTGAACAGGG CGGGAGCACG CTGGGCCGGG AGACAAAAGG CGAGGTCATC CTTGCTCTCC GGCTGCGGAG TGCCCTGCTG CGCCTTAACC CGGACGCGCC GACCGAGGCC ATTGACCAGA CCATTGAAGA CCTGACCCGG GACCGTTCGC GCATGAGCCC GGCGGCCGCC AACCGGGAGG TCTATCATCT GCTTAAACAC GGCGTGCGCG TTCCCGTGCC CGACCCCGAA GGCGACGGTG AAACGGTAGA AGTGGTGCGA ATCATCGACT GGGAGAACCC GGGCAAAAAC GACTTTCTGC TCTGCTCGCA GTTCTGGGTG ACCGGCGAGA TGTATACCCG CCGGGCCGAC CTGGTGGGTT TTGTCAACGG CCTGCCCTTG CTGTTTATCG AACTTAAAGC AACACACCGC CGCCTGGAAA CCGCCTTTAC CGGCAACCTG AACGATTATA AAGATACCAT TCCTCAGATG TTCTGGCCGA ATGGCCTGGT CATACTTTCC AACGGCAGCC AGAGCCGGGT GGGCAGTATC ACCGCCGGAT GGGAACACTT TTCCGAGTGG AAGAAAATCG GCAGCGAGGA TGAGGCGGGC AAAATCTCGC TGGAGACCAT GCTGCGCGGC GTGTGCGAAC CGGCCAGGCT GCTGGACCTG GTAGAGAACT TCACCCTGTT TCAGGAAGCG CCCGGCGGGC TGATCAAGCT GACGGCCAAG AATCATCAGT ACCTTGGGGT CAACAACGCG CTGGCCGCCC TGACCGACAT TCGGCAACGG GAAGGCAAAC TGGGCGTGTT CTGGCATACC CAGGGCAGCG GCAAGAGCGT TTCCATGATC TTTTTTGCCC AGAAGGTGAT GCGCAAGCAG CCGGGCAACT GGACCTTTGT GATTGTCACC GACCGCCAGG AACTGGACGG ACAGATTTAC AAAAATTTCG CTTCGGCCGG GGTTGTCACC GAAGGCCGCG CCCAGGCCGA AAGCAGCAAG CACCTGCGGC AGTTGCTGGG TGAGGATCAC CGCTATATTT TTACCCTGAT CCACAAATTC CGTGCGCCGG AGATGATCTC CACCCGCGAC GACATCATCG TCATCACTGA CGAAGCCCAC CGCAGCCAGT ATGATACTCT GGCGCTGAAC ATGCGCACGG CTTTGCCGAA CGCGGCTTTT CTGGCCTTTA CCGGCACGCC GCTGATCGTA GGCGAGGAGA AGACGCGGCA GGTATTCGGT GATTACATCA GTGTTTATGA TTTCCGTCAG TCGGTGATTG ACGGCGCCAC AGTGCCCCTC TATTACGAAA ACCGCATTCC TGAACTGCAA CTGGTCAACG ACAACCTCAA CGAGGAGATG GCGCGCCTGC TGGAAGAGGC GGAACTGGAT GAGGCCCAGG AGAAAAAACT GGAACGGGAA TTTTCCCGGG AGTACCACCT GATCACCCGC GATGACCGGC TGGAGGCTGT GGCCAAGGAC ATTGTCGCGC ATTTCACCGG TCGGGGCTTT TCCGGCAAGG GTATGGTGAT CTGCATTGAC AAGGCCACGG CCATCCGCAT GTACGATAAA GTGAAAAAAC ACTGGGCGGC AAAGATCGCC ACCCTGGAGG CGGAAAAGGA TACGGCGCCG ATTGAAGCCC TGCCTGAAAC CGAAGATCGC CTTGCCTGGA TGAAAGAGAC CGATATGGCA GTGGTGGTTT CGCAGGGGCA GAACGAAATC GCCGAAATGG CGAACAAGGG GCTGGACATC CGCCCGCACC GCAAGCGCGT GGTCGAAGAA GACCTGGATA CCAAGTTCAA AAAACCGGAC GATCCGTTCC GGCTGGTATT TGTCTGCGCC ATGTGGATGA CCGGCTTTGA CGTGCCAAGC TGTTCCACCC TCTATCTTGA CAAGCCCATG CGCAATCACA CGCTGATGCA GACCATTGCC CGAGCCAACC GGGTCTATCC CGGCAAGGTC AGCGGCCTGA TCGTGGATTA TGCCGGGGTG TTCCGAAACC TTGAAAAGGC GCTGGCCATT TATGGCGCCG GCGAGGGCGG CGACAAGCCG ATTGAAGATA AGGCCGCGCT GGTGGCGGCG CTGCGGCAGG TATTGGAAGA AACCGGGGCC TTGTGTCAGG AACAGGGGGT GGGTCTTGCG GCCATTCGAT CTGCTGATGG GTTTGCCCGC GTGGGCCTGC TTGATGATGC CGTGGAGGCG CTGGTTGTTT CAGAAGAGAT CAAGCGCCGT TATCTTGACC TGGCCAATGC CGTGCAGCGG CTGTATAAGG CTGTGCTGCC CGACCCGGCC GCGCATGCGT TTGTCGCCGA GGTGACGCCC GTCAAGGTGA TTGCGGATAA AATCCGCGTT TTGGCACCGC CGGCCGACAT CTCCCAGGTC ATGCAACAGG TGGAGGAACT GCTTGACCGC TCCATTGCCA CTGAGGGTTA CGTCTTTCGC GAGATTGTTC CGCCTTTTGG CCACGAGCAC CAGATTGACT TGAGTGATAT TGACTTTGAA AAGCTGGCTG AGAAATTTAA GACCGGCCGC AAGCGCACCA TCAATGAAAA GCTCAAAGGG GCTGTGGCGA GGAAACTCAT GATCATGGTG CGGCTCAACC GCACTCGCAT GGATTATCTG GAAAAGTTCC AGGCGATGAT CGACGCCTAC AACGCCGGCA GCCTCAATGC CGAGGCGTTC TTTGAACAAC TGATGGCTTT TGCCGGGAGC CTGAATGAAG AAGAAAAACG CGGCGTAAGT GAACAATTAA ATGAGGAAGA GCTGGTACTG TTTGACCTTT TAACCAAACC ACGGATTGAA ATGAGCGAGG CGGATCGCAA CAAGGTGAAA GTCACTGCCC GGGAACTGCT GGCCACACTC AAAACTGGCA AACTGGTGCT GGACTGGCGC AAGCGGCAAC AGGCGCGGGC TGAAGTCCGA GTCCCCATTG AAAAGGTGCT TGATAAGGGG CTGCCAAGGG CCTACACACC GGAACTGTTT GAGCAAAAAA CAGCGGCTGT TTTCCAGCAT GTCTATGAGG CTTACTGTGG TGCGGGACAA AGTATTTACG CGGCGGCATA A
|
Protein sequence | MTPAGYTEDA LIEQPAIALF KELGWETVNA YHEFEQGGST LGRETKGEVI LALRLRSALL RLNPDAPTEA IDQTIEDLTR DRSRMSPAAA NREVYHLLKH GVRVPVPDPE GDGETVEVVR IIDWENPGKN DFLLCSQFWV TGEMYTRRAD LVGFVNGLPL LFIELKATHR RLETAFTGNL NDYKDTIPQM FWPNGLVILS NGSQSRVGSI TAGWEHFSEW KKIGSEDEAG KISLETMLRG VCEPARLLDL VENFTLFQEA PGGLIKLTAK NHQYLGVNNA LAALTDIRQR EGKLGVFWHT QGSGKSVSMI FFAQKVMRKQ PGNWTFVIVT DRQELDGQIY KNFASAGVVT EGRAQAESSK HLRQLLGEDH RYIFTLIHKF RAPEMISTRD DIIVITDEAH RSQYDTLALN MRTALPNAAF LAFTGTPLIV GEEKTRQVFG DYISVYDFRQ SVIDGATVPL YYENRIPELQ LVNDNLNEEM ARLLEEAELD EAQEKKLERE FSREYHLITR DDRLEAVAKD IVAHFTGRGF SGKGMVICID KATAIRMYDK VKKHWAAKIA TLEAEKDTAP IEALPETEDR LAWMKETDMA VVVSQGQNEI AEMANKGLDI RPHRKRVVEE DLDTKFKKPD DPFRLVFVCA MWMTGFDVPS CSTLYLDKPM RNHTLMQTIA RANRVYPGKV SGLIVDYAGV FRNLEKALAI YGAGEGGDKP IEDKAALVAA LRQVLEETGA LCQEQGVGLA AIRSADGFAR VGLLDDAVEA LVVSEEIKRR YLDLANAVQR LYKAVLPDPA AHAFVAEVTP VKVIADKIRV LAPPADISQV MQQVEELLDR SIATEGYVFR EIVPPFGHEH QIDLSDIDFE KLAEKFKTGR KRTINEKLKG AVARKLMIMV RLNRTRMDYL EKFQAMIDAY NAGSLNAEAF FEQLMAFAGS LNEEEKRGVS EQLNEEELVL FDLLTKPRIE MSEADRNKVK VTARELLATL KTGKLVLDWR KRQQARAEVR VPIEKVLDKG LPRAYTPELF EQKTAAVFQH VYEAYCGAGQ SIYAAA
|
| |