Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2724 |
Symbol | |
ID | 5695579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3284305 |
End bp | 3287115 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641265336 |
Product | type III restriction protein res subunit |
Protein accession | YP_001530604 |
Protein GI | 158522734 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAAAA CAGAGGCCCA AACCCGATCC GAACTTATCG ATAAGCATCT GGCGCAATCG GGCTGGAATG TCAAAGACCC TATGCAGGTG GTTGAAGAGT TCGATATTTT GATGGCCCTC CCCGAAGGTA TTGCCGAACC GCGCACCCCA TACGAAGGGC ATCAGTTCAG TGATTACGTG TTGCTTGGGA AAGATGGCAG ACCGCTTGCC GTCGTAGAAG CTAAGAAGTC CAGCAAAGAT GCCGCCATCG GCCGGGAACA AGCCAAACAG TACTGTTATA ACATCCAAAA GCAGCTCGGT GGAGAATTGC CATTCTGCTT TTATACCAAT GGCCTTGAAA CCTATTTCTG GGACTTGGAC AATTATCCGC CGCGCAAGGT CGTGGGCTTC CCCACTCGTG ATGACCTGGA GCGGTTCCAA TATATCCGCC GCAACCACAA GCCACTGACT CAGGAGCTGA TCAACACCGC CATCGCCGGA CGTGATTACC AGATCCGTGC CATCCGAGCA GTATTGGAAG GAATCGAGCA AAAGAAACGC GACTTCCTGC TGGTGATGGC TACCGGCACC GGCAAAACGC GCACCAGCAT CGCCATGGTC GACGCCCTGA TGCGGGCAGG CCATGCCGAA AAAGCGCTGT TTCTAGTCGA CCGCATTGCC CTTCGGGAAC AGGCACTGGC CGCTTTTAAG GAACACCTTC CCCACGAGCC CCGCTGGCCC AACGTCGGCG AAAAGCTGAT CGCCAAGGAC CGCCGCATAT ATATCTCAAC CTATCCCACC ATGCTCAACA TCATCCGGGA TGAGTCGCAG TATCTTTCGC CGCATTTCTT TGATTTTATC GTCATCGATG AAAGTCATCG TTCCATTTAT AACACTTACG GCGAAATTCT CGACTACTTC AAAACCATCA CCCTGGGATT GACGGCAACA CCCACCGACA TCATCGACCA TAACACTTTC CGGATTTTTC ACTGTGAAGA CGGTCTTCCC ACCTTTGCGT ATACTTTTGA GGAAGCTGCC AACAACGTGC CGCCGTACCT TTGTAGTTTT CAGGTGATGA AGATTCAGAC CAAGTTCCAG AAAGAAGGGA TCAGCAAGCG CACCATCTCG TTGGAGGATC AGAAAAAACT ACTGCTGGAA GGTAAGGATG TTGCAGAGAT TAACTTTGAA GGCACACAAC TTGAAAAGAC AGTCATCAAC AAGGGCACCA ATACGCTGAT TGTCAAGGAG TTCATGGAAG AGTGCATCAA GGATCACAAT GGTGTTATGC CCGGCAAGAC CATCTTTTTC TGTTCAACCA TAGCCCATGC CCGGCGTATG GAGGATATTT TCGACAAACT TTATCCCCAG CACAAAGGCG AACTGGCCAA AGTTTTGGTT TCCGAAGACC CGCGTGTTTA CGGCAAGGGA GGGCTGCTTG ACCAGTTTAC CAATAGCGAT ATGCCCCGTG TCGCCATCAG CGTTGACATG CTTGATACCG GCATTGATGT ACGCGAAATT GTTAACCTGG TCTTTGCCAA ACCGGTCTAT TCTTATACCA AGTTCTGGCA GATGATCGGG CGCGGCACCC GTCTGCTGGA AATCGCCAAG CCCAAACCCT GGTGCATTGA AAAAGATGTT TTCCTGATTC TCGATTGCTG GGACAATTTT GAATATTTCA AGCTCCAACC AAAAGGCAAG GAGCTTAAGC AGCAACTGCC CCTTCCGGTG CGTCTGGTGG GCTTGCGTCT CGACAAGATC GAAAAGGCCA CCGACACTGC TCAAACAACG ATCACCGAGC GCGAAATCGG AAAATTTCGC AAGCAGATAA GCGAGTTGCC GCAAACTTCC GTGGTTATCA AGGAAGCTGC CGCCGCGCTG GCCCGGCTTG AAGAAGAAAA TTTCTGGATC ACTCTCAACC ATCAGAAGCT GGAATTCCTG CGTGCCGAGA TCAAGCCCCT GTTCCGGACC GTGTCCGAGG CAGACTTTAA GGCCATGCGT TTTGAGCGCG ATCTGCTGGA ATATTCCCTG GCCCGATTGC GCCAAGAGAA AGAAAAGGCT GAAACCCTCA AGGCTGGAAT CGTCGAGCAA ATAAGCGACC TGCCGTTATC CGTCAATTTT GTCAAAGCCG AGGAAACACT GATTCGTGCT TCCCAGACCA ATCACTACTG GGCCAAGCAA GACTCCATTG AAACAGAGAA CGCTCTGGAT GAGCTGAACA CCCGCCTTGG CCCCCTGATG AAATTTCGCG AGCAGGACAC CGGCCCCGGC CCCATGAATC TGGACCTGAC CGATACCTTG CACCATAAAG AGTGGGTAGA GTTCGGTCCG CAACACGAGG CGGTAAGCAT CAGTCGCTAC CGTGAGATGG TCGAGGCGCT GATCGCCGAG CTGACCGAAC ATAACCCCGT GCTGTTGAAG ATAAAGAACG GCGAAGCGGT GACGCCGGAT GAAGCCAATG CCCTGGCCGA ACTACTCCAT ACCGAGCATC CGCACATTAC CGAGGATTTA CTGCGTCAGG CCTACAAGAA CCGCAAGGCT CATTTTATTC AGTTTATCCG TCACATCCTC GGCATCGAAA TTTTGAAGAC CTTTCCTGAA ACGGTCAGCG AAGCGTTTGA GCAGTTTATC CAGCAGCACA GCAGCCTCAG CAGCCGGCAG CTGGAGTTTT TGAACCTGCT CAAAAATTTT ATCATCGAAC GCGAAAAGGT GGAAAAGAAA GACCTGATAA ACTCCCCCTT TACGGTCATT CACCCGCAAG GAATTCGCGG CGTTTTCAGC CCGGCGGAAA TCAACGAAAT ATTACAACTT ACCGAAAGGG TGGCAGCCTG A
|
Protein sequence | MTKTEAQTRS ELIDKHLAQS GWNVKDPMQV VEEFDILMAL PEGIAEPRTP YEGHQFSDYV LLGKDGRPLA VVEAKKSSKD AAIGREQAKQ YCYNIQKQLG GELPFCFYTN GLETYFWDLD NYPPRKVVGF PTRDDLERFQ YIRRNHKPLT QELINTAIAG RDYQIRAIRA VLEGIEQKKR DFLLVMATGT GKTRTSIAMV DALMRAGHAE KALFLVDRIA LREQALAAFK EHLPHEPRWP NVGEKLIAKD RRIYISTYPT MLNIIRDESQ YLSPHFFDFI VIDESHRSIY NTYGEILDYF KTITLGLTAT PTDIIDHNTF RIFHCEDGLP TFAYTFEEAA NNVPPYLCSF QVMKIQTKFQ KEGISKRTIS LEDQKKLLLE GKDVAEINFE GTQLEKTVIN KGTNTLIVKE FMEECIKDHN GVMPGKTIFF CSTIAHARRM EDIFDKLYPQ HKGELAKVLV SEDPRVYGKG GLLDQFTNSD MPRVAISVDM LDTGIDVREI VNLVFAKPVY SYTKFWQMIG RGTRLLEIAK PKPWCIEKDV FLILDCWDNF EYFKLQPKGK ELKQQLPLPV RLVGLRLDKI EKATDTAQTT ITEREIGKFR KQISELPQTS VVIKEAAAAL ARLEEENFWI TLNHQKLEFL RAEIKPLFRT VSEADFKAMR FERDLLEYSL ARLRQEKEKA ETLKAGIVEQ ISDLPLSVNF VKAEETLIRA SQTNHYWAKQ DSIETENALD ELNTRLGPLM KFREQDTGPG PMNLDLTDTL HHKEWVEFGP QHEAVSISRY REMVEALIAE LTEHNPVLLK IKNGEAVTPD EANALAELLH TEHPHITEDL LRQAYKNRKA HFIQFIRHIL GIEILKTFPE TVSEAFEQFI QQHSSLSSRQ LEFLNLLKNF IIEREKVEKK DLINSPFTVI HPQGIRGVFS PAEINEILQL TERVAA
|
| |