Gene Dole_2724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2724 
Symbol 
ID5695579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3284305 
End bp3287115 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content52% 
IMG OID641265336 
Producttype III restriction protein res subunit 
Protein accessionYP_001530604 
Protein GI158522734 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAA CAGAGGCCCA AACCCGATCC GAACTTATCG ATAAGCATCT GGCGCAATCG 
GGCTGGAATG TCAAAGACCC TATGCAGGTG GTTGAAGAGT TCGATATTTT GATGGCCCTC
CCCGAAGGTA TTGCCGAACC GCGCACCCCA TACGAAGGGC ATCAGTTCAG TGATTACGTG
TTGCTTGGGA AAGATGGCAG ACCGCTTGCC GTCGTAGAAG CTAAGAAGTC CAGCAAAGAT
GCCGCCATCG GCCGGGAACA AGCCAAACAG TACTGTTATA ACATCCAAAA GCAGCTCGGT
GGAGAATTGC CATTCTGCTT TTATACCAAT GGCCTTGAAA CCTATTTCTG GGACTTGGAC
AATTATCCGC CGCGCAAGGT CGTGGGCTTC CCCACTCGTG ATGACCTGGA GCGGTTCCAA
TATATCCGCC GCAACCACAA GCCACTGACT CAGGAGCTGA TCAACACCGC CATCGCCGGA
CGTGATTACC AGATCCGTGC CATCCGAGCA GTATTGGAAG GAATCGAGCA AAAGAAACGC
GACTTCCTGC TGGTGATGGC TACCGGCACC GGCAAAACGC GCACCAGCAT CGCCATGGTC
GACGCCCTGA TGCGGGCAGG CCATGCCGAA AAAGCGCTGT TTCTAGTCGA CCGCATTGCC
CTTCGGGAAC AGGCACTGGC CGCTTTTAAG GAACACCTTC CCCACGAGCC CCGCTGGCCC
AACGTCGGCG AAAAGCTGAT CGCCAAGGAC CGCCGCATAT ATATCTCAAC CTATCCCACC
ATGCTCAACA TCATCCGGGA TGAGTCGCAG TATCTTTCGC CGCATTTCTT TGATTTTATC
GTCATCGATG AAAGTCATCG TTCCATTTAT AACACTTACG GCGAAATTCT CGACTACTTC
AAAACCATCA CCCTGGGATT GACGGCAACA CCCACCGACA TCATCGACCA TAACACTTTC
CGGATTTTTC ACTGTGAAGA CGGTCTTCCC ACCTTTGCGT ATACTTTTGA GGAAGCTGCC
AACAACGTGC CGCCGTACCT TTGTAGTTTT CAGGTGATGA AGATTCAGAC CAAGTTCCAG
AAAGAAGGGA TCAGCAAGCG CACCATCTCG TTGGAGGATC AGAAAAAACT ACTGCTGGAA
GGTAAGGATG TTGCAGAGAT TAACTTTGAA GGCACACAAC TTGAAAAGAC AGTCATCAAC
AAGGGCACCA ATACGCTGAT TGTCAAGGAG TTCATGGAAG AGTGCATCAA GGATCACAAT
GGTGTTATGC CCGGCAAGAC CATCTTTTTC TGTTCAACCA TAGCCCATGC CCGGCGTATG
GAGGATATTT TCGACAAACT TTATCCCCAG CACAAAGGCG AACTGGCCAA AGTTTTGGTT
TCCGAAGACC CGCGTGTTTA CGGCAAGGGA GGGCTGCTTG ACCAGTTTAC CAATAGCGAT
ATGCCCCGTG TCGCCATCAG CGTTGACATG CTTGATACCG GCATTGATGT ACGCGAAATT
GTTAACCTGG TCTTTGCCAA ACCGGTCTAT TCTTATACCA AGTTCTGGCA GATGATCGGG
CGCGGCACCC GTCTGCTGGA AATCGCCAAG CCCAAACCCT GGTGCATTGA AAAAGATGTT
TTCCTGATTC TCGATTGCTG GGACAATTTT GAATATTTCA AGCTCCAACC AAAAGGCAAG
GAGCTTAAGC AGCAACTGCC CCTTCCGGTG CGTCTGGTGG GCTTGCGTCT CGACAAGATC
GAAAAGGCCA CCGACACTGC TCAAACAACG ATCACCGAGC GCGAAATCGG AAAATTTCGC
AAGCAGATAA GCGAGTTGCC GCAAACTTCC GTGGTTATCA AGGAAGCTGC CGCCGCGCTG
GCCCGGCTTG AAGAAGAAAA TTTCTGGATC ACTCTCAACC ATCAGAAGCT GGAATTCCTG
CGTGCCGAGA TCAAGCCCCT GTTCCGGACC GTGTCCGAGG CAGACTTTAA GGCCATGCGT
TTTGAGCGCG ATCTGCTGGA ATATTCCCTG GCCCGATTGC GCCAAGAGAA AGAAAAGGCT
GAAACCCTCA AGGCTGGAAT CGTCGAGCAA ATAAGCGACC TGCCGTTATC CGTCAATTTT
GTCAAAGCCG AGGAAACACT GATTCGTGCT TCCCAGACCA ATCACTACTG GGCCAAGCAA
GACTCCATTG AAACAGAGAA CGCTCTGGAT GAGCTGAACA CCCGCCTTGG CCCCCTGATG
AAATTTCGCG AGCAGGACAC CGGCCCCGGC CCCATGAATC TGGACCTGAC CGATACCTTG
CACCATAAAG AGTGGGTAGA GTTCGGTCCG CAACACGAGG CGGTAAGCAT CAGTCGCTAC
CGTGAGATGG TCGAGGCGCT GATCGCCGAG CTGACCGAAC ATAACCCCGT GCTGTTGAAG
ATAAAGAACG GCGAAGCGGT GACGCCGGAT GAAGCCAATG CCCTGGCCGA ACTACTCCAT
ACCGAGCATC CGCACATTAC CGAGGATTTA CTGCGTCAGG CCTACAAGAA CCGCAAGGCT
CATTTTATTC AGTTTATCCG TCACATCCTC GGCATCGAAA TTTTGAAGAC CTTTCCTGAA
ACGGTCAGCG AAGCGTTTGA GCAGTTTATC CAGCAGCACA GCAGCCTCAG CAGCCGGCAG
CTGGAGTTTT TGAACCTGCT CAAAAATTTT ATCATCGAAC GCGAAAAGGT GGAAAAGAAA
GACCTGATAA ACTCCCCCTT TACGGTCATT CACCCGCAAG GAATTCGCGG CGTTTTCAGC
CCGGCGGAAA TCAACGAAAT ATTACAACTT ACCGAAAGGG TGGCAGCCTG A
 
Protein sequence
MTKTEAQTRS ELIDKHLAQS GWNVKDPMQV VEEFDILMAL PEGIAEPRTP YEGHQFSDYV 
LLGKDGRPLA VVEAKKSSKD AAIGREQAKQ YCYNIQKQLG GELPFCFYTN GLETYFWDLD
NYPPRKVVGF PTRDDLERFQ YIRRNHKPLT QELINTAIAG RDYQIRAIRA VLEGIEQKKR
DFLLVMATGT GKTRTSIAMV DALMRAGHAE KALFLVDRIA LREQALAAFK EHLPHEPRWP
NVGEKLIAKD RRIYISTYPT MLNIIRDESQ YLSPHFFDFI VIDESHRSIY NTYGEILDYF
KTITLGLTAT PTDIIDHNTF RIFHCEDGLP TFAYTFEEAA NNVPPYLCSF QVMKIQTKFQ
KEGISKRTIS LEDQKKLLLE GKDVAEINFE GTQLEKTVIN KGTNTLIVKE FMEECIKDHN
GVMPGKTIFF CSTIAHARRM EDIFDKLYPQ HKGELAKVLV SEDPRVYGKG GLLDQFTNSD
MPRVAISVDM LDTGIDVREI VNLVFAKPVY SYTKFWQMIG RGTRLLEIAK PKPWCIEKDV
FLILDCWDNF EYFKLQPKGK ELKQQLPLPV RLVGLRLDKI EKATDTAQTT ITEREIGKFR
KQISELPQTS VVIKEAAAAL ARLEEENFWI TLNHQKLEFL RAEIKPLFRT VSEADFKAMR
FERDLLEYSL ARLRQEKEKA ETLKAGIVEQ ISDLPLSVNF VKAEETLIRA SQTNHYWAKQ
DSIETENALD ELNTRLGPLM KFREQDTGPG PMNLDLTDTL HHKEWVEFGP QHEAVSISRY
REMVEALIAE LTEHNPVLLK IKNGEAVTPD EANALAELLH TEHPHITEDL LRQAYKNRKA
HFIQFIRHIL GIEILKTFPE TVSEAFEQFI QQHSSLSSRQ LEFLNLLKNF IIEREKVEKK
DLINSPFTVI HPQGIRGVFS PAEINEILQL TERVAA