Gene Dole_1258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1258 
Symbol 
ID5694093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1500676 
End bp1503039 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content52% 
IMG OID641263852 
Producttype III restriction protein res subunit 
Protein accessionYP_001529141 
Protein GI158521271 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.377183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGAA ACGAAAAACA GACATGCACA GACTTAATTG AACCAGCCCT TACAAAAGCC 
GGCTGGCAAT GGGATAGTCA GGTCACCATC GGCCTCGGGC GGGTCAATAT TACTGGCGAT
ACCATGTATG ATGAAACGCA GACAATTATT GCCGACTATG TTCTGCGCTA TTATGGTGCG
CCACTGGTCG TCCTTGAAGC CAAAGCAGAG TCGATCTCTG CTGCCGATGG CATGCAACAG
GGATCGCGTT ATGCGGGACG GCTGGACATC CGTTTTTCCA TCGCCACCAA TGGCACCGAC
TGGGTTCTTA CCGACAATGA TTCTGGCAAA TACGAAACGC TTTCTGCGCC GCCATCGCCG
GAAGCCATCC TCACACATCA CGGCATTTCC ATTGACTGGG ATCAATGGGG CGAGGTCTTT
TCAGCTGGGT ACCATATTGA TCAAGTCTCT CGCAAAGTAG TCAGGCCATA TCAAGATGTG
GCCATCCATA AAACCCTCTG GCATTTTGCC GGTGGGAATA ACCGAGCCCT TCTGCTGATG
GCAACCGGGA CAGGAAAAAC ATTTGTCGTT TTTCAGTTGG TCTGGAAATT GTTAAACACC
AGTATCCTGA AGCGCCAGCA CATCCTTTTC CTGACCGACC GTAATTCGCT TAAAGATCAG
GCCTACCGGG CTTTTGCCGC TTTTTCCACC GATGAACGCG TCACCATCAA CAAGGATACC
GTCGCCAATG GCCAGCACCA GGTTGGTAAA GTGTTCTTCG CCAATTATCA AAACCTGGAT
GAAGAACTGG ACGGCAAAAA AATCTTTGAG CACTACGACC AGGACTTTTT TGATCTGGTC
ATTATCGACG AATGCCATCG GTCCGGATTC GGCGACTGGT TTGGCGTGCT GGAACATTTT
AACTCCGCCC TGCAGATGGG TCTTACCGCC ACGCCCCGCG AACTTGAAGA AGGCCGCCGC
GTTTTGACTG AAGAAGAAAA ACGTCGCGAT ACCTATGAAT ACTTCGGTGC CCCGATCTAC
ACATACAGCC TTAAGCAGGC TATTGAGGAC GGCTATCTTG TTCCCTACCT GCTTGAAGAA
CGCATTACCA ACGTCGATGA AGAAGGCTAC ACCGGGCTGG ACGGCAAACA CTATACGACA
GCCAACTTTG AACGCGATAT CCGCATGCCG GACCGTACCA CAGCCATTGC TGAAGACCTT
TGGGAAATCT TAGGGCAATA CGACTTGCGA GATGAAAAAA GCATTGTCTT CTGCGTGGAC
GATACCCATG CGGCGTTCAT GGCTGCTGAA CTGCGGCGCC TGTCAGGAGA CGATGATTAT
GCCGGCCGCA TAGTTCGTTC AGAACGCAAC AGTCATCAGC TGGAACGCAA CTTTGCCACT
GTCGGTTCCA CCAAACCCCG CGTCGCCGTA ACCGTGGATC TTTTAACCAC TGGTTTCGAC
GCTCCAGATG TAAAAAATAT CGTTTTTGTA AGACCACTTC GAAGCGCCAT CCTTTACAAG
CAGATGAAAG GGCGCGGCAC CCGCCTCTGC GAGGATATCA ATAAACGGTA TTTTACCATT
TTCGATTATT CCGGTGCCAG TCAACTTGAA GATGCGGAAT TTGACGGCCA TCCGGCCAAC
CGGCAGAAAG GGGCTCAACC CAAAACCAAG CCGCGGAAGA AAACCGACGA GCCCGCCGCA
AAACCGGCGG GTGAAGGCAT CTCGGTGGTT ATTTCCGACA CCAACCGCTA TGTCTGCCTG
GCGGATGGCC GCAAGATCCC CTTTGAGGAG TATACCGAGC AGTCGAGGAA CTTTATCCTC
GATGTTTCAG CCAAGTCCCT TGACGAGCTG CTCACCATCT GGATCGACAA GAACAGCCGC
AAGGAGTTGC GTGAGGAGCT GCGGGACCAT GATATTTATC CGTCCGCCTT CCGCCATTAC
CTTGAATTGC CGAAGACCGA CGATGTGGAC ATCCTGGCCA AAATCGGCTT CAACCTGGTG
CGAGTGCCAA CACGCCCACA GCGTGTAGAG CGTTTCTGGA AGGATGAGGC CCAATGGCTG
GAATATGAAT TGGGGGCAAA TAGAGTTAAT AAAGCACAAG TGTTCTCATT TCCTAAGTCT
CAACCCATGG ATAAGGTTGC GGACCCCGAG CCCGGCTACC GTTCGTCAGA TCCCTTCAAA
GTATTATTCT GGCAATGCGC CCTTGACCAT TACCAGTTGT TCGGAATCGA TGATCTGGAG
CAGGCCCGGA CCTATGGCGC CCCCCAGTTT GTCGCACAGT TCGGCAGTTT TCAAACCCTG
ACCAGCCGTT ACGGCGGGCC GCAACTATTA AAAACCGACC TGGAGGCGGT AAAGCACCAC
CTCTATGTAC CCATGACTGT ATAA
 
Protein sequence
MSRNEKQTCT DLIEPALTKA GWQWDSQVTI GLGRVNITGD TMYDETQTII ADYVLRYYGA 
PLVVLEAKAE SISAADGMQQ GSRYAGRLDI RFSIATNGTD WVLTDNDSGK YETLSAPPSP
EAILTHHGIS IDWDQWGEVF SAGYHIDQVS RKVVRPYQDV AIHKTLWHFA GGNNRALLLM
ATGTGKTFVV FQLVWKLLNT SILKRQHILF LTDRNSLKDQ AYRAFAAFST DERVTINKDT
VANGQHQVGK VFFANYQNLD EELDGKKIFE HYDQDFFDLV IIDECHRSGF GDWFGVLEHF
NSALQMGLTA TPRELEEGRR VLTEEEKRRD TYEYFGAPIY TYSLKQAIED GYLVPYLLEE
RITNVDEEGY TGLDGKHYTT ANFERDIRMP DRTTAIAEDL WEILGQYDLR DEKSIVFCVD
DTHAAFMAAE LRRLSGDDDY AGRIVRSERN SHQLERNFAT VGSTKPRVAV TVDLLTTGFD
APDVKNIVFV RPLRSAILYK QMKGRGTRLC EDINKRYFTI FDYSGASQLE DAEFDGHPAN
RQKGAQPKTK PRKKTDEPAA KPAGEGISVV ISDTNRYVCL ADGRKIPFEE YTEQSRNFIL
DVSAKSLDEL LTIWIDKNSR KELREELRDH DIYPSAFRHY LELPKTDDVD ILAKIGFNLV
RVPTRPQRVE RFWKDEAQWL EYELGANRVN KAQVFSFPKS QPMDKVADPE PGYRSSDPFK
VLFWQCALDH YQLFGIDDLE QARTYGAPQF VAQFGSFQTL TSRYGGPQLL KTDLEAVKHH
LYVPMTV