Gene Dole_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0251 
Symbol 
ID5693069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp285776 
End bp288202 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content56% 
IMG OID641262831 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_001528138 
Protein GI158520268 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCACTTA AAAAATCGGA GCTTTACAGT TCACTCTGGG CTTCCTGCGA TGAGCTTCGC 
GGCGGCATGG ATGCCAGTCA GTATAAAGAC TATGTCCTGT TCATGCTGTT TATCAAATAC
ATCTCGGATA AATATGCCGC CTCCGACGAT TACGCCCCGC CGGTCACCAT CCCCAGGGGC
GCAAGCTTTC AGGATATGGT CAAGCTCAAG GGCAAGAGCG ATATCGGCGA CAAGATCAAC
ACCCAGATCA TTCAGCCGTT GATCGACAGC AACTCCCGCC TGGCCCGCAG CGACTTTCCC
GACTTCAACG ACCCCAACAA GCTCGGCGAA GGCAAGGCCA TGGTGGATCG CCTGACCAAC
CTGATCAGCA TCTTCCAGAA ACCGGAACTG GATTTTTCCA AAAACCGGGC CGACCATGAC
GATATTCTCG GCGATGCCTA TGAATACCTG ATGCGCCACT TTGCCCAGGA GAGCGGCAAA
AGCAAGGGCC AATTCTACAC GCCTTCGGAA GTCAGCCGGA TTATTGCCAA AGTGATCGGT
ATTTCGCCGC AAAAAGCCGT TGCTTCCACC ACGGCCTATG ACCCGACCTG CGGCTCGGGG
TCGCTGCTGT TGAAAGTGGC GGCCGAGGCG GGCAAACACA TTACCCTTGA GGGGCAGGAA
AAAGACGTGA CCACCGCCGG TCTGGCCCGC ATGAACATGA TCCTGCACGA CTTTCCAACC
GCCAACATCC TCAATGGCAA CACCCTGGCC TCTCCCAAAT TCAAAGACGG CGAAAAGCTG
CGCACCTATG ACTTTGTGGT CGCCAATCCC CCGTTTTCTG ACAAAACCTG GAGCACCGGG
CTCACGTCCG AAAACGATCC CTACCAGCGC TTTGAATGGG GGGTGCCGCC GGCCAAGCAG
GGCGATTACG CTTACCTGCT GCACATTATC CGCTCGATGA AAAGCACGGG CAAGGCGGCC
TGCATTCTGC CGCACGGCGT GCTGTTTCGC GGCAATGCGG AAAACGTCAT CCGCAAGCGG
CTTGTCCGGT CCGGCTACCT GAAAGGCATC ATCGGCCTGC CCGCCAACCT GTTTTACGGC
ACCGGCATTC CGGCCTGCAT CCTGGTGCTG GACAAGGAAA ACGCCACGGC CCGCAAAGGC
ATTTTCATGA TCGACGCCTC CAGGGGCTTT ATCAAGGACG GCAACAAGAA CCGCCTGCGC
GAGCAGGACA TTCATAAAAT TGTCGATACC TTCCGCAAGC AAGCCGAAAC GCCCCGCTAT
GCCCGCATGG TGCCCTTTGA CGAGATCGCT GATTCCAAAA ACGACTACAA TCTCAATCTG
CCGCGCTACA TCGACGGCAC CGAACCCGAA GACATTCAGG ACATTGACGG CCATCTGCGC
GGGGGCATTC CCGACCGGGA TATTGACGCG CTTTCTGATT ACTGGGCAAT TCTTCCCACT
GTTCGCGCCG CGCTTTTTAA GCCCCTGCGT CCCGGCTATG CGCAACTCGC CATTCCCCAT
TCTCAATTGA AGCAAGCCAT TCTGGGTCAC GACGAATTTA CGGCCTTTAA AAAGACCGTG
ACAAAAATCT TTGACAAGTG GCAAAAGGCG AACACTCCGG CCCTGAAAGG CTTTGACAAA
AAAGGACACC CCAGAGCCCT GATCGAGGCC ATTGCCGAAC ACCTGTTGAC CGCCTTCCGC
GGCGCCCCGC TGCTTGACGC CTATGATGTC TACCAGCACC TGATGGACTA CTGGGCCGAG
GCCATGCAGG ACGACTGTTA CCTGATTGCC GCCGACGGCT GGGTCGCCAA ACCCCACCGC
GTCATGGAAG AGGTCAAGGC CGGCAAAAAA AAGGGCGAGA TGAAAGACAA GGGCTGGGCC
TGCGACCTGA TTCCCAAGCC CTATATCGTG GCCCGCTATT TTGCCAAGGA ACAGGCCGAG
CTGGATGCCC TGCAAAGTGA ACTGGAATCT GTTACTGCAC GGATCACCGA GCTTGAAGAA
GAACATGGCG GTGAAGAGGG CGCTTTTGCC GACCTGGACA AGATCAACAA GGGCGAGGTC
AACAGGCGCT TAAAAGAGAT CAAGGGCAAT TCCGACTATG CCGACGAAGA AAAAATTCTA
AAACAGTGGG CAAAGCTGGA ACAGCAACAG AGCGGCCTCA AGAGCAAGAT CAAGGAAGCC
GATACCGCGC TCGATAAGTT GGCCTATGAA AAATACCCCC GGCTCAGCAA GGATGAAATC
AAGACACTGG TGGTGGATGA CAAGTGGCTG ACCACCCTGG CCGTGGCCGT TCAGGGCGAA
CTGGATCGCG TCTCGCAAAC CCTCACCTCC CGCATTCGCC AGCTGGCCGA ACGTTACGCC
ACGCCCCTGC CGCGTCTGGT GAACGAAGTG ACGGACCTTT CCGGCCGGGT TGATGCGCAT
CTGAAAAAAA TGGGGTATCA GCCATGA
 
Protein sequence
MALKKSELYS SLWASCDELR GGMDASQYKD YVLFMLFIKY ISDKYAASDD YAPPVTIPRG 
ASFQDMVKLK GKSDIGDKIN TQIIQPLIDS NSRLARSDFP DFNDPNKLGE GKAMVDRLTN
LISIFQKPEL DFSKNRADHD DILGDAYEYL MRHFAQESGK SKGQFYTPSE VSRIIAKVIG
ISPQKAVAST TAYDPTCGSG SLLLKVAAEA GKHITLEGQE KDVTTAGLAR MNMILHDFPT
ANILNGNTLA SPKFKDGEKL RTYDFVVANP PFSDKTWSTG LTSENDPYQR FEWGVPPAKQ
GDYAYLLHII RSMKSTGKAA CILPHGVLFR GNAENVIRKR LVRSGYLKGI IGLPANLFYG
TGIPACILVL DKENATARKG IFMIDASRGF IKDGNKNRLR EQDIHKIVDT FRKQAETPRY
ARMVPFDEIA DSKNDYNLNL PRYIDGTEPE DIQDIDGHLR GGIPDRDIDA LSDYWAILPT
VRAALFKPLR PGYAQLAIPH SQLKQAILGH DEFTAFKKTV TKIFDKWQKA NTPALKGFDK
KGHPRALIEA IAEHLLTAFR GAPLLDAYDV YQHLMDYWAE AMQDDCYLIA ADGWVAKPHR
VMEEVKAGKK KGEMKDKGWA CDLIPKPYIV ARYFAKEQAE LDALQSELES VTARITELEE
EHGGEEGAFA DLDKINKGEV NRRLKEIKGN SDYADEEKIL KQWAKLEQQQ SGLKSKIKEA
DTALDKLAYE KYPRLSKDEI KTLVVDDKWL TTLAVAVQGE LDRVSQTLTS RIRQLAERYA
TPLPRLVNEV TDLSGRVDAH LKKMGYQP