Gene Dole_0823 details

Gene Information

Locus tag: Dole_0823
Symbol:
ID: 5693658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 952762
End bp: 954243
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 54%
IMG OID: 641263420
Product: N-6 DNA methylase
Protein accession: YP_001528710
Protein GI: 158520840
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
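
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon. The variable names are illustrative and not part of the database record.

```python
# Values copied from the record above.
start_bp = 952762
end_bp = 954243

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1482
print(protein_length)  # 493
```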


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACG TATCAGGCAT TATCAAATCC ATTCAGGACA TCATGCGCAA AGACGTGGGC 
GTAGACGGCG ACGCCCAGCG CATCAGCCAG TTGGTCTGGA TGTTTTTCCT TAAAATTTTC
GACGACCGCG AAGCCGAACT CGAACTTCTC GAAGACGACT ATCAATCCCC CCTGCCGGAT
CACCTGCGCT GGCGGGCCTG GGCCAAAGAC CCTGAAGGCA TGACCGGCGA GGAACTGGCC
GATTTTGTCA ATGCCCAGCT TTTCCCCCAT TTAAAAGATA AACTTAAAAT CCAGGGGCTC
CAGGGCAAAC GGGCCATGGT CATCCGGAAC GTCTTTGAAG ACGCCTACAA CTACATGAAG
TCCGGCACCC TCATGCGCCA GGTGATCAAC AAAATCTGTG AAATCGACTT TAACACCCAA
AAAGACCGCC ACACCTTCGG TCACATCTAC GAGCAGATCT TAAAAGACCT GCAAAGCGCG
GGCAATGCCG GCGAATTTTA CACCCCCCGC GCCGTCACCC AGTTTATCGT CAATCGGGTG
GACCCGAAAC TTTCAGAAAC CGTGCTCGAC CCGGCCTGCG GCACCGGCGG CTTTCTCACC
GGCACCATCA AGCACAAGCG CGATCATTAC GTCAAAACCA CCGAAGACGA AAAAATTCTC
CAGGCTTCAA TCTCCGGCGT AGAAAAAAAA GCCCTGCCTC ACATGCTCTG CGTCACCAAC
ATGATCTTAA ACGGCGTTGA TACGCCCGTC AGCATCCGGC ACGACAACAC CCTGTCCCGG
CCTTACAAAG ACTACGGCGA AAAAGACCGG GTCAACGTCA TCGTCACAAA CCCGCCCTTT
GGCGGCATGG AAGAAGACGG CATTGAAAAC AATTTCCCCG CCACTTTCCG CACCCGGGAA
ACCGCCGACC TGTTTCTGGC GCTTATCATC AAACTCTTAA AAAAAGGCGG CCGGGCCGCT
ATCGTCCTGC CGGACGGTTT TTTGTTCGGC GAAGGCATGA AAACCCGGCT CAAAGAAACC
CTTTTGGCCG AGTGCAACCT GCACACCATT GTGCGCCTGC CAAACGGCGT GTTCAACCCC
TACACCGGCA TTAAAACCAA CCTGCTCTTT TTCACCAAAG GCACGCCCAC CGAGCAAATC
TGGTATTACG AGCATCCTTA CCCCGAGGGC GCCAAAAGCT ACAACAAGAC CAAACCCATG
CTGTTTGAAG AGTTTGAACC GGAGATAAAG TGGTGGGGCA GCGAGGCCAA AAATTTTAAA
ACCCGCAAAG AAACCGCCCA GGCCTGGAAG GTGTCGGCAG AAGACATCGT AGCCCGTAAC
TACAATTTGG ATATTAAAAA CCCCCATGTG GCCGAACAGG AAGATCGTGA CCCCCAAAAA
CTGCTGGCCC AATACCAGGC CCAGCAAAAA GAGATTGCGG ACTTGCAAGA TCAGCTAAAA
GCCATCCTGG CCGATGCCCT CACAAATGGC GGCCAGGCAT GA
 
Protein sequence
MSNVSGIIKS IQDIMRKDVG VDGDAQRISQ LVWMFFLKIF DDREAELELL EDDYQSPLPD 
HLRWRAWAKD PEGMTGEELA DFVNAQLFPH LKDKLKIQGL QGKRAMVIRN VFEDAYNYMK
SGTLMRQVIN KICEIDFNTQ KDRHTFGHIY EQILKDLQSA GNAGEFYTPR AVTQFIVNRV
DPKLSETVLD PACGTGGFLT GTIKHKRDHY VKTTEDEKIL QASISGVEKK ALPHMLCVTN
MILNGVDTPV SIRHDNTLSR PYKDYGEKDR VNVIVTNPPF GGMEEDGIEN NFPATFRTRE
TADLFLALII KLLKKGGRAA IVLPDGFLFG EGMKTRLKET LLAECNLHTI VRLPNGVFNP
YTGIKTNLLF FTKGTPTEQI WYYEHPYPEG AKSYNKTKPM LFEEFEPEIK WWGSEAKNFK
TRKETAQAWK VSAEDIVARN YNLDIKNPHV AEQEDRDPQK LLAQYQAQQK EIADLQDQLK
AILADALTNG GQA
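
One way to cross-check the gene sequence against the protein sequence and the reported GC content is sketched below in Python. It is a minimal illustration, not the database's own method. It assumes the sequences are pasted in as strings with the 10-base spacing shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as starts). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(dna)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above, with the display spacing kept.
print(translate("ATGAGCAACG TATCAGGC"))  # MSNVSG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to both) would confirm that the two listings agree. Multiplying `gc_content` of the gene sequence by 100 can be compared against the reported GC content.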