Gene DvMF_0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0229 
Symbol 
ID7172109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp256056 
End bp257255 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content42% 
IMG OID643538726 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002434656 
Protein GI218885335 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.0324708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTTTGC CTGACATTAA GCCGGAAACA AAGCTGTCGT CTCTTGAATG CGATAAGTAT 
ATAGGGTGCA ATGAGTGTTT TGATATTAAT GATGTCGCAC CGATTGTGGG AATGTCGACA
ACGTTTATAC GTAAGGCAAT TGGTGTGAGG ACAAAGGTTT ATGTGAACGA TGTTTTAAGG
CTATTGAATC TAGACGCTTT CGGTGAGACA TTTATTCCAA GGAGTATGGT TATTGATTAT
TTGTTGAGAA GTAAAAATGG TGTTAAAAAA AATAATGCAG GAATAGCTAA TGTTCCATTC
GAATTAGTTG AAGGTGATGC GCTTGAGAAG ATTAGATCGT TTTCTGATAA AAGTGTAAAT
TGTGTCGTAA CATCAACCCC ATATTGGGGT TTGAGATTGT ATAAGGATTC GTTTTTTGTC
CGATGGGGCG ATGGGGAGCT TTGCCCTTTT GGTCATGAGC AAACGCCAGA GTCATTCATT
AGACACAGTG TAGAGGTGTT GTCGGCATTG TATAATGTTT TGACAGATGA TGGGTCTGTC
TGGTGGAATA TAATGGATTC ATTTAATACT CGTACACAGA TTAGAGGTAG CTCTGCAGAG
GCGCTGCAGG CAATGCAAGG AAAAGATAGT CGAGGATGGG CTGATCATGC GTGTAGGAGA
TATAGTGCTG GTCATTCATA TCTGAAGGAT GGTGAGCAAT GTATGATACC TTCGAGAATA
GCAGAACGTG CGTCGCGGGT CGGATATTAC GTTAAAGCAG TAATTACATG GGCAAAGACA
AGTTCTCTTC CCGAACCTCA GACTTCGCGG GTCAGTAGGA ATCTTGAGTA TGTTCTGCAC
TTGACAAAGG TGCGAACCCC AAAATTTGAC AAAGAAGTGT ATAGGAGTCT TCCTTCGGCA
TTGGGGGGGC GCAATAACGG GTCTGAAACA GATAAATTAT CTGATGTTTG GGTGCTACCC
ACTTCTAATG GTCGCCATGG GCATGGGGCT CAATTTCCAG TAGCCCTTCC CGCTCGCTGT
ATTGCTCTTG CGACAAATGC AAATGATTTG GTTCTTGACC CATTTGTGGG TGCAGGTAAC
TCTGCCATTG CGGCATTGGC GCTTAAAAGA AAATTCATAG GCATTGATAC ATCGAGTGAA
TATATAGATG TTGCTAAAAA GAGAATTAAA GAATCAGTGT TTCGGCTAAG TGTGTCGTAG
 
Protein sequence
MCLPDIKPET KLSSLECDKY IGCNECFDIN DVAPIVGMST TFIRKAIGVR TKVYVNDVLR 
LLNLDAFGET FIPRSMVIDY LLRSKNGVKK NNAGIANVPF ELVEGDALEK IRSFSDKSVN
CVVTSTPYWG LRLYKDSFFV RWGDGELCPF GHEQTPESFI RHSVEVLSAL YNVLTDDGSV
WWNIMDSFNT RTQIRGSSAE ALQAMQGKDS RGWADHACRR YSAGHSYLKD GEQCMIPSRI
AERASRVGYY VKAVITWAKT SSLPEPQTSR VSRNLEYVLH LTKVRTPKFD KEVYRSLPSA
LGGRNNGSET DKLSDVWVLP TSNGRHGHGA QFPVALPARC IALATNANDL VLDPFVGAGN
SAIAALALKR KFIGIDTSSE YIDVAKKRIK ESVFRLSVS