Gene Dvul_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1040 
Symbol 
ID4664234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1275966 
End bp1277945 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content68% 
IMG OID639819265 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_966487 
Protein GI120602087 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTAG ACCTCCTCAA CATCAGCCGG GACGAGATCG TCGTTGACCT GTTCGCGGGC 
GGGGGCGGTG CCAGCCTCGG CATCGAGATG GCAGGGTGCC GCGTGCACGC TGCGGTGAAT
CACGATCCGG TTGCCGTCTC GCTCCACCGC GAGAACCACC CCGACACCGA GCACTACACA
CAGGACGTGT TTACCGTGTC GCCGCAGTGG GTGACGCGCG GTCGCAAGGT GGGCCTGCTG
TGGGCCTCGC CAGACTGCAC GCACCACTCC AAAGCCAAAG GCGGAGCACC CACGCGCAAC
GCCCGTCGCC GCGAGCTGGC CCGTGTCATT GTCGACAAGT GGATACCGGA GTTGCGCCCA
AGCGGAGCAC ACCCCCGCGT CATCATCCTC GAAAACGTCG AGGAGTTTCA GGACTGGGGC
CCGTTGGACG CCAAGGGCCG CATCATCGAG GCGCAGCGTG GCAAGTCTTT CAAGCGGTTC
ATCAGCGACC TCAAGCGGTT CGGCTACAAG GTCGAGTGGC GCGAGCTGCG GGCGTGTGAC
TACGGCACAC CAACCATCCG CAAGAGGCTG TTCCTGATTG CCCGGCGCGA CAAACTGCCC
ATCGTCTGGC CCGAGCCGAC GCACGGTGCA CCCGGCTCTC CCAAGGTGCT GGCTGGCCAG
CGCAGGCCGT GGCGAACGGC AGCCGAGTGC ATCGACTGGT CACTACCCTG CCCGAGCGTA
TTTGCCTCGT CCGGGGAGAT TATGGAGCGG CACGGGGTGC GGGCCATCCG CCCCCTGTCG
CCCAACACGC TGCGCCGGGT CGCCAAGGGT ATCCAGCGGT ACGTCGTGGA GGCCGCCGAG
CCGTTTGTGG TGCAGATGCG TACCGGGGCC GTCGGTCATC CCATCGACGA GCCGTTGCGC
ACCGTCACGG CGGGGGGCAA GGCCGCAAGG CCGGGTACGG GCAACACGTT CGCCCTGTGC
GTTCCAAGCA TCCAGACCTA CTACGGCGAC CACGCCGGGA CGCACGACGG CGCACGGAGA
GGATGCGCGA TGGACGCGCC CGTGGGCAGT GTCACAGCCG GGGGCAACCG CCACGCGCTC
GCCGTGGCCC ACCTGCAACG GCAGTTTGGC AACAGCGTCG GTCAGGAGTG CGACAAGCCC
GCGCCTACCG TCATGCCCGG GGGCGACGGT AAGACCGCTG TCTGTGCGGC CATGCTCAAA
CACTATGGCG GCGTGGTCGG GCACGAGGTC GAGCAGCCCC TCGGCACAGT GACCCGCGTT
GACCATCACT CGCTCATGAC GGCGGTGGTG GTCGGGGCCG GCGGCCCCAG CTACGGCGGA
AGACCGGCAG CAGTCGACGC GCCGCTGGGC ACGGTGCTGA CCGACAATCA CCGCGCCGTC
GCTGTCTGCA AGATGCGTGG TGACAACGTC GGTCACGGGG CCGACGAGCC GCTCCACACG
GTCAGCGCAC GCGGGACGCA TCATGCGCTC CTCGCTGCGA CCATCGCCAA GGACTACGGT
ACGGGCGGAT GCGTGGACAC AAGAGCCCCC CTCGCGACTG TTACGCAGCG TGACAAGCTG
GAGCTCGTCA CGGGGTGCCT CGCGGCCTAC TACGGCGCAG AGGGCGACGG CCAGCCCGTC
ACGGCCCCCA TGCGCACCAC GACCACCCGC GACCGCTTCG CGTTCGTCCG CGCCCTGCTG
GACGAGTATA CCCCCGGCGT CGAGCCTGTC GTCACCATCG GCGGGCAGCG TTATGCCGTC
GTCGACATCG GGCTTCGGAT GCTGACGCCG CGCGAGCTTG CGCGGGCGCA GGGTTTTCCG
GACACCTACA TGCTCGACAT GGTGGGCGGG CAGCCTGTCA CCAAAGCGGC GCAGGTCAGC
ATGATTGGGA ACAGTGTGTG CCCCGATTTG GCCGCAGCTC TGGTGGGGGC CAACTACAAG
CCGGTGCGCC ACGATGCGCC GGTGGTCGCC ATGCCGCTTC TGGAGGTGTG CAATGCGTAG
 
Protein sequence
MLLDLLNISR DEIVVDLFAG GGGASLGIEM AGCRVHAAVN HDPVAVSLHR ENHPDTEHYT 
QDVFTVSPQW VTRGRKVGLL WASPDCTHHS KAKGGAPTRN ARRRELARVI VDKWIPELRP
SGAHPRVIIL ENVEEFQDWG PLDAKGRIIE AQRGKSFKRF ISDLKRFGYK VEWRELRACD
YGTPTIRKRL FLIARRDKLP IVWPEPTHGA PGSPKVLAGQ RRPWRTAAEC IDWSLPCPSV
FASSGEIMER HGVRAIRPLS PNTLRRVAKG IQRYVVEAAE PFVVQMRTGA VGHPIDEPLR
TVTAGGKAAR PGTGNTFALC VPSIQTYYGD HAGTHDGARR GCAMDAPVGS VTAGGNRHAL
AVAHLQRQFG NSVGQECDKP APTVMPGGDG KTAVCAAMLK HYGGVVGHEV EQPLGTVTRV
DHHSLMTAVV VGAGGPSYGG RPAAVDAPLG TVLTDNHRAV AVCKMRGDNV GHGADEPLHT
VSARGTHHAL LAATIAKDYG TGGCVDTRAP LATVTQRDKL ELVTGCLAAY YGAEGDGQPV
TAPMRTTTTR DRFAFVRALL DEYTPGVEPV VTIGGQRYAV VDIGLRMLTP RELARAQGFP
DTYMLDMVGG QPVTKAAQVS MIGNSVCPDL AAALVGANYK PVRHDAPVVA MPLLEVCNA