Gene Dvul_0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0133 
Symbol 
ID4663363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp162987 
End bp164102 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content67% 
IMG OID639818328 
Productformamidopyrimidine-DNA glycosylase 
Protein accessionYP_965584 
Protein GI120601184 
COG category[L] Replication, recombination and repair 
COG ID[COG0266] Formamidopyrimidine-DNA glycosylase 
TIGRFAM ID[TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.272092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0428662 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAC TGCCTGAAGT GGAAACCATC GCCTGCGGCT TGCGTCCGGC CCTTTCAGGG 
CGGCGCATCG TGGGTGTTAC GGTGCACAAC CCCGGCACGC TGGAAGGTCC GCTGCGCACA
CCTGCCGCCT TCACGGAGGC CGTGCAGGGG CGACGCATCG CGGATGTGGG ACGGCGCGGC
AAGCTGCTGC TTGTGGCGTT CGCGTCATTG CCACCTGTCG GCCACGCAGG GCAACCGCGA
CCTGAAGGTC TCTCCTCTTC CACGGTTCGC GACTTCCTCG TCACGCACGG CTTCCATGCC
GCAGGGTGCG CCACGTCAGT CCATGCCTGT GCCCCCCTTC TTGCGGACGG GCAACAGACA
TCCGGGCCCG TCCCGGAACG GGGCCGTCTC GCGGGGCACG GCGACGGCAT GGATGGCACA
TCGCGGACCG GAAGCACCTT GCCCGGAACC GGAGGCACCG AAAACTCTGA CGCTGTAGCC
GTAGCGGATG ACGACACCGT CCTCGGTCTC GCCTTCCACC TCAAGATGAC CGGACGCCTC
TTCATCCACC CGCCCGCAAC CCCGGCGGGT ATCCACACCC GCGTGGTCTT CGACCTTGAA
GGCGGCACTC GCCTCTTCTT CGATGACGCC CGCAAGTTCG GCTATGTGCG TTGCATCACC
CGGCGCAGCC TTGCGCTGTG GCCTTTCTGG CGCGACCTCG GCCCCGAGCC CCTCGAGACT
GACGCGCGCG GCTTCGCGGC GCGGCTCGCC CGCAGGCGAG GGCGCATCAA GGCCCTGTTG
CTCGACCAGA AGGTCGTGGC GGGGGTTGGC AACATCTATG CCGACGAGTC GCTGTTCCGT
GCCGGCATCC GCCCCGACAC GCAGGCCCAT ACCCTGATAC CTGAACGCCT CTTCGCCCTG
CACGGGCATC TTCAGGATGT GCTACGCGAG TCCATCGCCG AATGCGGCAG TTCCATCCGC
GACTACCGCG ATGCACACGG CGATGCGGGG GCCTTCCAGA ACAGCTTCAG GGTCTACGGG
CGGGGCGGGC AGCCTTGCCG TCACTGCGGC ACGACTCTCG CCACGGCGCA GGTAGCAGGA
CGCACCACGG TCTTCTGCCC CAGATGCCAG CGGTGA
 
Protein sequence
MPELPEVETI ACGLRPALSG RRIVGVTVHN PGTLEGPLRT PAAFTEAVQG RRIADVGRRG 
KLLLVAFASL PPVGHAGQPR PEGLSSSTVR DFLVTHGFHA AGCATSVHAC APLLADGQQT
SGPVPERGRL AGHGDGMDGT SRTGSTLPGT GGTENSDAVA VADDDTVLGL AFHLKMTGRL
FIHPPATPAG IHTRVVFDLE GGTRLFFDDA RKFGYVRCIT RRSLALWPFW RDLGPEPLET
DARGFAARLA RRRGRIKALL LDQKVVAGVG NIYADESLFR AGIRPDTQAH TLIPERLFAL
HGHLQDVLRE SIAECGSSIR DYRDAHGDAG AFQNSFRVYG RGGQPCRHCG TTLATAQVAG
RTTVFCPRCQ R