Gene Dvul_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2199 
Symbol 
ID4665064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2552707 
End bp2553777 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID639820444 
ProductTOBE domain-containing protein 
Protein accessionYP_967642 
Protein GI120603242 
COG category[H] Coenzyme transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG3585] Molybdopterin-binding protein
[COG4974] Site-specific recombinase XerD 
TIGRFAM ID[TIGR00638] molybdenum-pterin binding domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGTT CCCTCGAAAC GCTTTTTGCC GGCGTCGCCT CAAGAGCCAA TCCGGCAGGC 
ATGCTCACCA TTCCGGACGA GGTGCGTTGC CTCGATACCA ACGACCTCGA AAAGCTCGAG
TCCGCGTTCA GGGCATGGGT CGCCAAGGGA CGAGGCGCAG GAGTCACGTT GGCACGCAGG
CGTATTCTCG CTCTTTTCCT TCTCCTGCGG CATACGGGGG CACGTCTGGG AGAAGCCTCG
GGTCTGGAGG GGCGGTCGCT TGTGCTTGAG AAGGGGCTGG TGAGGCTCGG CACGCCACCT
CGCGATGTGC TCGTGGCGGA GACTGTGCTG GCAGAGATAG TGGAACTGGT CACCACGGGC
GGAGGGCGTG AGTCGCCCGT GCTCCCGGAG TATCCCTTCG CGGTCGACCC GGGGCATGTG
CGTCGCAAGT TCTACGCGTG CGCAGAGGCT GCGGGACTCG GGCGTGAGAT GGGGTCACCA
TCGGTCTTGC GCCGTTCGCG GGCTGTGGAG TTGCTGCGGG GGGGGGTGCC TCTCCCCGCC
GTCCAGCGGC TTCTGGGGCA TGGCAATGCG GAACTCACCG CCGCCTATGT CGCCGTGCCG
GATGACGCCA TGCGTCGTAT GGTGGGGGAT GCCCTCGACA GGGAACAGCG CAGAAGCAGT
GCCCGCAACG CCTTCTTCGG CAGGGTAGAG GACGTCCGTA TCGGGGATGT GCAGGCTTCA
GTGCGTCTGC AGACACCGAC CGGGCTCGAA GTGCTCGCCA TCATCACCAA TGACAGTCTC
GAGGCCCTCG GTCTCGCACG GGGTGCCTAT GCCCTCGCCG AGGTGAAGGC ACCGTGGGTC
GTACTGCAAC GCGGTGAACA CCCGCGCTCC AGTGCGGGCA ACGTCTATCC GGGAGAGGTC
GTCCGTGTCC GCAAGGGGGC TGTCTCGGCT GAGGTCATCG TGAGGCTCGA TGCTGGAACG
GAGGTCTGTG CCGTCCTCGC CGCTGACACT CTGGCGACGC TTGAACTGTG CGACGGGGCT
CATGTCTGGG TCGTGATCAA TGCCCTCTCT GTCATCCTCA ATGCCGTATA G
 
Protein sequence
MEGSLETLFA GVASRANPAG MLTIPDEVRC LDTNDLEKLE SAFRAWVAKG RGAGVTLARR 
RILALFLLLR HTGARLGEAS GLEGRSLVLE KGLVRLGTPP RDVLVAETVL AEIVELVTTG
GGRESPVLPE YPFAVDPGHV RRKFYACAEA AGLGREMGSP SVLRRSRAVE LLRGGVPLPA
VQRLLGHGNA ELTAAYVAVP DDAMRRMVGD ALDREQRRSS ARNAFFGRVE DVRIGDVQAS
VRLQTPTGLE VLAIITNDSL EALGLARGAY ALAEVKAPWV VLQRGEHPRS SAGNVYPGEV
VRVRKGAVSA EVIVRLDAGT EVCAVLAADT LATLELCDGA HVWVVINALS VILNAV