Gene Dvul_0567 details


Gene Information

Locus tag: Dvul_0567
Symbol:
ID: 4664496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 713261
End bp: 716449
Gene Length: 3189 bp
Protein Length: 1062 aa
Translation table: 11
GC content: 65%
IMG OID: 639818777
Product: pyruvate, water dikinase
Protein accession: YP_966017
Protein GI: 120601617
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
TIGRFAM ID:
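
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(713261, 716449)
print(length)                     # 3189
print(protein_length_aa(length))  # 1062
```

Under these assumptions both derived values agree with the Gene Length and Protein Length listed in the record.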


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.266468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.689283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTCT TCGATTTTCT GACCCGAGGC CGCAACACAT GCGACCTGCT TCATGCCGAA 
GAAGGTGCCG CAGCAAGAAA ATACAGGCAC TTCAAAGATT TCTTGCACGA CAATCACCGT
GCCCTCTCCG CCGTGGCCGA CCTTGAGGGG CTCTATCACC AGGGGGCCTA CAGCCTGCCC
GAAGCCCGCC GCCACCTCAA TGAACTTCTG GAGGCCACGG ACAATCTTCT TGGCGACCTC
GACCGGCTCA CCGGGGGACG CTACCCCGCA CTTCACGACG TACTCGCCCG CCTGAACGAA
GAGACCGGGG TCATCCTCTC GGCGGCAACC CCTGCCCCCT GCGTACCTCC GGTGCTTTCA
CTTGCGGATG TGACGCCCGA CATGGCATCG GCCTGCGGCA CCAAGTCGAC CAACCTCGCC
ACCATGCGGA ATGTCCTCGG CATTCCCACC CCGCCGGGGT TCGTCGTCAC CGCACGGGGT
TTCGAGCGGT TCATCGAGGA GAACGCACTG GGTGAGCGCA TCGCACAGGC CCTGTCACGA
TGTGCCGCCA CCGCGAGACC GGGTGACGAC CCCGTGATGC TGGAACGCGT GAGTGCCGAA
ATCCGCGACA TGGTGCTCCA TGCCCCCGTC CCCGCGTCGC TGTCGGACGC CATTCTCGAT
GCCTACCGTG CCCTTGAGGC CGCCACCCAT CCCGGTGTGC ATGTGGCCAT GCGCAGCAGC
GCCGTGGGCG AAGACACCGA GGCGTCGTTC GCGGGGCAGT ACGTCACGGT GCTCAACGTC
ACCGCCGCCG ACATCCTCAC CGCGTATAGG GAAGTGCTCG CCAGCAAGTA CTCGCCCCGC
GCCATCCTCT ACCGTCTCAG CTACGGTCTC GAAGACCGCG ATACGCCCAT GTGCGTTGCG
GGCATCGCCA TGGTGCGTTC GCGTGCCAGC GGCGTCATCT ACACGGTCGA CCCCTCCGCG
CCGGACTCTG GCAGCCTCAA GGTGGCTGCC CTGCTTGGCC TTGGCGAACT TCTCGCCGGG
GGTGAAGGCA GTGCCGACGT CTTTCATGTG GACAGGGCCA CAGGCGACCT GCGCAACACC
GAACTCACCG AGAAGACACA TCGCCTTGTC TGCCTTCCCG ACGGTGGCAT CAGCCTTGAA
GCGGCGACGG CCGAAGAACG CACCCGGCCC GCCATCGACG ACGCCATCGT GCAACGGCTG
CATGGCTATG GCATACGGCT TGAAGAGTAT TTCAAGTGCC CGCAGGACAT CGAATGGGCC
GTCGCCCCCG ACGGTGACCT CTTCATCCTG CAATCGCGGC CTCTCGGACT CGTCAGCGCC
CATCGGCCGC AGACCGCAGT CGAGTACACC GGGCACCCCA GACTCATCAC CCGCGGTCGC
GTCGCCTCTC CCGGGGCGGC TTCGGGCGTG GTCTACAACG CCTTGGGGAA CAACCCCGAC
CATGTTCCCG CGGGGGCCAT TCTCGTTGCC CGTACCGCGT CGCCCGACTA TGCGGGACTC
ATGGGCCGGG TGCGCGGCAT CGTCACCGAC GTGGGCAGCG GGGCCAGCCA TCTGGCATCC
GTGGCCCGCG AATTCGGCAT TCCCGCACTG TTCGACACAC GCGATGCCAC CATGCGCCTC
ACCCACGGGG CCACGGTGAC CCTCGATGCC GATACGACCT CTGTCTACGA AGGCACCGTG
GACGAACTTG CCAGCCGCAC GGCCCCCGCC CGCAAGCCAT TCTTCGACTC GCCCGCGCAC
CACAAGTTGC GCCGCGTTCT CGACCTCGTC TCGCCGCTGC ACCTTCTCGA CCCGCGCTCA
CCGGAATTCG ACCCGGCGCA TTGCGCCTCT GTGCACGACA TCATCCGCTT CGCGCACGAG
AACGCCATGC GCGAGATGTT CGGCCTGTGT GACGATTCTC TTGACGTGAC CCGCGCCGTG
CGTCTGCATC TCGGCATCCC CATCATGCTC TACTGCATCG ACCTCGGTGG CGGGCTTGAT
GACGGGCTTA CCTCGTGCGA ACGTGTCGAG ACGGCCCATG TGCGCAGCGT TCCCATGCGC
GCCCTCCTCG ACGGGCTGAC GCATCCCGGC ATCTCGTGGC AGGGCGGCGT GGCACTCAAC
GCCCACAGCC TGCTTTCGGT CATGGCTTCA GCGGCGACGG CAGACAGCGA GAACCTCGGC
GGTGACAGCT TCGCCGTCGT CTCACGCGAC TACATGAACA TCTCGGCGAA GTTCGGTTAC
CATTACGCCA ACATAGACGC CCTGTGCAGT GACAGGCCAA GCCAGAACCA CGTCGTGCTC
AAGTTCGCGG GCGGCGCAGG CAGCTATTTC GGGCGGTCGC TACGCGTCTC GTTTCTCGCG
TCGGTCCTGC AACGACTTGG CTTCGGGGTG GACATCAGGG GCGACATGCT CGAAGCCACC
CTCACCGGGC TGGATGGCGA CGCGCAGCAG GAACTTCTCG ACCAGACCGG ACGACTGCTG
GCCGCAGCAC GGCTGCTGGA CATGACCATC GGCAACGGGG CCGACGTGGA ACGCATGGTG
AGCATGTTCT TCGACGGCAA CTACGACTTT CTCGGCACGG CACGCCCGCG CCAGATAGAG
GGCTTCTACA CCGACACGGG CAACTGGAGC ATGGAGCAGT GCGACGACGT CCGCTGTATC
CGGCAGGACG GTTCCGAATG GGGGCGTAGC GTGGGAGGCG AGTCCTCCGC CGTGTTCGCC
CGCATGTTCG GCCCGCGCTA TCAGGAGGCA CTGGACAGGG TGGAGACCTT CTTCCACTTT
CCGCTGGCCA TCTGTCAGGA AAGCCGCATG GCCGATGGCA GCCTGCGTCT GCATGTCCGT
CCTGAAGGCG GACGCATGGA CAGGGCGGGC GGGCTCGCCT TCGGCATCCG CAACGCGGGC
AACTACTTCG TGCTGCGACT CAACGCCGTC GAGAACAACG TCATTCTCTT CGAATTCGTG
AACAGCAGAC GGGTACGCCT TGAGCGGGTC GACGTCCGCA TCCGCAGCGG CACATGGGCC
GAACTTGGTG TGGACATCTG CGGCAAGACA GTGCGCTGCC TTCTGGACGG GGTGCCGCTC
TTTGAAACGG TTCTGCGCAT CACGCCCTAC GGCTACGCCG GACTCTGGAC CAAAGGGGAC
TCTGTGACCC TCTTCCGCGA CTTCACGGTC GAACCCGCAC TGGGCCTTGC CCACCCTATC
ATGCAGTGA
 
Protein sequence
MSFFDFLTRG RNTCDLLHAE EGAAARKYRH FKDFLHDNHR ALSAVADLEG LYHQGAYSLP 
EARRHLNELL EATDNLLGDL DRLTGGRYPA LHDVLARLNE ETGVILSAAT PAPCVPPVLS
LADVTPDMAS ACGTKSTNLA TMRNVLGIPT PPGFVVTARG FERFIEENAL GERIAQALSR
CAATARPGDD PVMLERVSAE IRDMVLHAPV PASLSDAILD AYRALEAATH PGVHVAMRSS
AVGEDTEASF AGQYVTVLNV TAADILTAYR EVLASKYSPR AILYRLSYGL EDRDTPMCVA
GIAMVRSRAS GVIYTVDPSA PDSGSLKVAA LLGLGELLAG GEGSADVFHV DRATGDLRNT
ELTEKTHRLV CLPDGGISLE AATAEERTRP AIDDAIVQRL HGYGIRLEEY FKCPQDIEWA
VAPDGDLFIL QSRPLGLVSA HRPQTAVEYT GHPRLITRGR VASPGAASGV VYNALGNNPD
HVPAGAILVA RTASPDYAGL MGRVRGIVTD VGSGASHLAS VAREFGIPAL FDTRDATMRL
THGATVTLDA DTTSVYEGTV DELASRTAPA RKPFFDSPAH HKLRRVLDLV SPLHLLDPRS
PEFDPAHCAS VHDIIRFAHE NAMREMFGLC DDSLDVTRAV RLHLGIPIML YCIDLGGGLD
DGLTSCERVE TAHVRSVPMR ALLDGLTHPG ISWQGGVALN AHSLLSVMAS AATADSENLG
GDSFAVVSRD YMNISAKFGY HYANIDALCS DRPSQNHVVL KFAGGAGSYF GRSLRVSFLA
SVLQRLGFGV DIRGDMLEAT LTGLDGDAQQ ELLDQTGRLL AAARLLDMTI GNGADVERMV
SMFFDGNYDF LGTARPRQIE GFYTDTGNWS MEQCDDVRCI RQDGSEWGRS VGGESSAVFA
RMFGPRYQEA LDRVETFFHF PLAICQESRM ADGSLRLHVR PEGGRMDRAG GLAFGIRNAG
NYFVLRLNAV ENNVILFEFV NSRRVRLERV DVRIRSGTWA ELGVDICGKT VRCLLDGVPL
FETVLRITPY GYAGLWTKGD SVTLFRDFTV EPALGLAHPI MQ
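
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the method used by the source database: it strips the whitespace, computes the GC fraction, and translates the nucleotide sequence up to the first stop codon. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGTCGTTCT TCGATTTTCT GACCCGAGGC"))  # MSFFDFLTRG
print(translate("ATGCAGTGA"))                         # MQ
```

The two printed fragments match the first block and the last two residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.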