Gene Dvul_2007 details

Gene Information

Locus tag: Dvul_2007
Symbol:
ID: 4662898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2337359
End bp: 2339923
Gene Length: 2565 bp
Protein Length: 854 aa
Translation table: 11
GC content: 69%
IMG OID: 639820250
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_967450
Protein GI: 120603050
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
        [COG3412] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01003] Phosphotransferase System HPr (HPr) Family
            [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
            [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit
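
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon; neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2337359, 2339923)
print(length)                     # 2565
print(protein_length_aa(length))  # 854
```

Under those assumptions both results agree with the Gene Length and Protein Length fields of this record.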


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.436147
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGGCA TCGTCGTCGT CACGCATAGC GCGGTGTTGG GGCAGGGTCT CAGGGAACTG 
GCTGAGCAGA TGACACAGGG CCGAGTGCCG CTGGCTGTGG CCGGAGGCAT CGACGACCCC
GACCATCCGA TAGGAACCGA CCCCGTGCGG GTGATGACCG CCATCGAGGA GGTGCAACAG
GGCGACGGCG TGCTTGTGCT GATGGACCTT GGCAGCGCCC TCATGAGCGC GGAGACGGCC
CTCGACCTTC TGCCGCCGGA GGTGGCCTCT CAGGTACGAC TCAGTGCCGC GCCACTGGTG
GAGGGGCTCA TGGCGGCCGC TGTCCTCGCC TCGACCGGGG CCGACCTTGG AGCCGTGGCG
GAGGAGGCGC AGAGCGCGCT GGCGGCGAAG CGCGAACTGC TCGGTGCGGC GGCCCCCGCA
GCCCCGGCGA TGCCTTCAGC GCATCCGGAA GGGGCACGGG ATTCCACAGT GCCCTCGTCG
GCAATGCCCG CTGGCGAGGA ACTCACGCTG GTGGTTCCCA ACAGGCTCGG CTTGCATGCG
CGTCCGGCTG CACGCATCGT CACTGCACTG GGGCCGTTTG CCGCAGACGT GCAACTGGTG
CGGGGCGACC GCGTCGTCTC CGCGCGTTCG GTGAACCGCA TCGCGACGCT GGCCGTGCGT
GGCGGTGAGA CCGTGACCTT CAGGGCCGTG GGTGGCGATG CCGCCCTCGC CCTGCGAGCC
ATCGCAGCTC TTGCGGCAGC TCATTTCGGC GATGCGCCTG AAGCCCCGTC CAAAGGGGAG
GCCCCGTCCC CGGCGGAGGC GCCGAAGGAT GGGGACATGG CAGAGGCCGC CGTATCTGCC
GGTGTTCAAG GTGGAGGCGT CCTGCGCGGG GCCGCCGCAT CTCCGGGGCT GACCGTCGGA
AACGCGGTGT GGTACCGGCC CGCCTTCGAT GCCCCGGATG TCGCGCCCCT TGCGGACGAC
CCGGCGACCG AGGTCACCCG TCTGGATGCG GCCCTCGGCG CGGCGCGCAC CGAACTGGTC
GAACTCGAAC GCCGCACAGT TGCCGCAGCC GGTCGGAAAG AGGCCGAGAT ATTCGCCATG
CACCGGTTGC TGCTCGACGA TGTCACCATC GCCGGGGCGG CGCGTCAGCG CATAATGGAC
AGGCGCGAGG CAGCCGAATC CGCATGGTAT GAAGTGATCA GCGATGCAGC GGCGACCTTC
CGGCAACTTC CCGAAGGCTA CATGCGTGAA CGTGAGGCCG ACATGGTGGA TGTGGGCGCC
CGCGTACTGC GCCTGCTGAC CGGGGTTGCA GCGGTCGGCC CCCGTCTTGG CGGCCCTTCG
GTGCTGCTGG CCACCGACCT TGGTCCTTCG GATATGGCGA CCCTAGACCC CTCGCTGGTC
ATCGGCATCG TCACGGTGCA AGGCGGGGCC ACTTCCCATG CCGCCATCCT CGCCCGGTCA
CTGGGCATCC CTGCCGTGGC AGGGCTGGGG CCAGCACTGC AGGGGGTGGG CGAGGGCGAC
ATCGTGGCCC TCGATGGCGG CACCGGCGAC GTCTGGGTGA ATCCTGCACC GGGTGTGCGG
GCCGCCGTCG AAGCCCGTCG TGACGCATGG CTTGCAGGGC GCGAAGCGGC CCTCGCTGGT
GCGGCTGCGC CCGCAGTCAC CGCCGACGGC CGTGCTGTGC ACATACTCGC CAACATCGGT
TCTTCCGCAG ATGCCGGTGC GGCACTCAAG AACGGCGCAG AGGGCGTGGG GCTCTTCCGG
ACGGAGTTCC TGTTTCTCGA CCGGACATCC CCCCCCGGCG AGGAGGAACA GCTGACAGCC
TACGTGACCG CCGCGGCAGC CATGCAGGGC CGCCCTGTGG TGGTGCGCAC GCTCGATATC
GGCGGCGATA AACCGGTGTC CTATCTTGAA GGCTTCGCCA CCGGCGAGGA CAATCCCTTT
CTCGGGCTGC GAGGCATACG TTTCTGCCTT GAGCGGAAAC CGCTTTTCAT GACCCAGCTG
CGTGCGCTCT TGCGTGCTGC CGCCGTCCAT CCGCTGAAGG TCATGTTCCC CATGGTGGCG
CACCCCGGCG AACTCGCCGC TGCGAAGGCG TTGCTGGATG AAGCCCGTGC GGCACTCGAC
GCGGAAGGCA TGCCGCATGG CCAGCTTGAT GTGGGTATCA TGATCGAGGT GCCCGCGGCG
GTGGCCCTTG CCGACCAGCT GGCACGTGAC GCCGCCTTCT TCAGCATAGG CACCAACGAC
CTCGCCCAGT ATGTCATGGC GGCGGACAGG GGGAACGCCT CTGTGGCGGC CCTGTCGGAT
GCGTTGCACC CTGCCGTGTT GCGCATGGTG CGTGACACGG TGCGGGCGGG GCATGCTGCG
GGCATTCCCG TGGCGATATG CGGTGAGCTT GGGGGCAACC CGGAGGCCAT TCCCCTGCTG
GTGGGACTTG AGCTTGATGA ACTGAGCATG AACGGCCCCG CCATCCCACG TGCCAAGGAA
GTGGTGCGTG GGTGCGATAC AGGCACGTGC GCGGTTCTGG CCGACCGCGC CATGGCGTTG
CCCGATGCCG CGGCCGTGCG GCGGCTACTG CAGGGTGGTT CGTGA
 
Protein sequence
MVGIVVVTHS AVLGQGLREL AEQMTQGRVP LAVAGGIDDP DHPIGTDPVR VMTAIEEVQQ 
GDGVLVLMDL GSALMSAETA LDLLPPEVAS QVRLSAAPLV EGLMAAAVLA STGADLGAVA
EEAQSALAAK RELLGAAAPA APAMPSAHPE GARDSTVPSS AMPAGEELTL VVPNRLGLHA
RPAARIVTAL GPFAADVQLV RGDRVVSARS VNRIATLAVR GGETVTFRAV GGDAALALRA
IAALAAAHFG DAPEAPSKGE APSPAEAPKD GDMAEAAVSA GVQGGGVLRG AAASPGLTVG
NAVWYRPAFD APDVAPLADD PATEVTRLDA ALGAARTELV ELERRTVAAA GRKEAEIFAM
HRLLLDDVTI AGAARQRIMD RREAAESAWY EVISDAAATF RQLPEGYMRE READMVDVGA
RVLRLLTGVA AVGPRLGGPS VLLATDLGPS DMATLDPSLV IGIVTVQGGA TSHAAILARS
LGIPAVAGLG PALQGVGEGD IVALDGGTGD VWVNPAPGVR AAVEARRDAW LAGREAALAG
AAAPAVTADG RAVHILANIG SSADAGAALK NGAEGVGLFR TEFLFLDRTS PPGEEEQLTA
YVTAAAAMQG RPVVVRTLDI GGDKPVSYLE GFATGEDNPF LGLRGIRFCL ERKPLFMTQL
RALLRAAAVH PLKVMFPMVA HPGELAAAKA LLDEARAALD AEGMPHGQLD VGIMIEVPAA
VALADQLARD AAFFSIGTND LAQYVMAADR GNASVAALSD ALHPAVLRMV RDTVRAGHAA
GIPVAICGEL GGNPEAIPLL VGLELDELSM NGPAIPRAKE VVRGCDTGTC AVLADRAMAL
PDAAAVRRLL QGGS
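
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step together with a GC-percentage calculation; the helper names are hypothetical, and the page does not say how its own GC content value was derived.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above, copied verbatim.
first_line = "ATGGTCGGCA TCGTCGTCGT CACGCATAGC GCGGTGTTGG GGCAGGGTCT CAGGGAACTG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `join_blocks` and `gc_percent` would allow a comparison with the Gene Length and GC content fields of the record.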