Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2007 |
Symbol | |
ID | 4662898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 2337359 |
End bp | 2339923 |
Gene Length | 2565 bp |
Protein Length | 854 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639820250 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_967450 |
Protein GI | 120603050 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG3412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.436147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGCA TCGTCGTCGT CACGCATAGC GCGGTGTTGG GGCAGGGTCT CAGGGAACTG GCTGAGCAGA TGACACAGGG CCGAGTGCCG CTGGCTGTGG CCGGAGGCAT CGACGACCCC GACCATCCGA TAGGAACCGA CCCCGTGCGG GTGATGACCG CCATCGAGGA GGTGCAACAG GGCGACGGCG TGCTTGTGCT GATGGACCTT GGCAGCGCCC TCATGAGCGC GGAGACGGCC CTCGACCTTC TGCCGCCGGA GGTGGCCTCT CAGGTACGAC TCAGTGCCGC GCCACTGGTG GAGGGGCTCA TGGCGGCCGC TGTCCTCGCC TCGACCGGGG CCGACCTTGG AGCCGTGGCG GAGGAGGCGC AGAGCGCGCT GGCGGCGAAG CGCGAACTGC TCGGTGCGGC GGCCCCCGCA GCCCCGGCGA TGCCTTCAGC GCATCCGGAA GGGGCACGGG ATTCCACAGT GCCCTCGTCG GCAATGCCCG CTGGCGAGGA ACTCACGCTG GTGGTTCCCA ACAGGCTCGG CTTGCATGCG CGTCCGGCTG CACGCATCGT CACTGCACTG GGGCCGTTTG CCGCAGACGT GCAACTGGTG CGGGGCGACC GCGTCGTCTC CGCGCGTTCG GTGAACCGCA TCGCGACGCT GGCCGTGCGT GGCGGTGAGA CCGTGACCTT CAGGGCCGTG GGTGGCGATG CCGCCCTCGC CCTGCGAGCC ATCGCAGCTC TTGCGGCAGC TCATTTCGGC GATGCGCCTG AAGCCCCGTC CAAAGGGGAG GCCCCGTCCC CGGCGGAGGC GCCGAAGGAT GGGGACATGG CAGAGGCCGC CGTATCTGCC GGTGTTCAAG GTGGAGGCGT CCTGCGCGGG GCCGCCGCAT CTCCGGGGCT GACCGTCGGA AACGCGGTGT GGTACCGGCC CGCCTTCGAT GCCCCGGATG TCGCGCCCCT TGCGGACGAC CCGGCGACCG AGGTCACCCG TCTGGATGCG GCCCTCGGCG CGGCGCGCAC CGAACTGGTC GAACTCGAAC GCCGCACAGT TGCCGCAGCC GGTCGGAAAG AGGCCGAGAT ATTCGCCATG CACCGGTTGC TGCTCGACGA TGTCACCATC GCCGGGGCGG CGCGTCAGCG CATAATGGAC AGGCGCGAGG CAGCCGAATC CGCATGGTAT GAAGTGATCA GCGATGCAGC GGCGACCTTC CGGCAACTTC CCGAAGGCTA CATGCGTGAA CGTGAGGCCG ACATGGTGGA TGTGGGCGCC CGCGTACTGC GCCTGCTGAC CGGGGTTGCA GCGGTCGGCC CCCGTCTTGG CGGCCCTTCG GTGCTGCTGG CCACCGACCT TGGTCCTTCG GATATGGCGA CCCTAGACCC CTCGCTGGTC ATCGGCATCG TCACGGTGCA AGGCGGGGCC ACTTCCCATG CCGCCATCCT CGCCCGGTCA CTGGGCATCC CTGCCGTGGC AGGGCTGGGG CCAGCACTGC AGGGGGTGGG CGAGGGCGAC ATCGTGGCCC TCGATGGCGG CACCGGCGAC GTCTGGGTGA ATCCTGCACC GGGTGTGCGG GCCGCCGTCG AAGCCCGTCG TGACGCATGG CTTGCAGGGC GCGAAGCGGC CCTCGCTGGT GCGGCTGCGC CCGCAGTCAC CGCCGACGGC CGTGCTGTGC ACATACTCGC CAACATCGGT TCTTCCGCAG ATGCCGGTGC GGCACTCAAG AACGGCGCAG AGGGCGTGGG GCTCTTCCGG ACGGAGTTCC TGTTTCTCGA CCGGACATCC CCCCCCGGCG AGGAGGAACA GCTGACAGCC TACGTGACCG CCGCGGCAGC CATGCAGGGC CGCCCTGTGG TGGTGCGCAC GCTCGATATC GGCGGCGATA AACCGGTGTC CTATCTTGAA GGCTTCGCCA CCGGCGAGGA CAATCCCTTT CTCGGGCTGC GAGGCATACG TTTCTGCCTT GAGCGGAAAC CGCTTTTCAT GACCCAGCTG CGTGCGCTCT TGCGTGCTGC CGCCGTCCAT CCGCTGAAGG TCATGTTCCC CATGGTGGCG CACCCCGGCG AACTCGCCGC TGCGAAGGCG TTGCTGGATG AAGCCCGTGC GGCACTCGAC GCGGAAGGCA TGCCGCATGG CCAGCTTGAT GTGGGTATCA TGATCGAGGT GCCCGCGGCG GTGGCCCTTG CCGACCAGCT GGCACGTGAC GCCGCCTTCT TCAGCATAGG CACCAACGAC CTCGCCCAGT ATGTCATGGC GGCGGACAGG GGGAACGCCT CTGTGGCGGC CCTGTCGGAT GCGTTGCACC CTGCCGTGTT GCGCATGGTG CGTGACACGG TGCGGGCGGG GCATGCTGCG GGCATTCCCG TGGCGATATG CGGTGAGCTT GGGGGCAACC CGGAGGCCAT TCCCCTGCTG GTGGGACTTG AGCTTGATGA ACTGAGCATG AACGGCCCCG CCATCCCACG TGCCAAGGAA GTGGTGCGTG GGTGCGATAC AGGCACGTGC GCGGTTCTGG CCGACCGCGC CATGGCGTTG CCCGATGCCG CGGCCGTGCG GCGGCTACTG CAGGGTGGTT CGTGA
|
Protein sequence | MVGIVVVTHS AVLGQGLREL AEQMTQGRVP LAVAGGIDDP DHPIGTDPVR VMTAIEEVQQ GDGVLVLMDL GSALMSAETA LDLLPPEVAS QVRLSAAPLV EGLMAAAVLA STGADLGAVA EEAQSALAAK RELLGAAAPA APAMPSAHPE GARDSTVPSS AMPAGEELTL VVPNRLGLHA RPAARIVTAL GPFAADVQLV RGDRVVSARS VNRIATLAVR GGETVTFRAV GGDAALALRA IAALAAAHFG DAPEAPSKGE APSPAEAPKD GDMAEAAVSA GVQGGGVLRG AAASPGLTVG NAVWYRPAFD APDVAPLADD PATEVTRLDA ALGAARTELV ELERRTVAAA GRKEAEIFAM HRLLLDDVTI AGAARQRIMD RREAAESAWY EVISDAAATF RQLPEGYMRE READMVDVGA RVLRLLTGVA AVGPRLGGPS VLLATDLGPS DMATLDPSLV IGIVTVQGGA TSHAAILARS LGIPAVAGLG PALQGVGEGD IVALDGGTGD VWVNPAPGVR AAVEARRDAW LAGREAALAG AAAPAVTADG RAVHILANIG SSADAGAALK NGAEGVGLFR TEFLFLDRTS PPGEEEQLTA YVTAAAAMQG RPVVVRTLDI GGDKPVSYLE GFATGEDNPF LGLRGIRFCL ERKPLFMTQL RALLRAAAVH PLKVMFPMVA HPGELAAAKA LLDEARAALD AEGMPHGQLD VGIMIEVPAA VALADQLARD AAFFSIGTND LAQYVMAADR GNASVAALSD ALHPAVLRMV RDTVRAGHAA GIPVAICGEL GGNPEAIPLL VGLELDELSM NGPAIPRAKE VVRGCDTGTC AVLADRAMAL PDAAAVRRLL QGGS
|
| |