Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1899 |
Symbol | |
ID | 6975322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2114354 |
End bp | 2116210 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643391425 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_002276274 |
Protein GI | 209544045 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0263393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGA CTGACGCCGC CGGCCCCTCG TCCGCGCCGC GCCGGCGGGC TGCCGGCCGG CGGGGAACGC GGGCGGAACA CCGGCTGTCC GGCGATGCCA CGGTGCGCGG CATCGCCATC GGCCCGGCGG CGGTGGCGCT GGAAAGCCCG GCGCCGGATA TGGATCCCGA CGCCCGCGCC GCCGACCCCG CACCCGAGCT GGAGCGCCTG GCCGAGGCCG TGGAACGGTC GGTGCGGCAG GTCGAACGCC TGCGCGACCG CCTGGCGGTG CTGCCCGAGG ACAGCCAGAT CGAAATCGGA TCGCTGCTGG AGGTCTATCG CCGGATGCTC GGCCCGTCGC GCCTGCAGCG CGGCATCCGC CGCCGCATCG TCCAGGACGG ACTGACAGCC GAAGCCGCCG TCCGGCAGGA AACCGAAAGC CTGGCCCTGG CCCTGCTGGG CGGGGCCGAC GCCCCGGTGC CCGAGGGCGA GGACGCCGCC GCCGCCCAGC GCCGCGCGGG CGAATTCCGC GAAATCGGCC GGCGGCTGCT GCGCAACCTG GGGCGCATGC CCTTTCGCTC GTTCAGCGCG CTTCCCGAGG GCGCGGTGCT GGTCACCGAA CAGTTGCGCC CCGCCGATGC GGCGCTGATC GACCCGTCGC GCATCGTCGC GGTCGCGACC GAGGAAGGGG GCGCCACCGA CCACACCGCC ATCATGCTGC GCGCCCTGGG CATTCCCGCC GTGCTGGCCG CCCACGGGCT GATGGCGCGG GTGCGCGAGG GGGCCACCGT GGTGGTGGAC GGCACCGCCG GGCTGGTGGT GGTGGACCCG ACCGAGGATA CGCTGGCGGC GGCGCGCGGC GGGGTGGCCG AACATGCGCG CGAACGCCAG GCGCTGGGCC GGCTGCGCCG CCTGCCGGCC CGCCTGTCCA GCGGCGAGAA GCTGCATCTG CAGGCCAATC TGGAACTGCC GGCCGAACTG GCGCTGATCG CGCAGTCCGG CGCGTCGGGC ATCGGCCTGC TGCGCACGGA ATTCCTGTTC ATCAATGCCG AAACCATGCC GGACGAGGAC AGCCAGGCCG CGATCTATTC CGAAATCATC ACCGCGATGG CAGGGGATAC CACCACCATC CGCGTGGTGG ACTGGGGCGG CGAAAAGCAT AGCGAGGCCC TGAACCGCGC GGGGCTGGAC CGTGACGGCG ACAACGTCAA TCCGGCGCTG GGCGTGCGCG GCCTGCGGCT GCTGCTGCGC CATCCCGCGA TCCTGGAAAC CCAGTTCGCC GCGATCCTGA AGGCGTCGTC CGCCGGGCCG ATGCGCGTCA TGCTGCCGAT GGTCACGACC GTCCCGGAAC TGCGCGAGGC CCGCGACATC TATCAGCGCG TCGCGCGCCG CCTGCGCCGC CGGGGGGTGA AGCTGGGTGA CAGCCTGCCG CCGCTGGGCA TCATGGTCGA AACCCCGGCC GCCGCCATCA TGGGCGATGC GCTGGCGCAG GAAGCCGAAT TCCTGGCCAT CGGCACCAAC GACCTGACGA TGTACACGCT GGCGGCCGAT CGCGCCCTGG CCGATGTCGC GTCCCTCTAC CAGCCGCTGC ATCCCGCCGT GCTGCGCCTG ATCCAGACGG TGACCGAGGC CGCGCTGCGC CAGTATCGTC CGATTTCGAT CTGCGGGGAA ATCGCCGGCG ATCCGCGGGT GGTGCCGCTG CTGGTCGGGC TGGGCCTGCG CAGCTTCTCG ATGACCGCCT CCGCCGTGCC CCGGGTGAAG CGCAGGGTGC GCGCCCTGTC GTTCGAGGAC TGCCGACGGC TGGCCCACCG CGTGATGGAA TCCCCGGACG TGGCCGAAGT CCTGTCCCTG ATCGACGCCT TCGCCGCGGG GGGGTAG
|
Protein sequence | MKTTDAAGPS SAPRRRAAGR RGTRAEHRLS GDATVRGIAI GPAAVALESP APDMDPDARA ADPAPELERL AEAVERSVRQ VERLRDRLAV LPEDSQIEIG SLLEVYRRML GPSRLQRGIR RRIVQDGLTA EAAVRQETES LALALLGGAD APVPEGEDAA AAQRRAGEFR EIGRRLLRNL GRMPFRSFSA LPEGAVLVTE QLRPADAALI DPSRIVAVAT EEGGATDHTA IMLRALGIPA VLAAHGLMAR VREGATVVVD GTAGLVVVDP TEDTLAAARG GVAEHARERQ ALGRLRRLPA RLSSGEKLHL QANLELPAEL ALIAQSGASG IGLLRTEFLF INAETMPDED SQAAIYSEII TAMAGDTTTI RVVDWGGEKH SEALNRAGLD RDGDNVNPAL GVRGLRLLLR HPAILETQFA AILKASSAGP MRVMLPMVTT VPELREARDI YQRVARRLRR RGVKLGDSLP PLGIMVETPA AAIMGDALAQ EAEFLAIGTN DLTMYTLAAD RALADVASLY QPLHPAVLRL IQTVTEAALR QYRPISICGE IAGDPRVVPL LVGLGLRSFS MTASAVPRVK RRVRALSFED CRRLAHRVME SPDVAEVLSL IDAFAAGG
|
| |