Gene Gdia_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1899 
Symbol 
ID6975322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2114354 
End bp2116210 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content73% 
IMG OID643391425 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_002276274 
Protein GI209544045 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0263393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGA CTGACGCCGC CGGCCCCTCG TCCGCGCCGC GCCGGCGGGC TGCCGGCCGG 
CGGGGAACGC GGGCGGAACA CCGGCTGTCC GGCGATGCCA CGGTGCGCGG CATCGCCATC
GGCCCGGCGG CGGTGGCGCT GGAAAGCCCG GCGCCGGATA TGGATCCCGA CGCCCGCGCC
GCCGACCCCG CACCCGAGCT GGAGCGCCTG GCCGAGGCCG TGGAACGGTC GGTGCGGCAG
GTCGAACGCC TGCGCGACCG CCTGGCGGTG CTGCCCGAGG ACAGCCAGAT CGAAATCGGA
TCGCTGCTGG AGGTCTATCG CCGGATGCTC GGCCCGTCGC GCCTGCAGCG CGGCATCCGC
CGCCGCATCG TCCAGGACGG ACTGACAGCC GAAGCCGCCG TCCGGCAGGA AACCGAAAGC
CTGGCCCTGG CCCTGCTGGG CGGGGCCGAC GCCCCGGTGC CCGAGGGCGA GGACGCCGCC
GCCGCCCAGC GCCGCGCGGG CGAATTCCGC GAAATCGGCC GGCGGCTGCT GCGCAACCTG
GGGCGCATGC CCTTTCGCTC GTTCAGCGCG CTTCCCGAGG GCGCGGTGCT GGTCACCGAA
CAGTTGCGCC CCGCCGATGC GGCGCTGATC GACCCGTCGC GCATCGTCGC GGTCGCGACC
GAGGAAGGGG GCGCCACCGA CCACACCGCC ATCATGCTGC GCGCCCTGGG CATTCCCGCC
GTGCTGGCCG CCCACGGGCT GATGGCGCGG GTGCGCGAGG GGGCCACCGT GGTGGTGGAC
GGCACCGCCG GGCTGGTGGT GGTGGACCCG ACCGAGGATA CGCTGGCGGC GGCGCGCGGC
GGGGTGGCCG AACATGCGCG CGAACGCCAG GCGCTGGGCC GGCTGCGCCG CCTGCCGGCC
CGCCTGTCCA GCGGCGAGAA GCTGCATCTG CAGGCCAATC TGGAACTGCC GGCCGAACTG
GCGCTGATCG CGCAGTCCGG CGCGTCGGGC ATCGGCCTGC TGCGCACGGA ATTCCTGTTC
ATCAATGCCG AAACCATGCC GGACGAGGAC AGCCAGGCCG CGATCTATTC CGAAATCATC
ACCGCGATGG CAGGGGATAC CACCACCATC CGCGTGGTGG ACTGGGGCGG CGAAAAGCAT
AGCGAGGCCC TGAACCGCGC GGGGCTGGAC CGTGACGGCG ACAACGTCAA TCCGGCGCTG
GGCGTGCGCG GCCTGCGGCT GCTGCTGCGC CATCCCGCGA TCCTGGAAAC CCAGTTCGCC
GCGATCCTGA AGGCGTCGTC CGCCGGGCCG ATGCGCGTCA TGCTGCCGAT GGTCACGACC
GTCCCGGAAC TGCGCGAGGC CCGCGACATC TATCAGCGCG TCGCGCGCCG CCTGCGCCGC
CGGGGGGTGA AGCTGGGTGA CAGCCTGCCG CCGCTGGGCA TCATGGTCGA AACCCCGGCC
GCCGCCATCA TGGGCGATGC GCTGGCGCAG GAAGCCGAAT TCCTGGCCAT CGGCACCAAC
GACCTGACGA TGTACACGCT GGCGGCCGAT CGCGCCCTGG CCGATGTCGC GTCCCTCTAC
CAGCCGCTGC ATCCCGCCGT GCTGCGCCTG ATCCAGACGG TGACCGAGGC CGCGCTGCGC
CAGTATCGTC CGATTTCGAT CTGCGGGGAA ATCGCCGGCG ATCCGCGGGT GGTGCCGCTG
CTGGTCGGGC TGGGCCTGCG CAGCTTCTCG ATGACCGCCT CCGCCGTGCC CCGGGTGAAG
CGCAGGGTGC GCGCCCTGTC GTTCGAGGAC TGCCGACGGC TGGCCCACCG CGTGATGGAA
TCCCCGGACG TGGCCGAAGT CCTGTCCCTG ATCGACGCCT TCGCCGCGGG GGGGTAG
 
Protein sequence
MKTTDAAGPS SAPRRRAAGR RGTRAEHRLS GDATVRGIAI GPAAVALESP APDMDPDARA 
ADPAPELERL AEAVERSVRQ VERLRDRLAV LPEDSQIEIG SLLEVYRRML GPSRLQRGIR
RRIVQDGLTA EAAVRQETES LALALLGGAD APVPEGEDAA AAQRRAGEFR EIGRRLLRNL
GRMPFRSFSA LPEGAVLVTE QLRPADAALI DPSRIVAVAT EEGGATDHTA IMLRALGIPA
VLAAHGLMAR VREGATVVVD GTAGLVVVDP TEDTLAAARG GVAEHARERQ ALGRLRRLPA
RLSSGEKLHL QANLELPAEL ALIAQSGASG IGLLRTEFLF INAETMPDED SQAAIYSEII
TAMAGDTTTI RVVDWGGEKH SEALNRAGLD RDGDNVNPAL GVRGLRLLLR HPAILETQFA
AILKASSAGP MRVMLPMVTT VPELREARDI YQRVARRLRR RGVKLGDSLP PLGIMVETPA
AAIMGDALAQ EAEFLAIGTN DLTMYTLAAD RALADVASLY QPLHPAVLRL IQTVTEAALR
QYRPISICGE IAGDPRVVPL LVGLGLRSFS MTASAVPRVK RRVRALSFED CRRLAHRVME
SPDVAEVLSL IDAFAAGG