Gene Gdia_0233 details

Gene Information

Locus tag: Gdia_0233
Symbol:
ID: 6973625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 256261
End bp: 258555
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 67%
IMG OID: 643389764
Product: polyphosphate kinase
Protein accession: YP_002274645
Protein GI: 209542416
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0855] Polyphosphate kinase
TIGRFAM ID:
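
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 256261
END_BP = 258555
GENE_LENGTH_BP = 2295
PROTEIN_LENGTH_AA = 764


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# 258555 - 256261 + 1 = 2295 bp, and 2295 / 3 - 1 = 764 aa.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold, so the reported coordinates, gene length, and protein length agree with one another.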


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.61377
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCAGAC CGACGCGCAA GACGCCCGCG CGCCAGCCCT CCGCGCGCCC CGCTCCCAAG 
CGCTCGGTCC CCAAGCGTAC GGGGACGTCC GCGCGGCGGC GCGCCCCGGC ATCCGACACG
ACGCCGCCCG TGCCGCCCCC GCCCGTGGAC ATGCAGTCAC CCACGCGGTT CATCAACCGT
GAACTCTCGT GGCTGGATTT CAACCAGCGC GTGGTCGAGG AAGCCGACAA CCCCCGCAAT
CCGCTGCTGG AACGGCTGCG CTTCCTGTCG ATCAGCGCCG GCAACCTGGA CGAATTCTAT
TCCGTCCGCG TCGCCGGCCT GGTGGGGCAG GTGCGCGAGG GGCTGGTCAC CAGTTCGCCC
GATGGCCTGA CCCCGGTCCA GCAGCTTGCC GCCGTGCGCA CGCGCACGAT CCGGCTGCTG
CGCGAACAGC AGAGGATCTG GAAGGACCTG CGCGGCCTGC TGGCCGAGTC CGGGATCGTG
GTGTGCAGCC TGGACACCAT TTCCGACGCC GACCGCGACT GGCTGGACAG TTGCTTCATG
GACCGGATCT TCCCCGTCCT GACGCCGCTG GCCATCGATC CGGCCCATCC GCTGCCCTTC
ATCCCGCATA TGGGCCTGGC CCTGTCGCTG CGGCTGATGA GCCGGGACAC CGGCCGTTTC
GTCATGAGCG CGATGATCCT GCTGCCGGCG CAGATCGAGC GGTTCGTGCG GCTGCCGGCC
AACCTGTCGC CGCCCGGCGT GACGCGCTTC ATCCTGCTGG AAAACCTGAT CGCGTTGTGC
ATCGACCGGC TGTTTCCCGG CATGGTGGCG GGCGAATGGG GCGTGCTGCG CGTGATCCGC
GACACCGACG TGGAATTCGA GGACGAGGCC GAGGATCTGG TCCGGTCCTA CGAATCCGCC
CTGAAGCGCC GCCGCCGGGG GGTGGTCATC CACCTGGACA TCGATTCGCG CATGCCGGCC
GACCTGGGGC AGGCGGTGGC GACCGACCTG GCCGTTCCGC CCGACGAGGT CGAGATCCAG
CCCGGGCTGA TCGGCGTGGT CGACCTGAAG CAGTTGATCG TCGATGACCG GCCGGACCTG
CTGTTCCCGC CCTATACCCC CCGCTTTCCC GAACGGGTCG TCGATTTCGG CGGGGACTGC
TTCGCCGCCA TCCGCGCCAA GGACATGATC GTCCATCACC CGTTCGAAAG CTTCGACGTG
GTGATCCAGT TCCTGCGCCA GGCGGCCCTG GACCCGGCGG TGGTCGCGAT CAAGCAGACC
CTGTACCGCA CATCGCGCGA CAGCCCGATC GTCAAGGCGC TGATCGAGGC TGCCGAGGCC
GGAAAATCGG TCACCGCGAT GGTCGAACTG CGCGCCCGCT TCGATGAGGA AGCCAATATC
CGCCTGGCCC GCGCGCTGGA GGCCGCGGGC GTGCAGGTCG TATTCGGCTT CGCGGACCTG
AAGACCCATG CGAAGCTCAG CCTGGTCGTC CGGCGAGAGG GCGGGTCCCT GCGGTCCTAC
GCCCATTTCG GGACGGGCAA CTACCACCCG ATCACGGCCC GGATCTATAC CGACCTGTCC
TTCTTCACCT GCGATCCCGA GCTGGCACGG GATTCCGCGC GGCTGTTCAA CTACATGACC
GGCTATGCGC TGCCCACGCG GATGGAAGCC ATCGCCTTCT CGCCGGTGAC GATCCGCCGC
ACGCTGGAGG AACTGATCGA GGGCGAGATC GAGCATGTCC GCGCCGGCCG TCCCGGCCAG
ATCTGGCTGA AGATGAACTC GCTGGTCGAT CCCGACCTGA TCGACCGGCT GTACCGGGCG
TCGTGTGCCG GGGTGCGCAT CATGGGCGTG GTGCGCGGAA TCTGCTGCCT GCGGCCCGGC
GTGCCGGGCC TGTCGGAAAA CATCCGCATC AAGTCCATCG TCGGGCGGTT CCTGGAACAT
GCCCGGATCT TCGCCTTCGG CGACGGGCAC CGGCTGCCCT CGCGCCAGGC CCGCATCTAT
ATCTCGTCGG CCGACTGGAT GGTGCGCAAC ATGGACTGGC GGGTCGAAAG CATGGTGCCG
ATGCGCGACC CCACCGTCCA TGCCCAGGTG CTGGACCAGA TCATGGTCAC GGACCTGAAG
GATAACCTGC AATCGTGGAT CCTGCAGCAG AACGGTGTAT GGAGGCGGCT GGAGCCGGGT
GCGAAACCGT TTTCGGCCCA TGACTATTTC ATGACCAATC CCTCGCTGTC CGGTCGCGGC
CGGGCCGCGC AGGACAGCGC GATCCGGGTC AGCACTGCGC TGCCCCGCCA CCAGGACCGG
ATTCTCGATG ATTGA
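
The gene sequence above is displayed in space-separated blocks of 10 bases. A minimal sketch for removing that layout and computing a GC fraction follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the calculation simply counts G and C over the total length, which may differ from how the record's GC content field was derived.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of bases that are G or C in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence: 5 of its 10 bases are G or C.
print(gc_fraction("TTGACCAGAC"))  # 0.5
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared with the GC content field reported under Gene Information.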
 
Protein sequence
MTRPTRKTPA RQPSARPAPK RSVPKRTGTS ARRRAPASDT TPPVPPPPVD MQSPTRFINR 
ELSWLDFNQR VVEEADNPRN PLLERLRFLS ISAGNLDEFY SVRVAGLVGQ VREGLVTSSP
DGLTPVQQLA AVRTRTIRLL REQQRIWKDL RGLLAESGIV VCSLDTISDA DRDWLDSCFM
DRIFPVLTPL AIDPAHPLPF IPHMGLALSL RLMSRDTGRF VMSAMILLPA QIERFVRLPA
NLSPPGVTRF ILLENLIALC IDRLFPGMVA GEWGVLRVIR DTDVEFEDEA EDLVRSYESA
LKRRRRGVVI HLDIDSRMPA DLGQAVATDL AVPPDEVEIQ PGLIGVVDLK QLIVDDRPDL
LFPPYTPRFP ERVVDFGGDC FAAIRAKDMI VHHPFESFDV VIQFLRQAAL DPAVVAIKQT
LYRTSRDSPI VKALIEAAEA GKSVTAMVEL RARFDEEANI RLARALEAAG VQVVFGFADL
KTHAKLSLVV RREGGSLRSY AHFGTGNYHP ITARIYTDLS FFTCDPELAR DSARLFNYMT
GYALPTRMEA IAFSPVTIRR TLEELIEGEI EHVRAGRPGQ IWLKMNSLVD PDLIDRLYRA
SCAGVRIMGV VRGICCLRPG VPGLSENIRI KSIVGRFLEH ARIFAFGDGH RLPSRQARIY
ISSADWMVRN MDWRVESMVP MRDPTVHAQV LDQIMVTDLK DNLQSWILQQ NGVWRRLEPG
AKPFSAHDYF MTNPSLSGRG RAAQDSAIRV STALPRHQDR ILDD
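
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of several alternative initiation codons and an initiation codon is read as methionine. The sketch below illustrates the relationship; it assumes the amino-acid assignments of table 11 match the standard code and uses the NCBI initiation codon set for that table. `translate_cds` is an illustrative name, not a tool used by the record.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initiation codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # the initiator codon is read as methionine
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("TTGACCAGAC CGACGCGCAA GACGCCCGCG"))  # MTRPTRKTPA
```

The sketch raises a `KeyError` on ambiguous bases such as N, which do not occur in the sequence shown here.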