Gene Gdia_1822 details


Gene Information

Locus tag: Gdia_1822
Symbol:
ID: 6975244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2021998
End bp: 2024388
Gene Length: 2391 bp
Protein Length: 796 aa
Translation table: 11
GC content: 65%
IMG OID: 643391347
Product: putative phosphoketolase
Protein accession: YP_002276197
Protein GI: 209543968
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3957] Phosphoketolase
TIGRFAM ID:
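
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 2021998
END_BP = 2024388


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2391 796
```

Both values agree with the Gene Length and Protein Length fields of the record.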


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAGA TGTCCCCCCT GTCCACCGCG CCCCTGACGG CCACGCCGCT GTCGGCGGCG 
GAACTCGGCC TGTTCAACCG GTGGTGGCAT GCGGCGAACT ATCTGTCGGT GGGCCAGATC
TACCTGCTGG CCAACCCGCT GCTGCGTGAA CCGCTGAAGC TGGAACATAC CAAGCCGCGC
CTGCTGGGCC ACTGGGGCAC GACCCCGGGG CTGAACTTCC TCTACCTGCA CCTCAACCGC
ATCATCCGCG CGCGGGACGC CAACATCCTC TTCATCGCCG GTCCCGGGCA TGGCGCGCCG
GGCGTCGTGG CCAATACGTA CCTCGAAGGC ACGTACAGCG AATATTTCCC CGAGGTGTCG
CAGGACGAAA ACGGGCTGGG CCGACTGCTG AAGCAGTTTT CCTTCCCCGG CGGCATTCCC
AGCCACGCGG CGGCGACGAC CCCGGGCTCG ATTCATGAGG GTGGCGAGCT GGGCTACTCG
CTCTCGCACG CCTATGGCGC GGTGTTCGAC AATCCCGACC TGATCGTGGC CTGCGTCATC
GGCGATGGCG AGGCCGAGAC CGGTCCGCTG GCCACGTCGT GGCACAGCAA CAAGTTCCTC
GACCCGCAGA CCGACGGCGC CGTGCTGCCG ATCCTGCACC TGAACGGCTA CAAGATCGCG
AACCCCACCA TCCTGGCCCG CATTCCGCAG CCCGAACTGG AAGCGCTGCT GCGGGGCTAC
GGGTACGATC CGATCTTCGT CGAAGGCCAC GACCCCGACC CAATGCACCA GAAGATGGCG
ACGGCCATGG ACCTGGCCTT CGACCGGATC GCCGCGATCC AGAAGGCGGC GCGCGAGGGC
GGGCAGACCG AACGCCCCAC CTGGCCGATG ATCGTCATGC GCAGCCCCAA GGGCTGGACC
GGGCCGAAGG AAATCGACGG CCTGCGGACC GAAGGCTACT GGCGGTCGCA TCAGGTGCCG
TTCTCGGACC TGACCAAGCC CGGGCACCTG CAGGCGCTGG AAACCTGGCT GCGCAGCTAC
AAGCCCGAGG ACCTGTTCGA CGAATCCGGC CGCCTGTTCG CCGACATCGC GGCCCTGGCC
CCGTCCGGCG CACGGCGCAT GAGCGACAAC CCCCATGCCA ATGGCGGACA GTTGCGCCAC
CCGCTGAAAC TGCCGGACAT CGCGAACTAC GCCGTGCCGG TCACCACGCC GGGCAGCGTG
ACGGGCGAGG GCACGCGCGT GCTGGGCACG TGGCTGCGCG ACGTGATGAA GGAAAACCTG
CCCTCGCGCA ATTTCCGTGT CCTGGCCCCG GACGAAAACA ATTCCAACCG CCTGAACGCG
GTGCTGGACG TCACCAACCG CGCGTGGAAC GCGGAAACGG TGGATTATGA CGACCACTTG
GCGCGCGACG GCCATGTGAT GGAAATCCTC AGCGAGCACA CCTGCCAGGG CTGGCTGGAA
GGCTACCTGC TGACCGGCCG CCATGGCTTC ATGTCGTGCT ACGAGGCGTT CATCCACATT
GTCGATTCGA TGGTGAATCA GCATGCGAAA TGGCTGAAGA CCTCGGCCGA AGTGCCGTGG
CGGCGGTCGA TTTCCTCGCT GAACTACCTG CTGACGTCGC ATGTCTGGCG CCAGGACCAT
AACGGCTTCA GCCACCAGGA CCCCGGCTTC ATCGATCATG TGATCAACAA GAAGGCCGAT
ATCGTCCGCG TGTACCTGCC GCCCGATGCC AACACCCTGC TCTGCACCGC CGCGCACTGC
CTGCATAGCT GGGACCGCAT CAACGTCATC GTGGCCGGCA AGCAGCCGGA ACCGCAATGG
CTGAGCATGG AAGACGCCAT CCGCCATTGC AGCGCCGGCA TCGGTATCTG GGAATGGGCC
AGCAACGACA AGGGCAGCGA ACCAGACGTG GTGATGGCCT GCGCCGGCGA CGTACCCACC
ATCGAAACGC TGGCGGCGGT CAAGCTGTTG CGTGAGCACG CGCCGGACCT GAAGATCCGC
GTCATCAACG TTGTGGACCT GATGACGCTG GAATCGGCCA GCCAGCACCC GCACGGCCTG
ACGGATTCCG CCTTCGACGC GCTGTTCACG CTGGACAAGC CGGTCATCTT CGCCTTCCAT
GGCTATCCGC AGCTGATCCA CAAGCTGATC TACCGCCGCG CCAACGCCCG GAACTTCCAC
GTCCACGGCT TCCGCGAGGA AGGATCGACC ACCACCCCGT TCGACATGGT GGTGCGCAAC
CACCTCGACC GGTTCCACAT CGTCTCGAAC GTCATCGACC GCGTGCCCGG CCTGGCCACC
CGCGCCGCCT ACGCCAAACA GGCCATCCGC GACAAGCTGG TGGACCACAC CCGCTACATC
GCCGAATACG GACGCGACAT GCCGGAAATC GAAGACTGGC GCTGGTCGTA A
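
The gene sequence above is laid out in blocks of ten bases separated by spaces. A minimal sketch of how such a block could be cleaned up and its GC fraction computed follows; the helper names are hypothetical, and only the first line of the sequence is used here for brevity. Running the same functions on the full sequence gives a value that can be compared with the GC content field of the record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACGAGA TGTCCCCCCT GTCCACCGCG CCCCTGACGG CCACGCCGCT GTCGGCGGCG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```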
 
Protein sequence
MNEMSPLSTA PLTATPLSAA ELGLFNRWWH AANYLSVGQI YLLANPLLRE PLKLEHTKPR 
LLGHWGTTPG LNFLYLHLNR IIRARDANIL FIAGPGHGAP GVVANTYLEG TYSEYFPEVS
QDENGLGRLL KQFSFPGGIP SHAAATTPGS IHEGGELGYS LSHAYGAVFD NPDLIVACVI
GDGEAETGPL ATSWHSNKFL DPQTDGAVLP ILHLNGYKIA NPTILARIPQ PELEALLRGY
GYDPIFVEGH DPDPMHQKMA TAMDLAFDRI AAIQKAAREG GQTERPTWPM IVMRSPKGWT
GPKEIDGLRT EGYWRSHQVP FSDLTKPGHL QALETWLRSY KPEDLFDESG RLFADIAALA
PSGARRMSDN PHANGGQLRH PLKLPDIANY AVPVTTPGSV TGEGTRVLGT WLRDVMKENL
PSRNFRVLAP DENNSNRLNA VLDVTNRAWN AETVDYDDHL ARDGHVMEIL SEHTCQGWLE
GYLLTGRHGF MSCYEAFIHI VDSMVNQHAK WLKTSAEVPW RRSISSLNYL LTSHVWRQDH
NGFSHQDPGF IDHVINKKAD IVRVYLPPDA NTLLCTAAHC LHSWDRINVI VAGKQPEPQW
LSMEDAIRHC SAGIGIWEWA SNDKGSEPDV VMACAGDVPT IETLAAVKLL REHAPDLKIR
VINVVDLMTL ESASQHPHGL TDSAFDALFT LDKPVIFAFH GYPQLIHKLI YRRANARNFH
VHGFREEGST TTPFDMVVRN HLDRFHIVSN VIDRVPGLAT RAAYAKQAIR DKLVDHTRYI
AEYGRDMPEI EDWRWS
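
As a cross-check between the gene and protein sequences, the start of the gene can be translated codon by codon. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the compact NCBI-style amino-acid string, which is the same for translation table 11 as for the standard code (table 11 differs in which start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a whitespace-formatted DNA block; '*' marks a stop codon."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First line of the gene sequence, copied from the record.
gene_start = "ATGAACGAGA TGTCCCCCCT GTCCACCGCG CCCCTGACGG CCACGCCGCT GTCGGCGGCG"
print(translate(gene_start))  # MNEMSPLSTAPLTATPLSAA
```

The output matches the first 20 residues of the protein sequence listed above.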