Gene Gdia_3502 details


Gene Information

Locus tag: Gdia_3502
Symbol:
ID: 6976954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3834527
End bp: 3835555
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 643393022
Product: Transketolase central region
Protein accession: YP_002277841
Protein GI: 209545612
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
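
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3834527
end_bp = 3835555

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not add an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1029 0 342
```

Both derived values agree with the record: 1029 bp and 342 aa.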


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.257276
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0335061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA AGAGCTATCG GCAGGCCATC AACGAGGCCC TGCGACTGGA AATGCGGCGC 
GACCCGCGCG TGATCCTGAT GGGCGAGGAC GTCGCCGGCG GACATGGCGG ATCGTCGGGC
GTCACCGACG CCTGGGGCGG CGTGCTGGGC GTCACCAAGG GCCTGTTGAG CGAATTCGGC
GAGGATCGCG TCCTGGACAC CCCGATCACG GAAGCATCCT ATATCGGCGC CGCCGCCGGG
GCCGCCGCGA CCGGCCTACG CCCCGTCGCC GAGCTGATGT TCGTCGATTT CGTGGGCTGC
TGCCTGGACC AGATCATGAA CCAGGCCGCC AAGTTCCGCT ACATGTTCGG CGGCAAGGCC
CGCACCCCGC TGGTCATCCG CGCCATGTTC GGCGCCGGCT TCAACGCCGC GGCCCAGCAC
AGCCAGGCGC TGTACCCGCT GTTCACCCAC ATTCCCGGGC TGAAGGTGGT CGTCCCGTCC
TCGCCCTACG AGGCCAAGGG CCTGCTGATC GAGGCGATCC GCGACGACGA TCCGGTGATC
TTCCTTGAAC ACAAGGTCAT GTATGACGAC GAGGAAGAGG TGCCCGACGA AGCCTATACC
ATCCCGTTCG GCGAGGCCAA CCTGACGCGT GAGGGCGACG ACCTGACGAT CGTGGCGTTC
GGCCGCATGG TGAAGCTGGC GAACGAGGCC GCCGACCGGC TGCAAAAGCA GGGCATCGGC
TGCACCGTCA TCGATCCGCG CACCACCTCG CCGCTGGATG CCGAGACGAT CCTGGACAGC
GTGACCGAGA CCGGCCGGCT GGTGATCGTC GATGAATCCA GCCCGCGCTG CAACATGGCC
GCCGACATCT CCGCCCTGGT GGCCGAACAG GCGTTCGACG CGCTGAAGGC CCCGATCCGG
CGGGTGATGC CACCCCACAC GCCGGTGCCG TTCGCATCGG TGCTGGAAAG CCTGTACCTG
CCCGACGTGG CGAAGATCGA AGCGGCTGCC CGTGCCGTGA TGACCCATCG CATCCGAGAG
GTCGCCTGA
 
Protein sequence
MSKKSYRQAI NEALRLEMRR DPRVILMGED VAGGHGGSSG VTDAWGGVLG VTKGLLSEFG 
EDRVLDTPIT EASYIGAAAG AAATGLRPVA ELMFVDFVGC CLDQIMNQAA KFRYMFGGKA
RTPLVIRAMF GAGFNAAAQH SQALYPLFTH IPGLKVVVPS SPYEAKGLLI EAIRDDDPVI
FLEHKVMYDD EEEVPDEAYT IPFGEANLTR EGDDLTIVAF GRMVKLANEA ADRLQKQGIG
CTVIDPRTTS PLDAETILDS VTETGRLVIV DESSPRCNMA ADISALVAEQ AFDALKAPIR
RVMPPHTPVP FASVLESLYL PDVAKIEAAA RAVMTHRIRE VA
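
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs only in the alternative start codons it permits, which does not matter here because the gene starts with ATG). Only the first and last sequence blocks are used, to keep the example short.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
# Amino acids for those 64 codons; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_block = "ATGAGCAAGA AGAGCTATCG GCAGGCCATC"  # start of the gene sequence
last_block = "GTCGCCTGA"                          # final line of the gene sequence

print(translate(first_block))  # MSKKSYRQAI
print(translate(last_block))   # VA
```

The two outputs match the first ten and last two residues of the listed protein. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.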