Gene Gdia_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0161 
Symbol 
ID6973553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp175354 
End bp176364 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID643389695 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_002274576 
Protein GI209542347 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.149531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00745066 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGAAA CTCGGAAATC CGCGACAGAG GCCGGACGCA ACAGTCCGTC GATGAGCAAG 
GAAGACCTGA CGCGCGCCTT TCATGACATG GTGCTGATCC GCCGGTTCGA GGAACGGGCC
GGCCAGCTTT ATGGCATGGG GCTGATCGGC GGCTTCTGCC ATCTGTATAT CGGCCAGGAA
GCCGTGGTCG TCGGCGTGCA GATGGAGCTG AAGCAGGGGG ACAAGATCAT CACCTCCTAC
CGCGACCATG GGCAGATGCT GGCCGCCGGC ATGGACCCGC GCGGCGTGAT GGCCGAACTG
ACCGGGCGCG AGGGCGGCTA TTCCCGCGGC AAGGGCGGGT CGATGCACAT GTTCTCGTCC
GAGAAGCATT TCTATGGCGG GCACGGCATC GTCGGCGCCC AGGTGTCGCT GGGTATCGGT
CTGGCCTTCG CCAACAAGTA TCGCGGCACG GACGAGGTCT CGATCGCCTA TTTCGGCGAG
GGCGCGTCCA GCCAGGGTCA GGTCTATGAA AGCTTCAACC TGGCGGCCCT TCACAAGCTG
CCCTGCGTAT TCGTGCTGGA AAACAACCAT TACGGCATGG GTACCAGCGT CGAGCGGTCG
TCGGCGTCCA AGGAATTGTG GCGCAATGGC GAGCCCTGGG GCATCCCGGG CCGTCAGGTC
GACGGCATGG ATGTCGAGGC CGTGCGCGAC GCGGCGCGCG AGGCGATCGA ACATTGCCGG
CAGGGCAAGG GACCGTACCT GCTGGAGATG ACGACCTATC GCTATCGCGG CCATTCGATG
TCCGACCCGG CGAAGTACCG CCCCCGCTCC GAAGTGGACG AGATGCGGAA GAATCATGAC
CCGATCGATC GGGTACGCAA GGAACTGCTG GCCATGGGCG TCGGGGAAGC CGAACTGAAG
ACGATCGAGG ACAAGGTGAA GGAAGTGGTC GTGGACGCCG CCGATTTCGC GCAGACCAGC
CCGGAGCCCG ATCCAGCGGA ATTGTGGACC GACGTGCTGG TGGAGGGCTG A
 
Protein sequence
MGETRKSATE AGRNSPSMSK EDLTRAFHDM VLIRRFEERA GQLYGMGLIG GFCHLYIGQE 
AVVVGVQMEL KQGDKIITSY RDHGQMLAAG MDPRGVMAEL TGREGGYSRG KGGSMHMFSS
EKHFYGGHGI VGAQVSLGIG LAFANKYRGT DEVSIAYFGE GASSQGQVYE SFNLAALHKL
PCVFVLENNH YGMGTSVERS SASKELWRNG EPWGIPGRQV DGMDVEAVRD AAREAIEHCR
QGKGPYLLEM TTYRYRGHSM SDPAKYRPRS EVDEMRKNHD PIDRVRKELL AMGVGEAELK
TIEDKVKEVV VDAADFAQTS PEPDPAELWT DVLVEG