Gene Gdia_0163 details

Gene Information

Locus tag: Gdia_0163
Symbol:
ID: 6973555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 177727
End bp: 179001
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 69%
IMG OID: 643389697
Product: pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
Protein accession: YP_002274578
Protein GI: 209542349
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
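
The coordinates and lengths in this record are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(177727, 179001)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```

Both values agree with the Gene Length and Protein Length fields listed for Gdia_0163.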


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.625512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.00113333
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCCGTGA ATATCCTGAT GCCGGCGCTG TCGCCGACGA TGACCGAGGG CAAGCTGTCC 
CGCTGGCTGA AGAAGGAAGG CGACGCGATC CATTCCGGCG ACGTGATCGC CGAGATCGAG
ACCGACAAGG CGACGATGGA GGTCGAGGCC GTGGATGACG GCCTGCTGGG CCGCATCCTG
GTTTCCGAAG GCACCGAAGG GGTGAAGGTC AACGCGCCGA TCGCCATCGT GGTGGCGGAA
GGCGAGAGCG TTCCCGATGA CGCAGCCCCC GTGGCGGCTG CTCCGGCGGC GGCTCCCGTG
GCGGCGGCCC CGGTTTCCGA GGCCAAGGCA CCGGCGATCG CGGCCGCCCC GGCCGTGCCC
CAGGGCGCGG CGCCGGCTCC GGCCCAGGGC ACGCGCGTCT TCGCGTCGCC GTTGGCGCGG
CGCATCGCGG CGCAGAAGGG GATCGACCTG TCCGGCGTGA AGGGCAGCGG CCCGAATGGC
CGGATCGTGC GTCGCGACGT CGAATCCGCG ACGGCGGCGC CCGTGGCGGC CCCGGTACCA
TCCCCGGCAC CGTCCGCCCC GGCCGCAGCG ATCGAGGCGC CGCATACCGC CGTGCCGAAC
TCGACCATCC GCAAGGTCAT CGCCCGGCGG CTGACCGAGG CGAAGTCGAC CATCCCGCAT
TTCTACGTGG CGATGGATGT GGAACTGGAC GCGCTGCTGG ACCTGCGGGC GAAGCTGAAC
GCGGCCTCGC CGGCCGAGGG GCCGGGAGCG TTCAAGCTGT CGGTCAACGA CATGCTGATC
AAGGCGGTGG CGGTAACCCT GCGCCGGGTG CCGAAGGTCA ATGCATCCTA TACCGAGGAC
GCGACGATCC TGTACGACGA TGTCGATGTC TCGGTCGCCG TGTCGATCGC CGATGGGCTG
ATCACGCCGA TCGTGCGCCA GGCCGACCGC AAGTCGCTGC GCGAGATCAG CGAGGACGCG
AAGGATCTGA TCACCCGCGC CCGTGCCGGC AAGCTGAAGC CGCAGGAATT TCAGGGCGGA
TCGTTCTCGA TCTCGAACAT GGGCATGTAT GGGGTGAAGG AATTCTCGGC CATCATCAAT
CCGCCCCAGG CCGCCATCCT GGCCATCGCG GCGGCTGAGA AGCGCGCCGT GGTCAAGGAC
GACGCAATCC GGATCGCCAC CGTGATGACG GTGACGCTGT CGGTCGATCA TCGCGTCGTC
GACGGCGCCC TGGCCGCCGA ATGGGTTTCG ACCTTCCGCT CGGTGGTCGA ATCGCCGCTG
AGCCTGGTGG TCTGA
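
A GC content figure such as the one reported for this gene can be recomputed from a sequence in the blocked layout used here. The sketch below assumes GC content is defined as the share of G and C bases among all bases, with the spaces and line breaks of the 10-base blocks ignored. The helper name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, pasted with its block spacing intact.
first_row = "ATGTCCGTGA ATATCCTGAT GCCGGCGCTG TCGCCGACGA TGACCGAGGG CAAGCTGTCC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the value to compare against the GC content field.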
 
Protein sequence
MSVNILMPAL SPTMTEGKLS RWLKKEGDAI HSGDVIAEIE TDKATMEVEA VDDGLLGRIL 
VSEGTEGVKV NAPIAIVVAE GESVPDDAAP VAAAPAAAPV AAAPVSEAKA PAIAAAPAVP
QGAAPAPAQG TRVFASPLAR RIAAQKGIDL SGVKGSGPNG RIVRRDVESA TAAPVAAPVP
SPAPSAPAAA IEAPHTAVPN STIRKVIARR LTEAKSTIPH FYVAMDVELD ALLDLRAKLN
AASPAEGPGA FKLSVNDMLI KAVAVTLRRV PKVNASYTED ATILYDDVDV SVAVSIADGL
ITPIVRQADR KSLREISEDA KDLITRARAG KLKPQEFQGG SFSISNMGMY GVKEFSAIIN
PPQAAILAIA AAEKRAVVKD DAIRIATVMT VTLSVDHRVV DGALAAEWVS TFRSVVESPL
SLVV
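
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons). It assumes the sequence is given in the coding orientation and stops at the first stop codon; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGTCCGTGA ATATCCTGAT GCCGGCGCTG"))  # MSVNILMPAL
print(translate("AGCCTGGTGG TCTGA"))                  # SLVV
```

The two outputs match the first ten and last four residues of the protein sequence listed above.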