Gene GSU2654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2654 
SymbolpdhA 
ID2685638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2924871 
End bp2925929 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content68% 
IMG OID637127344 
Productpyruvate dehydrogenase complex E1 component, alpha subunit 
Protein accessionNP_953699 
Protein GI39997748 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGAAA CAACGATTCA GACATTCGGC GTCCGGCGTC TTGAGATCAT CGCTGCGGAC 
GGCACGGCGG ACGAGGCGCT TCTTCCCGAC CTTTCGGGCG ACCAGCTGCG GCGGCTCCAC
TACCTGATGC TCCTGACCCG CACCTTTGAC CGCCGCGCCC TGGCGCTCCA GCGGGAAGGC
AGAATCGGCA CCTATCCCTC GGTGCTTGGC CAGGAGGCGG CCCAGGTGGG GAGCGCCTTC
GCCCTTCAGC CAAGCGACTG GGTCTTTCCC TCCTTCCGGG AGATGGGCGC CCACCTCACC
CTCGGCTACC CGGTCCACCA GCTCTTCCAG TACTGGGGCG GCGACGAGCG GGGGCTGCGT
ACCCCGGACG GCATGAACCT CTTTCCCATC TGCGTTTCCG TGGGGACCCA CATCCCCCAT
GCCGCAGGGG CGGCGCTGGC GGCCAGGGCC AGGGGCGACC GGTCGGCCGT GGCGGCCTAC
TTCGGCGACG GCGCCACCTC CAAGGGGGAC TTTCACGAAG GGTTCAACCT GGCCGGCGCC
CTGAAGTTGC CGGTGGTCTT CATCTGCCAG AACAACCAGT GGGCCATTTC GGTGCCCCTG
GCGGCCCAGA CCGCTGCCCC GACCCTGGCC CAGAAGGCGC TGGCCTACGG TTTCGAGGGC
ATCCAGGTGG ACGGCAACGA CGTGCTGGCC GTATTCCGCG CCACGGGCGA GGCCCTGGTC
AGGGCCCGCG ACGGGGGAGG CCCCACCTTC ATCGAATGCC TCACCTACCG CATGGCCGAC
CACACCACCG CCGACGATGC GAGCCGCTAT CGTCCCCCGG CTGATGTGGA GGCGTGGCGC
GACCGGGACC CCCTGCTCCG CTTCGAGCGG TTCCTGGCAA AGCGCGGCCT CTGGAACGGG
GATTACGGAG CCGAGGTACA GGCAAAGGCC GAGGGAGAGA TCGACGAAGC GGTACGGCGC
TACGAATCGG TGCCGCCGCC GGAGCCGGGG GAGATGTTCG CCTTCACCTG TGCGGAGTTG
AGTCCGCGGC AGAGGCGGCA ACAGGAAAAC ATCCGCTAG
 
Protein sequence
MPETTIQTFG VRRLEIIAAD GTADEALLPD LSGDQLRRLH YLMLLTRTFD RRALALQREG 
RIGTYPSVLG QEAAQVGSAF ALQPSDWVFP SFREMGAHLT LGYPVHQLFQ YWGGDERGLR
TPDGMNLFPI CVSVGTHIPH AAGAALAARA RGDRSAVAAY FGDGATSKGD FHEGFNLAGA
LKLPVVFICQ NNQWAISVPL AAQTAAPTLA QKALAYGFEG IQVDGNDVLA VFRATGEALV
RARDGGGPTF IECLTYRMAD HTTADDASRY RPPADVEAWR DRDPLLRFER FLAKRGLWNG
DYGAEVQAKA EGEIDEAVRR YESVPPPEPG EMFAFTCAEL SPRQRRQQEN IR