Gene GM21_1967 details

Gene Information

Locus tag: GM21_1967
Symbol:
ID: 8137301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2282556
End bp: 2283533
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 64%
IMG OID: 644869581
Product: pyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit
Protein accession: YP_003021778
Protein GI: 253700589
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit
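
The coordinate and length fields above are mutually consistent, and a short check makes that explicit. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length matches only under that convention) and that the final codon is a stop codon not counted in the protein length. Variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 2282556
end_bp = 2283533

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Each codon is 3 nucleotides; the last codon is assumed to be a stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 978 0 325
```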


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 144
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACA ATCTTAAGGA TCTGCTTCCT GAAGAGGAGT TGCTGAAATT CTACGAGCAG 
ATGGTGCTTT GCCGCGAGTT CGAGGAGAGC TGCGCCGAGC AGTACTCCAA GGGGCATATC
ACCGGCTTCC TCCATCTCTA TAGCGGCCAG GAGGCGGTCG CCGTCGGCTG CTCCGCCGGG
CTGCAGCCTG CCGACTACAT TCTGTCGGCC TACCGCGACC ACGCCCAGGC CATCGTGAGG
GGGGCCGATC CCAAAAGGGT CATGGCCGAA CTCTTCGGCA AGGCGACCGG CCTTTGCAAG
GGAAAAGGGG GGTCGATGCA CCTTTTCGCG CCCGAGCTTA ACTTTATGGG GGGGTACGCC
ATCGTGGGGG GGCAGTTCCC CATCGCCACC GGGCTTGCCT GGGGAAGCCA GCTCCAGGAA
CAAGACCGCA TCACCGCCTG CTTCTTTGGC GACGGCTCGA TGAACCAGGG AACCTTCCAC
GAGTCGCTGA ACTGGGCCAG GCTTTGGGAC CTCCCCGTGC TCTTCATCTG CGAGAACAAC
TTCTACGGCA TCGGCACCGA GGTACATCGC GCCTCGGCCC AGGCGGCACT GCACCGGCGC
ACCTGCGGCT ATGACATACC CAGCGAGAAA GTGGACGGCA TGGACGTGGT GGCCATGTAC
CAGGCGACCA AGAGGGCGGC GGAGTGGGTG AGGGAGCGCC AGCGCCCCTA TTTCATAGAG
GCGGTCACCT ACAGGTTCCG CGGCCACTCC ATGTCCGACC CAGCCAAGTA CCGCAGCTCC
TCGGAGGCGG AGGTCTGGAA AAGCCGCGAC CCGATACCGA ACCTCTCGCG CCGACTGCTG
GAGGAGGGGA TAGCGGACCA GGCGCGCCTG GACGAGATAG ACCGGCGCGC CCTGGCCCAG
GTCCAGGAGG CGGTCCGCTT CGCAGAGGAT TCCCCCTGGC CTGAGGACTC TGAAATCTGG
AACGACATCT ACGTGTGA
 
Protein sequence
MADNLKDLLP EEELLKFYEQ MVLCREFEES CAEQYSKGHI TGFLHLYSGQ EAVAVGCSAG 
LQPADYILSA YRDHAQAIVR GADPKRVMAE LFGKATGLCK GKGGSMHLFA PELNFMGGYA
IVGGQFPIAT GLAWGSQLQE QDRITACFFG DGSMNQGTFH ESLNWARLWD LPVLFICENN
FYGIGTEVHR ASAQAALHRR TCGYDIPSEK VDGMDVVAMY QATKRAAEWV RERQRPYFIE
AVTYRFRGHS MSDPAKYRSS SEAEVWKSRD PIPNLSRRLL EEGIADQARL DEIDRRALAQ
VQEAVRFAED SPWPEDSEIW NDIYV
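
The protein sequence can be reproduced from the gene sequence by removing the block spacing and translating codon by codon. The following is a minimal sketch, not the database's own pipeline: the function names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard one, which gives the same amino acid assignments as translation table 11 (table 11 differs only in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The sketch does not handle ambiguous bases such as N.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids as the standard code; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGCGGACA ATCTTAAGGA TCTGCTTCCT GAAGAGGAGT TGCTGAAATT CTACGAGCAG"
last_row = "AACGACATCT ACGTGTGA"
print(translate(first_row))  # MADNLKDLLPEEELLKFYEQ
print(translate(last_row))   # NDIYV
```

Passing the full gene sequence to `translate` should yield the protein sequence with the spaces removed, and `gc_percent` on the same input can be compared with the GC content field.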