Gene GM21_0475 details

Gene Information

Locus tag: GM21_0475
Symbol:
ID: 8135784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 583248
End bp: 584303
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 68%
IMG OID: 644868093
Product: pyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit
Protein accession: YP_003020313
Protein GI: 253699124
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit
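
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(583248, 584303)
print(length)                  # 1056
print(protein_length(length))  # 351
```

Both results agree with the Gene Length and Protein Length values listed in the record.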


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.000000000000527736
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGAAG AAATCCTTGC CACTTTCCAA GTGAAACGCC TGAGCGTTCT CAATGAGAAT 
GGCAGCGCCG ATCTTGCCCT CATGCCGGAG TTGTCGGCCG ACCAGATCCG GCGCATGTAC
CAGCTCATGG TGCTCTCGCG CTGCTTCGAC GAGCGTGCCG TTTCCTTGCA GCGGGAGGGG
CGCCTGGGAA CCTACCCCCC CATACGGGGG CAGGAGGCGG CCCAGGTGGG GAGCGCCTTC
GCGCTCCAGC CCAACGACTG GGTGTTCCCC TCTTTCCGCG AGATGGGGGC GCACCTGACG
CTGGGGTATC CCATCCCGCA GCTCCTCCAG TACTGGACCG GGGACGAGCG GGCCCAGAAG
GCGCCGCCGC AGCTCAACAT CTTTCCCTTC TGCGTGGCCG TGGGTAGCCA GATTCCCCAT
GCGGTAGGGG CCGCGCTCGC CGCCCGCTAC CGGCGGGATT CGGCCGCCGT GGCGGTGTAT
TTCGGCGACG GGGCGACCTC CAAGGGGGAC TTCCACGAGG CGATGAACAT GGCGGGGGTC
TACCAGCTCC CCATAGTCTT CATCTGCCAG AACAACCAGT GGGCCATCTC GGTCCCGCTC
AAGGGGCAGA CGGCCTCGGC GTCGCTGGCC CAGAAGGCGC TCGCCTACGG GTTCGAAGGG
GTGCAGGTGG ACGGCAACGA CGTCCTCGCG GTCTATCGCG CCACGAAGCA GGCGCTGGAA
AAGGCGAGAA GCGGCGGCGG CCCAACCTTC CTGGAATGCC TCACCTACCG CATGGCCGAC
CACACCACGG CCGACGACGC CGGGCGCTAC CGCTCGGACG AGGAGGTGGC GCTTTGGAAC
GGGCGGGATC CCATCCTCAG GCTGGAGCGC TTCTTAGCCG CAAGCGGCGC CTGGACCCCG
GAGCAGGGGA GGTGGGCCAA GGAGGAGGCG ACCGCGCTGA TCGACCGGGG GGTAAGGGAG
ATGGAGGCGG TACCCCCCCC CGCCGCGTCG GAGCTTTTCG ACGGCACCCT GGCGGCACTC
ACCCCGCGGC AGGCCGGGCA AAGAGAGGGA CGCTGA
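
The gene sequence above is printed in groups of ten bases, so it has to be cleaned before any computation. The sketch below shows one possible way to strip the grouping and compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record, used only as sample input.
sample = "ATGCCCGAAG AAATCCTTGC CACTTTCCAA GTGAAACGCC TGAGCGTTCT CAATGAGAAT"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full 1056 bp sequence instead of the sample line would give a value to compare against the GC content field.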
 
Protein sequence
MPEEILATFQ VKRLSVLNEN GSADLALMPE LSADQIRRMY QLMVLSRCFD ERAVSLQREG 
RLGTYPPIRG QEAAQVGSAF ALQPNDWVFP SFREMGAHLT LGYPIPQLLQ YWTGDERAQK
APPQLNIFPF CVAVGSQIPH AVGAALAARY RRDSAAVAVY FGDGATSKGD FHEAMNMAGV
YQLPIVFICQ NNQWAISVPL KGQTASASLA QKALAYGFEG VQVDGNDVLA VYRATKQALE
KARSGGGPTF LECLTYRMAD HTTADDAGRY RSDEEVALWN GRDPILRLER FLAASGAWTP
EQGRWAKEEA TALIDRGVRE MEAVPPPAAS ELFDGTLAAL TPRQAGQREG R
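
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table used by NCBI translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (table 11 differs in the permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is a hypothetical helper.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGCCCGAAG AAATCCTTGC CACTTTCCAA GTGAAACGCC TGAGCGTTCT CAATGAGAAT"
print(translate(first_line))  # MPEEILATFQVKRLSVLNEN
```

The output matches the first twenty residues of the protein sequence listed above.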