Gene GM21_1974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1974 
Symbol 
ID8137308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2289545 
End bp2291005 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content64% 
IMG OID644869588 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_003021785 
Protein GI253700596 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones116 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGA TCGTCATGCC GAAACTGTCC GACACCATGA CCGAGGGGAG GCTTGTCTCC 
TGGAAGAAAA AGGTAGGGGA GAGCGTGGCC CGCGGGGAGG TGATTGCCGA GGTCGAGACC
GACAAGGCGA ACATGGAACT GGAAGCCTAC GTCTCGGGCG AATTGCTTGA GATAAGGGTG
CAGACGGGGG ACCTGGTGCC GGTGGGAACG GTGATCGCCA TCATCGGCAA GGCCGATGAA
AAGGGAGCCG GCGCTACGCA GCAGTCGGCA CCGGTGCCGC ATGTGGAGCC TGAACCGCAA
AGGCCGCAAG GGGAGGCTCC TGCCGGTCCG CCGGCCGCGC CGATGGTAGA GCCAAGGGTG
GAAGAGCCTG AGTCGGCAGC CGCAGAACCG CCCGCCTCGA CAGGGGTGAA GTCTGCCGAG
GGTGTCGATC TCGTCCCTGC TGAAGGGGAG AAACAGGCAC CTGCCGCGCC TCCCAGGCCC
GGCGCCGAAG AGTCGCAGCC TGCCGGCGAG CCGAAAAAGC CGTACCGCCC GGAGAGCCCG
GAGAGCCCGG AGAGCCCGCC GGAGGTGGCC GCCCCGAACG TCGAACTTGC AACCGGAGCC
GGGCGGGAAA AGGCTGCACC CGTGGTCCGG CGCCGTGCCC GCGAACTGGG GATTGATCTT
GCCCAGGTGC AGGGGAGCGG TCCCGAAGGG CGTATTCTGC TCGCCGACCT GGATTTGCAG
GGGACGGAAC CTGCGCCGGC TGGCCAAGCA CCCCAGGCTG CGGCTGAAGC GGCGCCGGCG
CCTCAGGGCG AGGGGCCAAG GCCGATGTCG CGGCTGCGGA GCGCTGTCGC CAAGACGGTC
ACGGAATCCT GGCACAATAT CCCGCACTTC ACAGTCACCG TGGATGTCGA GATGGACGAG
GCGGAAGCGG TCCGGCGCCA GTTGAAGCAA ACCGGCATGC CGGTCAGCGT CAACGATCTC
ATCGTGAAAG CGGTGGCTAT GGCACTGCGA CAGTTCCCGC AGATGAACGC TAGCTTTACG
CCGGAAGGGC TGCAGTTCCA TGGCGACATC AACATAGCGA TAGCTGTAGG TATGTCTGAC
GGCGTTCTCA TGCCGGTGCT CTCCGGTTGT CAGCAGCGCT CGCTGCTGGA GATAGCGCAG
GAAGCAAAGA AACTCGTGGA ACGTGCCCGC TCCGGCAGTC TCAGCGAGCA AGAGATGCAA
GGCGGGACCT TCTCCGTTTC CAACCTCGGC ATGTTCGGTG TCGGCAGCTT CAGCGCCATC
ATCTATCCCT CGCAGTCAGG AGTACTTGCC GTCGGGACCG TCTCCGAAGT CGCCAGGATG
AATTCGGGCG TTTTGAGCAG CACCAAGGTG ATGAAGGTCA CCCTTTCCGC CGATCACCGT
GTGATTGACG GCGCCTATGC CGCACAATTT CTGGCCGGGT TGAAGGAGAT CCTGGAGAAC
CCAGTCCGGC TTCTCATCTG A
 
Protein sequence
MNEIVMPKLS DTMTEGRLVS WKKKVGESVA RGEVIAEVET DKANMELEAY VSGELLEIRV 
QTGDLVPVGT VIAIIGKADE KGAGATQQSA PVPHVEPEPQ RPQGEAPAGP PAAPMVEPRV
EEPESAAAEP PASTGVKSAE GVDLVPAEGE KQAPAAPPRP GAEESQPAGE PKKPYRPESP
ESPESPPEVA APNVELATGA GREKAAPVVR RRARELGIDL AQVQGSGPEG RILLADLDLQ
GTEPAPAGQA PQAAAEAAPA PQGEGPRPMS RLRSAVAKTV TESWHNIPHF TVTVDVEMDE
AEAVRRQLKQ TGMPVSVNDL IVKAVAMALR QFPQMNASFT PEGLQFHGDI NIAIAVGMSD
GVLMPVLSGC QQRSLLEIAQ EAKKLVERAR SGSLSEQEMQ GGTFSVSNLG MFGVGSFSAI
IYPSQSGVLA VGTVSEVARM NSGVLSSTKV MKVTLSADHR VIDGAYAAQF LAGLKEILEN
PVRLLI