Gene GM21_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0477 
Symbol 
ID8135786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp585384 
End bp586601 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content67% 
IMG OID644868095 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_003020315 
Protein GI253699126 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00000000000000178114 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCTATCG ATTTCAAGCT CCCCGATCTG GGTGAAGGCA TCGCCGAGGT GGAACTGCGT 
CGCTGGCTGG TGGCGGAAGG TGACGCCGTC GCGGAACACC AGCCGCTGGT CGAGGTGGAG
ACGGACAAGG CGGTGGTCGA GGTCCCGTCC CCGCGCTCCG GTGTCGTCGC CCGCCTCCAC
CGCAAGGAGG GGGAGACGGT TCAGGTCGGC GCGACGCTGG TGACTTTCGC CGAGGCGAAG
GAGGCCGGCA GGAGGGAGGA GCCCGAAGGG GAGCGCAGGC CGGCGCAGCG CCCACCCTCG
GTCGGCATCG TCGGCTCGCT GCCGGAACCG GAGGCGGCGA CTCAGGCTCC GCCGGCGGGG
TTCGAGGGAC TGGCGACCCC GATGGTTAGG AAGATGGCGC GGGAGCGGGG TATAGACCTG
AAAAGCGTGC GGGGGACCGG GCCGCGCGGT TGCATAAAGC CCGAGGATCT GGACCAGATT
CCCCAGTCGG CGCAGAAGGC GAAGCCGGCG CCGCAAGACG GGGAACGGGT GCCGCTCAGA
GGCCTGCGGC GTACCATCGC CCGGAACGTG CTGGCCTCCC AAAAGACCAC CGCCTTCGTC
ACCAGTATGG AAGAGGTCGA CATCACGGAC ATATGGGAGA TGCGGGGGCG CGAGCAGGGG
GAGGTGGAGT CGCGCGGCGC CCACCTGACT TTCCTACCCT TCTTCATCAA GGCGGTCCAG
CACGCGCTGC GTGAACACCC GCTTTTGAAC GGCTCCATCG ACGACGAGGC GCAGGAACTG
GTGCTGAAAA AGCAGTACCA TTTCGGGATC GCGGTGGACA CCCCGGAGGG GCTCATGGTC
CCGGTGATCC GGGATGTGGA CAAGAAGAGC ATCATCGAGC TGGCGCAGGC GGTCCAGGAA
CTCGGCCGCA AGGCGCGCGA ACGGAGCATC TCGCTGGAGG AGCTGCGCGG CAGCAGTTTC
ACCATCACCA ACTACGGCCA CTTTGGCGGC ACCTTCGCCA CCCCCATCAT CAACTGGCCC
GACGTCGCCA TCATGGGGTT CGGACGCATC GTGGAGCGCC CCTGGGTGCA CCGGGGCCAG
ATCGCCATCA GGAAGATCCT GCCGCTGTCG CTCACCTTCG ATCACCGAGC CACCGACGGC
GCCGACGCCG CGCGGTTTCT GGGCAAGGTG CTCCGCTACC TCGAGGACCC CGCGCTCCTC
TTTCTGGACA GCGCCTAG
 
Protein sequence
MSIDFKLPDL GEGIAEVELR RWLVAEGDAV AEHQPLVEVE TDKAVVEVPS PRSGVVARLH 
RKEGETVQVG ATLVTFAEAK EAGRREEPEG ERRPAQRPPS VGIVGSLPEP EAATQAPPAG
FEGLATPMVR KMARERGIDL KSVRGTGPRG CIKPEDLDQI PQSAQKAKPA PQDGERVPLR
GLRRTIARNV LASQKTTAFV TSMEEVDITD IWEMRGREQG EVESRGAHLT FLPFFIKAVQ
HALREHPLLN GSIDDEAQEL VLKKQYHFGI AVDTPEGLMV PVIRDVDKKS IIELAQAVQE
LGRKARERSI SLEELRGSSF TITNYGHFGG TFATPIINWP DVAIMGFGRI VERPWVHRGQ
IAIRKILPLS LTFDHRATDG ADAARFLGKV LRYLEDPALL FLDSA