Gene MCA3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA3001 
SymbolaceF 
ID3103554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp3176376 
End bp3177686 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content67% 
IMG OID637172127 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_115389 
Protein GI53802926 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.188631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCAAG AACAAACCGT CGTCGTCCCC GATATCGGCG ACTTCAAGGA CGTCGAGATC 
ATCGAGGTAC TGGTCAAGCC TGGCGACAAG GTCGCGGCGA ACGACTCGCT GATCACCCTG
GAGAGCGACA AGGCGGCGAT GGAGATTCCT TCGCCCTATT CCGGCACCGT CACCGAACTG
CATGTGCGCG TCGGCAGCAA AGTCAGCATG GGGACGCCCA TTCTGCAGCT GCGGGAGGAC
GAGGGCACGG ACGCCTCTGG GGCGGCTTCC GCCCCGGTAG AACCGGCGCC GGCCGAGCCT
GTTTCCCCGC CACCGGCCGC AGCCGCCCCG GCTGCGGGCG ACACCCAGGC AGTCCCTGCG
TCGGCGCCCA CACCTCCTGC GCCGATGCCG GTCGCCGAGG AAGGCTCGGG ACCGGCCCAC
GCCAGCCCCG CCGTACGTCG CTTCGCCCGC GAACTCGGCG TGGACGTGGC GAAAGTCCGC
GGCACCGGCC CGAAAGGCCG CATCCTGAAA ACGGACGTCC AGTCTTTCGT GAAACAAGCC
GTCGCGACAG CAGAACGGAC GGGCGGCGGC GGTTTCGCCG TGCCGGCGAT GCCAGAGATC
GATTTCGCCC AGTTCGGCCC GATCGAACGC CAGCCGCTGT CGCGAATCCA GAAGCTCTCT
TCCGCCAACC TCCACCGCAC CTGGCTCACC GTCCCCCACG TCACCCAGCA TGACGAGGCC
GACATCACCG AGCTGGAAGC CTTCCGAAAT GCGCTCAAGG CCGAATCCGC CAAGCGCGGC
GTCAAACTGA CCCTCCTGCC ATTCATCATC AAGGCTGCGG TCGCTGCGCT GAAGGACTTC
CCGCGCTTCA ACGCGTCAGT CGCTCCCAAT GGCGAGGAGC TGATCCTGAA GCGCTACTAC
CACGTGGGTT TTGCGGTGGA CACGCCGGAC GGCCTGGTGG TGCCGGTGAT CCGCGACGCC
GACACCAAAG GTATCTGGGA CATCGCCGCC GAACTCGCCG CCATCGGCGA CAAAGCCCGC
GGCAAGAAGC TGCGCACCGC CGATCTGCAG GGCGGCACCT TCACCGTCTC CAGCCTGGGA
GGCATCGGCG GTATCGCCTT CACCCCGATC ATCAACGCCC CCGAGGTCGC CATCCTGGGT
GTCTCCAAGG CGCAGTTGCG ACCGGTATTC CAAGACGGCC AGTTCGTCCC GCGGCTGATG
CTGCCCTTGT CGCTGTCCTA CGATCACCGG GTGATCGACG GCGCCGACGG CGTCCGCTTC
GTCACCCATG TCAGCAGCCT GCTCGCCGAC ATGCGGCGGG TCCTGCTCTA A
 
Protein sequence
MAQEQTVVVP DIGDFKDVEI IEVLVKPGDK VAANDSLITL ESDKAAMEIP SPYSGTVTEL 
HVRVGSKVSM GTPILQLRED EGTDASGAAS APVEPAPAEP VSPPPAAAAP AAGDTQAVPA
SAPTPPAPMP VAEEGSGPAH ASPAVRRFAR ELGVDVAKVR GTGPKGRILK TDVQSFVKQA
VATAERTGGG GFAVPAMPEI DFAQFGPIER QPLSRIQKLS SANLHRTWLT VPHVTQHDEA
DITELEAFRN ALKAESAKRG VKLTLLPFII KAAVAALKDF PRFNASVAPN GEELILKRYY
HVGFAVDTPD GLVVPVIRDA DTKGIWDIAA ELAAIGDKAR GKKLRTADLQ GGTFTVSSLG
GIGGIAFTPI INAPEVAILG VSKAQLRPVF QDGQFVPRLM LPLSLSYDHR VIDGADGVRF
VTHVSSLLAD MRRVLL