Gene Mext_2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2789 
Symbol 
ID5832131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3127477 
End bp3128889 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content67% 
IMG OID641368591 
Productpyruvate dehydrogenase complex dihydrolipoamide acetyltransferase 
Protein accessionYP_001640251 
Protein GI163852208 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCA ACGTCCTGAT GCCCGCGCTC TCCCCGACCA TGGAGAAGGG CAACCTCGCC 
AAGTGGCTCA AGAAGGAGGG CGACGCCATC AAGTCCGGCG ACGTCATCGC CGAGATCGAG
ACCGACAAGG CCACCATGGA GGTCGAGGCG GTCGATGAGG GCGTGCTCGC CAAGATTCTC
GTGGCCGAAG GCACCGCCGA CGTTCCGGTC AACGAGCTGA TCGCGCTGAT CGCTGAAGAG
GGTGAGGATC CGGGCAGCGT CCAGGCGCCT AAGGGTGGTG CCGAGGCGAA GACCGCCCCC
GTCGAGCCGA AGGGCACGCC CGACCAGAAC GCCGCGCCCG ATGGCTCCCA CGCCTCCTAC
GCGCGCGTCG ATCAGGTGCC CGAAGGTGCC AAGCCGAACG GCGCTGCGCA GCCGGCTGGC
TCCGGCGATC GCGTCTTCGC CTCGCCGCTC GCGCGCCGCA TCGCGAAGCA GGAAGGCGTC
GATCTCTCGG CAGTGAAGGG CTCGGGTCCG CATGGCCGCG TGATCCAGCG CGACGTGCAG
GCGGCGATCG AGAACGGCAC GGCGAAGGCC GATGCGGCGG CCAAGCCCGA GGCCAAATCG
GAGGCCAAGA GTGCTCCTGC TCCCGAGAAA ACCGCGCCGA AGGCGGCTTC CGGCGGCGGC
GCCCCGGCCG GGCTCAGCCT CGATCAGGTC AAGGGCTTCT ACGAGAAGGG CAGCTTCGAG
GAAGTGCCGC TCGACGGCAT GCGCAAGACC ATCGCCAAGC GCCTCACCGA GGCCATGCAG
GTCGCGCCGC ACTTCTACCT CACCGTCGAT TGCGAACTCG ATGCGCTGAT GAAGCTGCGC
GAGACGCTCA ACAACTCGGC CGGCAAGGAC AAGGACGGCA AGCCGCTGTT CAAGCTCTCG
GTGAACGACT TCGTCATCAA GGCGATGGGC CTCGCGCTCA CCCGCGTCCC CGCCGCCAAC
GCCGTCTGGG CGGAGGACCG CATCCTGCGC TTCACGCACG CCGAGGTCGG CGTCGCGGTG
GCGATCGATG GCGGCCTATT CACCCCGGTG ATCCGCAAGG CCGACCAGAA GACGCTCTCC
ACCATCTCCA ACGAGATGAA GGATTTCGCC GGCCGGGCGC GTGCCAAGAA GCTGAAGCCC
GAGGAGTACC AGGGCGGCGT CACCTCAGTG TCGAACCTCG GCATGTTCGG CATCAAGCAC
TTCACGGCGG TGATCAACCC GCCGCAATCG AGCATCCTCG CGGTCGGCGC GGGCGAGAAG
CGCGTGGTGG TGAAGGACGG GCAGCCGACC GTTGCCCAGG TGATGACGGC GACCCTCTCC
TGCGATCACC GCGTCCTCGA CGGCGCGCTC GGCGCCGAGT TGATCGCGGC CTTCAAGGGA
CTGATCGAGA ACCCGATGGG GATGCTCGTC TAA
 
Protein sequence
MPINVLMPAL SPTMEKGNLA KWLKKEGDAI KSGDVIAEIE TDKATMEVEA VDEGVLAKIL 
VAEGTADVPV NELIALIAEE GEDPGSVQAP KGGAEAKTAP VEPKGTPDQN AAPDGSHASY
ARVDQVPEGA KPNGAAQPAG SGDRVFASPL ARRIAKQEGV DLSAVKGSGP HGRVIQRDVQ
AAIENGTAKA DAAAKPEAKS EAKSAPAPEK TAPKAASGGG APAGLSLDQV KGFYEKGSFE
EVPLDGMRKT IAKRLTEAMQ VAPHFYLTVD CELDALMKLR ETLNNSAGKD KDGKPLFKLS
VNDFVIKAMG LALTRVPAAN AVWAEDRILR FTHAEVGVAV AIDGGLFTPV IRKADQKTLS
TISNEMKDFA GRARAKKLKP EEYQGGVTSV SNLGMFGIKH FTAVINPPQS SILAVGAGEK
RVVVKDGQPT VAQVMTATLS CDHRVLDGAL GAELIAAFKG LIENPMGMLV