Gene Mpe_A1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1601 
Symbol 
ID4787007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1728512 
End bp1729525 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content67% 
IMG OID640090169 
Productacetoin dehydrogenase complex, E1 component, beta subunit 
Protein accessionYP_001020798 
Protein GI124266794 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGAA AAATCAGCAT GAAGCAGGCG ATCAACGAGG CGCTTGATCA GGAAATGACG 
CGCGACCCGT CGGTGATCGT GCTCGGCGAG GACATCGTCG GCGGCGCCGG CGGCCAGGGC
GAGATGGACG CGTGGGGCGG CGTGCTCGGC GTGACCAAGG GCCTGTACGC CAAGCATGGC
GACCGCCTGA TGGACACACC GCTGTCCGAG AGCGCCTACG TCGGAGCGGC CGTCGGTGCG
GCCGCCTGCG GCATGCGTCC TGTCGCCGAA CTGATGTTCA TCGACTTCAT GGGCGTGTGC
TTCGACCAGA TCTTCAACCA GGCCGCCAAG TTCCGCTACA TGTTCGGCGG CAAGGCCGAG
ACGCCGGTCG TGATCCGCGC GATGGTCGGC GCCGGCTTCC GCGCCGCCGC CCAGCACAGC
CAGATGCTGA CGCCGCTGTT CACCCACATC CCCGGCCTGA AGGTGGTGTG CCCGAGCACG
CCCTACGACA CCAAGGGCAT GCTGATCCAG GCGATCCGCG ACAACGACCC GGTGATCTTC
TGCGAGCACA AGAACCTCTA CGGCTTCGAG GGCGAGGTGC CCGAGGCCTC GTACGCGATC
CCGTTCGGCG AGGCGAACGT GGTGCGTGAA GGCAAGCACG CCACCATCGT GACCTACGGT
CTGATGGTGC ACCGCTCGCT CGACGCCGCC GCCACACTGG CCAAGGAAGG TGTCGAGGTC
GAGATCGTCG ACCTGCGCTC GCTGTCGCCG ATCGACATGG ACACAGTGCT GGACAGCGTC
ACGAAGACCG GCCGCCTGAT CTGCGTCGAC GAGGCCAGCC CGCGCTGCAA CATCGGTACC
GACGTGTCGG CCCAGGTCGC GATGCAGGCC TTCGGCGCGC TCAAGGCCCA GATCGAACTG
GTGTCGCCGC CGCACGTGCC GGTGCCCTTC TCCCCGACCC TCGAGGATCT CTACATTCCG
TCGGCCGCGC AGGTCGCCGA CGCGGTGCGC CGCACCATGA AAGGAAAGCA CTGA
 
Protein sequence
MARKISMKQA INEALDQEMT RDPSVIVLGE DIVGGAGGQG EMDAWGGVLG VTKGLYAKHG 
DRLMDTPLSE SAYVGAAVGA AACGMRPVAE LMFIDFMGVC FDQIFNQAAK FRYMFGGKAE
TPVVIRAMVG AGFRAAAQHS QMLTPLFTHI PGLKVVCPST PYDTKGMLIQ AIRDNDPVIF
CEHKNLYGFE GEVPEASYAI PFGEANVVRE GKHATIVTYG LMVHRSLDAA ATLAKEGVEV
EIVDLRSLSP IDMDTVLDSV TKTGRLICVD EASPRCNIGT DVSAQVAMQA FGALKAQIEL
VSPPHVPVPF SPTLEDLYIP SAAQVADAVR RTMKGKH