Gene Mpe_A2010 details

Gene Information

Locus tag: Mpe_A2010
Symbol:
ID: 4784230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2152917
End bp: 2154344
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 66%
IMG OID: 640090580
Product: dihydrolipoamide dehydrogenase
Protein accession: YP_001021203
Protein GI: 124267199
COG category: [C] Energy production and conversion
COG ID: [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes
TIGRFAM ID: [TIGR01350] dihydrolipoamide dehydrogenase
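
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS includes a terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2152917, 2154344)
print(length)                           # 1428
print(expected_protein_length(length))  # 475
```

Both values agree with the Gene Length and Protein Length fields of this record.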


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.316164
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA CTTTCGACGT CGTCGTCATC GGCGGCGGCC CGGGCGGCTA CATCGCTGCC 
ATCCGTGCGG CGCAGCTGGG TTTCAACACC GCCTGCATCG ACGAGTGGAA GAACGACAAG
GGCGGTCCAG CGCCGGGCGG CACCTGCACG AACGTGGGCT GCATCCCGTC GAAGGCGCTG
CTGCAGTCGA GCGAGCACTT CGAACACGCC GGCCATGCGT TCGCCGACCA CGGCATCGGA
CTGAAGGATC TGAGCATCGA CGTCGCGAAG ATGCTGGGGC GCAAGGACAC CGTCGTGAAG
CAGAACAACG ACGGCATCCT CTATCTGTTC AAGAAGAACA AGGTCAGCTT CTTCCACGGC
CGCGGCTCGT TCGTGAAGGC CGGTGACGCC GGCTACGAGA TCAAGGTCAG CGGCGCCACC
GAGGACACGC TGATCGGCAA GGACATCATC ATCGCGACCG GCTCGAGTGC ACGTGCGCTG
CCGGGTGCAC CGTTCGACGA GGAGAGCATC CTCAGCAACG ACGGGGCGCT GCGCATCCCG
TCGGTGCCGG CGAAGCTGGG CGTCATCGGC TCGGGCGTGA TCGGCCTCGA GATGGGCTCG
GTGTGGCGCC GCCTGGGCGC CGAGGTGACG GTGCTGGAAG CGCTGCCGAC CTTCCTGGGC
GCGGTCGACG AACAGATCGC CAAGGAAGCC CAGAAGGCCT TCATGAGGCA GCGCCTGAAG
ATCGAGCTGG GCGTGAAGAT CAGCGAAGTC AAGAAGGACA AGAAGGGCGT CAGCGTCAGT
TACACCAGCG CCAAGGGCGA TGCCAAGACG CTGGAAGTCG ACAAGCTGAT CGTGTCGATC
GGCCGCGTGC CCAACACCAC CGGCCTGAAC GCCGAGGCGG TGGGACTGAA GCTCGACGAG
CGCGGCGCGA TCGTGGTCGA CGACGACTGC CGCACCAACC TGCCGAAGGT GTGGGCCATC
GGCGACGTGG TGCGCGGCCC GATGCTCGCC CACAAGGCGG AAGAAGAGGG CGTGGCGGTC
GCGGAGCGCA TTGCCGGCCA GCATGGACAC GTCAACTTCA ACACCATCCC CTGGGTCATC
TATACCAGTC CGGAGATCGC CTGGGTCGGC CAGACCGAGC AGCAGCTCAA GGCGGCGGGC
CGCGCCTACA AGGCCGGAAC CTTCCCGTTC CTGGCCAACG GTCGTGCGCG TGCGCTCGGC
GACACGACCG GCATGGTGAA GTTCCTGGCG GACGCTGCGA CCGACGAGAT CCTCGGCGTG
CACATCGTCG GACCGATGGC CAGCGAACTG ATCGCTGAGG CGGTGGTGGC GATGGAGTTC
AAGGCCAGCG CCGAGGACAT TGCCCGCATC TGCCACGCGC ACCCGTCGCT GTCGGAAGCG
ACCAAGGAGG CCGCCCTGGC CGTGGACAAG CGCACACTGA ATTTCTGA
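
The gene sequence above is printed in space-separated blocks of 10 bases. As a minimal sketch, the blocks can be joined and the length and GC fraction recomputed for comparison with the Gene Length and GC content fields. The helper names are hypothetical, and the GC definition used here, (G + C) / total bases, is an assumption about how the record's percentage was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first two blocks of the sequence; paste the full
# listing into `raw` to check the whole gene.
raw = "ATGAGCAAGA CTTTCGACGT"
seq = clean_sequence(raw)
print(len(seq))
print(round(100 * gc_fraction(seq)))
```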
 
Protein sequence
MSKTFDVVVI GGGPGGYIAA IRAAQLGFNT ACIDEWKNDK GGPAPGGTCT NVGCIPSKAL 
LQSSEHFEHA GHAFADHGIG LKDLSIDVAK MLGRKDTVVK QNNDGILYLF KKNKVSFFHG
RGSFVKAGDA GYEIKVSGAT EDTLIGKDII IATGSSARAL PGAPFDEESI LSNDGALRIP
SVPAKLGVIG SGVIGLEMGS VWRRLGAEVT VLEALPTFLG AVDEQIAKEA QKAFMRQRLK
IELGVKISEV KKDKKGVSVS YTSAKGDAKT LEVDKLIVSI GRVPNTTGLN AEAVGLKLDE
RGAIVVDDDC RTNLPKVWAI GDVVRGPMLA HKAEEEGVAV AERIAGQHGH VNFNTIPWVI
YTSPEIAWVG QTEQQLKAAG RAYKAGTFPF LANGRARALG DTTGMVKFLA DAATDEILGV
HIVGPMASEL IAEAVVAMEF KASAEDIARI CHAHPSLSEA TKEAALAVDK RTLNF
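
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons. This gene starts with ATG, so no special handling of alternative starts (such as GTG or TTG, which would be read as M at the initiation position) is included. The `translate` function is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGAGCAAGA CTTTCGACGT C"))  # MSKTFDV
```

The output matches the first seven residues of the protein sequence listed above.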