Gene Mpe_A0361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0361 
Symbol 
ID4786852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp396929 
End bp398368 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content69% 
IMG OID640088916 
Productlactaldehyde dehydrogenase 
Protein accessionYP_001019558 
Protein GI124265554 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACCG AATACAGAAA CTACATCGAC GGCGAGTTCC TGGCCAACCG CTCGGGCGCC 
CTGATCGACG TGCACAACCC GGCCACCCAC GAGCTGCTCG CCCGTGTGCC CGACGCCCCG
AACGACGTCG TCGACCTGGC CGTGCAGGCC GCACGCACCG CGCAGCCGGG GTGGGCGAAG
CTGCCCGCGA TCCAGCGCGC CCAGCACCTG CGTGCCATCG CCGCCCGGCT GCGCGAGAAC
GTGGAGGAAC TGGCCCACAC CATCACCGCC GAGCAGGGCA AGGTGCTGGG TCTGGCGCGC
GTGGAGGTGA ACTTCACCGC CGACTACATG GACTACATGG CCGAGTGGGC GCGCCGCCTC
GAGGGCGAGG TGCTCACCAG TGACCGCGTC GGCGAGAGCA TCTTCCTGAT GCGCAAGCCG
ATCGGCGTGG CCGCCGGCAT CCTGCCGTGG AACTTCCCGT TCTTCCTGAT CGCGCGCAAG
CTGGCGCCGG CGCTGATCAC CGGCAACACC ATCGTGATCA AGCCGAGCGA GATCACGCCG
ATCAACGCCT TCGAGTTCGC GCGCCTGGCC TCGCAGACCG ACCTGCCGCG CGGCGTGTTC
AACCTGGTGG GCGGCACCGG CGCCGGCGCC GGCGCGCAGC TCACCTCGCA CCGCGACGTG
GGCATCGTGT CGTTCACCGG CAGCGTGGAG ACCGGCACGC GCATCATGAC CGCGGCGTCG
AAGAACCTCA CGCGCGTGAA CCTCGAGCTC GGCGGCAAGG CACCGGCCAT CGTGCTGGCC
GACGCCGACC TCGACCTGGC GGTGAAGGCC ATCTACGACT CGCGCGTGAT CAACACCGGA
CAGGTGTGCA ACTGCGCCGA GCGCGTGTAC GTGCAGCGCA AGGTGGCCGA CGAGTTCACC
AGCAAGATCG CCGCGCGCAT GGCCGGCACG CTGTACGGCG ACCCGCTGGC CCAGCCCGAC
GTGGCGATGG GTCCGCTGGT CAGCCAGGCC GGCCTCGACA AGGTGGCGGG CATGGTGGAC
CGCGCCCGCG CGGCCGGCGC CAGCATCGTG CAAGGTGGCC GCAAGGCCAA CCGCGACAAG
GGCTACCACT ACGAGCCCAC CGTCATCGCG AACTGCAGCG CCGACATGGA GATCATGCGC
AAGGAGATCT TCGGGCCGGT GCTGCCGATC CAGGTGGTGG ACGAGCTCGA CGAGGCGATC
GCGCTGGCGA ACGACTCCGA CTACGGCCTG ACCTCGTCGA TCTTCACCAA GGACCTGAAC
TCGGCCATGC GCGCGGTGCG CGACCTGCAG TTCGGCGAGA CCTACGTGAA CCGCGAGCAC
TTCGAGGCGA TGCAGGGCTT CCACGCCGGC CGCAAGAAGT CGGGCATCGG CGGGGCCGAT
GGCAAGCACG GCCTGTACGA GTTCACCGAG ACGCACGTGG TCTACATCCA GCACGGCTGA
 
Protein sequence
MVTEYRNYID GEFLANRSGA LIDVHNPATH ELLARVPDAP NDVVDLAVQA ARTAQPGWAK 
LPAIQRAQHL RAIAARLREN VEELAHTITA EQGKVLGLAR VEVNFTADYM DYMAEWARRL
EGEVLTSDRV GESIFLMRKP IGVAAGILPW NFPFFLIARK LAPALITGNT IVIKPSEITP
INAFEFARLA SQTDLPRGVF NLVGGTGAGA GAQLTSHRDV GIVSFTGSVE TGTRIMTAAS
KNLTRVNLEL GGKAPAIVLA DADLDLAVKA IYDSRVINTG QVCNCAERVY VQRKVADEFT
SKIAARMAGT LYGDPLAQPD VAMGPLVSQA GLDKVAGMVD RARAAGASIV QGGRKANRDK
GYHYEPTVIA NCSADMEIMR KEIFGPVLPI QVVDELDEAI ALANDSDYGL TSSIFTKDLN
SAMRAVRDLQ FGETYVNREH FEAMQGFHAG RKKSGIGGAD GKHGLYEFTE THVVYIQHG