Gene Mpe_A1541 details

Gene Information

Locus tag: Mpe_A1541
Symbol:
ID: 4783559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1661655
End bp: 1662971
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 70%
IMG OID: 640090108
Product: homoserine dehydrogenase
Protein accession: YP_001020738
Protein GI: 124266734
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
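
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1661655, 1662971)
print(length, protein_length_aa(length))  # prints: 1317 438
```

Both results agree with the Gene Length and Protein Length fields.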


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCGA TCCAAGTGGG CCTGCTCGGC GCGGGCGTGG TGGCTGGCGG CGTGGTGCAG 
GTGCTGCAGC GCAACCGGGC CGAGATCCAG CGCCGCGCCG GCCGCGGCAT CGAGGTCGCC
GCGGTCGCCG CGCGCGACGT GGCCAAGGCG CGCGCCACGC TCGGCAGCGA GGTGGTGCTG
ACCGACGACT TCGACGCCCT GGTGCGCCGG CCCGACATCG ACGTGGTCGT CGAGGTGATC
GGCGGCACGA CACGGGCCCG CGAGCTGGTG CTGGCCGCCA TCGCCGCCGG CAAGCCGGTG
GTCACCGCCA ACAAGGCGCT GCTGGCGGTA CACGGCACCG AGATCTTCGC GGCGGCGCGC
GAGCGCGGTG TGGCGGTCGG CTTCGAGGCG GCAGTGGCCG GCGGCATCCC GATCATCAAG
GCGCTGCGCG AGGGCCTGAC CGCGAACCGC ATCAACTGGA TCGCCGGCAT CATCAACGGC
ACCACCAACT TCATCCTGTC GGAGATGCGC TCCAAGGGGC TGGACTTCGG CACCGTGCTG
AAGGAAGCCC AGCGCCTGGG CTACGCCGAG ACCGACCCGA CCTTCGACAT CGAGGGCATC
GACGCCGCGC ACAAGGCGAC GATCATGAGT GCGATCGCCT TCGGCACACC GGTGCAGTTC
GAGCACGCCT ACATCGAAGG CATCACCAAG CTGCAGGCGG CCGACATCCG CTACGCCGAA
CAGCTGGGCT ACCGCATCAA GCTGCTGGGC ATCACCAAGC GCCGCGAAGA TGCGCAGGGC
ATCGAGCTGC GCGTGCACCC GACGCTGATC CCGGGCACGC GCCTGATCGC CAATGTGGAG
GGTGCGATGA ATGCCGTGGT GGTGCAGGGC GACGCGGTCG GCTCCACGCT CTACTACGGC
AAGGGCGCCG GCGCTGAGCC GACGGCCTCG GCCGTGGTCG CCGACCTGGT CGACGTGACC
CGGCTCATCA CCGCCGACCC CGATCACCGC GTACCGCACC TGGCCTTCCA TCCCGATGCG
GTGGCGGCCA CGCCCATCCT GCCGATCGAG CAGGTCGTCA CCGCCTTCTA CCTGCGCCTG
CAGGTCGCCG ACAAGGCGGG CGTGCTGGCC AACATCACCC GCATCCTGGC CGACCACGCG
ATCTCGATCG ATGCCGTGCT GCAGCGCGAG TCGGCCGAGG GCGAGAGCCA GACCGACCTG
ATCATCCTGA CCCACGACAC GCTCGAAGGG AAGATGAACG CGGCGCTGGC GCAGATGCAG
GCGCTGCCGA CCGTGCTGGC GCCCATCGTG CGCATCCGCA AGGAAGAACT GTCCTGA
 
Protein sequence
MKPIQVGLLG AGVVAGGVVQ VLQRNRAEIQ RRAGRGIEVA AVAARDVAKA RATLGSEVVL 
TDDFDALVRR PDIDVVVEVI GGTTRARELV LAAIAAGKPV VTANKALLAV HGTEIFAAAR
ERGVAVGFEA AVAGGIPIIK ALREGLTANR INWIAGIING TTNFILSEMR SKGLDFGTVL
KEAQRLGYAE TDPTFDIEGI DAAHKATIMS AIAFGTPVQF EHAYIEGITK LQAADIRYAE
QLGYRIKLLG ITKRREDAQG IELRVHPTLI PGTRLIANVE GAMNAVVVQG DAVGSTLYYG
KGAGAEPTAS AVVADLVDVT RLITADPDHR VPHLAFHPDA VAATPILPIE QVVTAFYLRL
QVADKAGVLA NITRILADHA ISIDAVLQRE SAEGESQTDL IILTHDTLEG KMNAALAQMQ
ALPTVLAPIV RIRKEELS
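
The relationship between the gene sequence and the protein sequence above can be reproduced in a few lines. This is a sketch only: the helper names are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). For brevity it is applied to the first 30 and last 18 bases of the gene sequence rather than the full record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups and the final 18 bases of the gene sequence above.
print(translate("ATGAAACCGA TCCAAGTGGG CCTGCTCGGC"))  # prints: MKPIQVGLLG
print(translate("AAGGAAGAACT GTCCTGA"))               # prints: KEELS
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.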