Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1541 |
Symbol | |
ID | 4783559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1661655 |
End bp | 1662971 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640090108 |
Product | homoserine dehydrogenase |
Protein accession | YP_001020738 |
Protein GI | 124266734 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCGA TCCAAGTGGG CCTGCTCGGC GCGGGCGTGG TGGCTGGCGG CGTGGTGCAG GTGCTGCAGC GCAACCGGGC CGAGATCCAG CGCCGCGCCG GCCGCGGCAT CGAGGTCGCC GCGGTCGCCG CGCGCGACGT GGCCAAGGCG CGCGCCACGC TCGGCAGCGA GGTGGTGCTG ACCGACGACT TCGACGCCCT GGTGCGCCGG CCCGACATCG ACGTGGTCGT CGAGGTGATC GGCGGCACGA CACGGGCCCG CGAGCTGGTG CTGGCCGCCA TCGCCGCCGG CAAGCCGGTG GTCACCGCCA ACAAGGCGCT GCTGGCGGTA CACGGCACCG AGATCTTCGC GGCGGCGCGC GAGCGCGGTG TGGCGGTCGG CTTCGAGGCG GCAGTGGCCG GCGGCATCCC GATCATCAAG GCGCTGCGCG AGGGCCTGAC CGCGAACCGC ATCAACTGGA TCGCCGGCAT CATCAACGGC ACCACCAACT TCATCCTGTC GGAGATGCGC TCCAAGGGGC TGGACTTCGG CACCGTGCTG AAGGAAGCCC AGCGCCTGGG CTACGCCGAG ACCGACCCGA CCTTCGACAT CGAGGGCATC GACGCCGCGC ACAAGGCGAC GATCATGAGT GCGATCGCCT TCGGCACACC GGTGCAGTTC GAGCACGCCT ACATCGAAGG CATCACCAAG CTGCAGGCGG CCGACATCCG CTACGCCGAA CAGCTGGGCT ACCGCATCAA GCTGCTGGGC ATCACCAAGC GCCGCGAAGA TGCGCAGGGC ATCGAGCTGC GCGTGCACCC GACGCTGATC CCGGGCACGC GCCTGATCGC CAATGTGGAG GGTGCGATGA ATGCCGTGGT GGTGCAGGGC GACGCGGTCG GCTCCACGCT CTACTACGGC AAGGGCGCCG GCGCTGAGCC GACGGCCTCG GCCGTGGTCG CCGACCTGGT CGACGTGACC CGGCTCATCA CCGCCGACCC CGATCACCGC GTACCGCACC TGGCCTTCCA TCCCGATGCG GTGGCGGCCA CGCCCATCCT GCCGATCGAG CAGGTCGTCA CCGCCTTCTA CCTGCGCCTG CAGGTCGCCG ACAAGGCGGG CGTGCTGGCC AACATCACCC GCATCCTGGC CGACCACGCG ATCTCGATCG ATGCCGTGCT GCAGCGCGAG TCGGCCGAGG GCGAGAGCCA GACCGACCTG ATCATCCTGA CCCACGACAC GCTCGAAGGG AAGATGAACG CGGCGCTGGC GCAGATGCAG GCGCTGCCGA CCGTGCTGGC GCCCATCGTG CGCATCCGCA AGGAAGAACT GTCCTGA
|
Protein sequence | MKPIQVGLLG AGVVAGGVVQ VLQRNRAEIQ RRAGRGIEVA AVAARDVAKA RATLGSEVVL TDDFDALVRR PDIDVVVEVI GGTTRARELV LAAIAAGKPV VTANKALLAV HGTEIFAAAR ERGVAVGFEA AVAGGIPIIK ALREGLTANR INWIAGIING TTNFILSEMR SKGLDFGTVL KEAQRLGYAE TDPTFDIEGI DAAHKATIMS AIAFGTPVQF EHAYIEGITK LQAADIRYAE QLGYRIKLLG ITKRREDAQG IELRVHPTLI PGTRLIANVE GAMNAVVVQG DAVGSTLYYG KGAGAEPTAS AVVADLVDVT RLITADPDHR VPHLAFHPDA VAATPILPIE QVVTAFYLRL QVADKAGVLA NITRILADHA ISIDAVLQRE SAEGESQTDL IILTHDTLEG KMNAALAQMQ ALPTVLAPIV RIRKEELS
|
| |