Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0970 |
Symbol | |
ID | 4787116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1033006 |
End bp | 1034358 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089532 |
Product | D-glucarate dehydratase |
Protein accession | YP_001020167 |
Protein GI | 124266163 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.886707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAACA CCTCCTTGTC TTCCACGCCG CGTGTCACCG CGATGCGCGT CATCCCGGTC GCGGGGCGCG ACTCGATGCT GATGAACCTG AGCGGCGCGC ACGGGCCGTT CTTCACCCGC AACCTCGTGA TCCTGAGCGA CAACGCCGGG CGGAGCGGCG TCGGCGAGGT GCCGGGCGGC GAGAAGATCC GCCAGACGCT GGAGGACGCG CGCGAGTGGG TCGTCGGCCA ACCGGTCGGC GACGTGCAGC GCGTGCTGCG TACGGTGCGC GAGCGCTTCG CCGACCGCGA CGCCGGCGGC CGCGGGCTGC AGACCTTCGA CCTGCGCACC ACCATCCACG TCGTGACGGC CGTCGAGTCG GCGCTGCTCG ACCTGCTGGG CCAGCACCTC GAGCTGCCGG TCGCGGCGCT GCTCGGCGAG GGCCAGCAGC GCACGTCGGT CGAGATGCTC GGCTACCTGT TCTTCGTCGG TGACCGCACG AAGACCGACC TGGCCTACGC GGGCCCGGCC GACGAGGCGA CCGACGCCGA CGACTGGCAG CGCCTGCGCC ACGAGCCGGC GATGACGCCC GAGGAGGTGG TGCGGCTGGC GGAGGCGGCG CGCGCGCGCT ACGGCTTCAA CGACTTCAAG CTCAAGGGCG GCGTGCTGGC CGGCGACGCC GAGGTCGACG CGGTCACCGC GATCCACGAG CGCTTCCCCG ACGCCCGCGT GACGCTCGAT CCCAACGGCG GCTGGCTGCT CAAGGACGCG ATCCGCCTCG GCCAGCGCAT GCGCGGCGTG GTGGCCTACG CCGAGGACCC CTGCGGTGCG GAAGAGGGCT ACTCCGGCCG CGAGGTGATG GCCGAGTTCC GCCGCGCCAC CGGCCTGCCG ACCGCGACCA ACATGGTCGC CACCGACTGG CGCCAGCTCA CGCACGCGCT GTCGCTGCAG TCGGTCGACA TCCCGCTCGC CGATCCGCAT TTCTGGACGA TGGCCGGCTC GGTGCGCGTG GCGCAGACCT GCCGCGACTG GGGCCTGACC TGGGGCTCGC ACTCGAACAA CCACTTCGAC GTCTCGCTGG CGATGTTCAC CCACGTCGGT GCCGCGGCGC CGGGCCGCGT GACCGCGATC GACACGCACT GGATCTGGCA GGACGGACAG CGCCTGACGA AGGAGCCGCT GCAGATCGTC GGTGGCCACG TGCGGGTGCC GCAGCGGCCG GGCCTGGGCA TCGAGCTCGA CATGGCCGAG GTCGAGAAGG CCCACCGCCT CTACCTCGAG CACGGCCTGG GCGCGCGCGA CGACGCGGTG GCGATGCAGC ACCTGATCCC GAACTGGCGC TTCGATCCAA AACGGCCCTG CATGCTGCGC TGA
|
Protein sequence | MLNTSLSSTP RVTAMRVIPV AGRDSMLMNL SGAHGPFFTR NLVILSDNAG RSGVGEVPGG EKIRQTLEDA REWVVGQPVG DVQRVLRTVR ERFADRDAGG RGLQTFDLRT TIHVVTAVES ALLDLLGQHL ELPVAALLGE GQQRTSVEML GYLFFVGDRT KTDLAYAGPA DEATDADDWQ RLRHEPAMTP EEVVRLAEAA RARYGFNDFK LKGGVLAGDA EVDAVTAIHE RFPDARVTLD PNGGWLLKDA IRLGQRMRGV VAYAEDPCGA EEGYSGREVM AEFRRATGLP TATNMVATDW RQLTHALSLQ SVDIPLADPH FWTMAGSVRV AQTCRDWGLT WGSHSNNHFD VSLAMFTHVG AAAPGRVTAI DTHWIWQDGQ RLTKEPLQIV GGHVRVPQRP GLGIELDMAE VEKAHRLYLE HGLGARDDAV AMQHLIPNWR FDPKRPCMLR
|
| |