Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4008 |
Symbol | |
ID | 5831489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4448203 |
End bp | 4449087 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641369800 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001641450 |
Protein GI | 163853407 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTGA TCCGGCACGG AGCGAGCGGT GACGAGAAGC CGGGCCTCGT GGATGCGAAG GGCGGCTTGC GCGACCTGTC GGGAACCCTG CGTGATCTTG CCGGACCGGG CCTCTCACGG GAATCGCTCG ACCGACTCGC GCGGATCGAC CCCGAAAGCC TGCCGCTGCT GCCGCCCGGC ACCCGCCTCG GCCCCTGCGT CGGCGGCACC CGCAACTTCG TGGCGATCGG CCTGAACTAC GCCGACCACG CCGCCGAGAC CGGGGCCGCG ATCCCGGCCG AGCCGATCAT CTTCAACAAG GCGCCCTCCT GCATCGTCGG CCCCAACGAC ACGGTGATCC TGCCCAAAGC CTCGGCCAAG ACCGACTGGG AGGTGGAACT CGCCGTGGTG ATCGGCGCCC GCGCCTCCTA CGTCCATGCC AACGAGGCGC TGCGCTACGT GGCGGGCGTG TGCATCTGCA ACGACCTGTC CGAGCGCGAA TTCCAGATGG AGCGCGGCGG CACCTGGACC AAGGGCAAGG GCTGCCCGAC CTTCGGCCCG CTCGGCCCCT GGCTCGTCAC CCTCGACGAG ATCCCGGACC TGAAGAACCT CTCGATGAGC CTCGATCTCA ACGGCCGGCG GATGCAGACG GGCTCGACCG CGACGATGAT CTTCGACGTG GCGCAGATCG TCGCCTACGT CTCGCATTTC ATGATGCTGG AGCCTGGCGA CGTCATCACC ACCGGCACCC CGCCGGGCGT CGGCCTCGGC ATGAAGCCGC CGCGCTACCT CAGGAGCGAC GACGAGATGG TGCTGCGCAT CGACGGCCTC GGCGAGCAGC GCCAGCGGGT CGTGGCCTTC GATGACTGGA CCGCCAAGGT CGCCGCCGGC GAACCGACGA ACTGA
|
Protein sequence | MKLIRHGASG DEKPGLVDAK GGLRDLSGTL RDLAGPGLSR ESLDRLARID PESLPLLPPG TRLGPCVGGT RNFVAIGLNY ADHAAETGAA IPAEPIIFNK APSCIVGPND TVILPKASAK TDWEVELAVV IGARASYVHA NEALRYVAGV CICNDLSERE FQMERGGTWT KGKGCPTFGP LGPWLVTLDE IPDLKNLSMS LDLNGRRMQT GSTATMIFDV AQIVAYVSHF MMLEPGDVIT TGTPPGVGLG MKPPRYLRSD DEMVLRIDGL GEQRQRVVAF DDWTAKVAAG EPTN
|
| |