Gene Information |
Locus tag | Pmen_1697 |
Symbol | |
ID | 5109671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas mendocina ymp |
Kingdom | Bacteria |
Replicon accession | NC_009439 |
Strand | + |
Start bp | 1866505 |
End bp | 1867644 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640502926 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001187193 |
Protein GI | 146306728 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
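The listed lengths follow from the coordinates, assuming that Start bp and End bp are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. A minimal sketch of that arithmetic (the function names are hypothetical and not part of the record):

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1866505, 1867644)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the two results agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields.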
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.196029 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCC AATGGATCCG CTTTCCCCTG CGTGAAGGCG AGTGCTCGCG CCAGGCGCAT TGCGACCTGC CGCAGGGCAC CTACGAGCGC GAGATGGGCC GTGAGGGCTT CTTCGGCCCC ACCGCGCACC TGCACCACAA GCATCCGCCC ACCGGCTGGA TCGACTGGGA AGGCCCGCTG CGCCCGCACG CGTTCAACTT CAACGACATC CCCAGCGAGC GCGACTGCCC GCTGGCGGCG CCGCTGACCC TGCACAACGC CGATGTGAAG TTGCGCGTGT GGCGCACCCA CGGCGCCATG CGCCACCTGG TGCGCAACGC CGACGGCGAC GAGCTGCTGT TCGTCCACGA GGGGGCAGGG CACCTGTATT GCGATTTCGG CCATCTGGAG TACCGCGACG GCGATTACCT GCTGATCCCC CGCGGCACCG CCTGGCGCAT CGAGGCCAGC ACGCCGAGCT ACTTCCTGCT GATCGAGAAC AGCGACGGCG CCTACCAGCT GCCGGACAAG GGCCTGCTGG GCCCACAGGC GATCTTCGAC CCCGCCGTGC TCGATCATCC GCGGCTCGAC GAGGCCTTCA AGGCGCAGCA GGACGAGAAC ACCTGGCAGA TCAGGATCAA GCGGCGCAAT CAGATCAGCA CCGTGACCTA CCCGTACAAC CCGCTGGACG TGGTCGGCTG GCACGGCGAC AACACCGTGG TGCGCCTGAA CTGGCGCGAC ATTCGTCCGC TGCTCAGTCA CCGCTATCAC CTGCCGCCGT CGGCGCACAC CACCTTCGTC GCCAACGGCT TCGTGGTCTG CACCTTCACC CCGCGGCCGG TCGAATCCGA CCCCGGCGCG CTCAAGGTGC CGTTCTATCA CAACAACGAC GACTACGACG AAGTGCTGTT TTACCACCGC GGCAACTTCT TCAGCCGCGA CAACATCGAG GCCGGGATGG TCACTCTGCA CCCCTGCGGT TTCCCCCACG GGCCGCACCC CAAGGCGCTG AAAAAGAGCC AGGAGGACCC GGCGACCTTC ATCGACGAGG TGGCGGTGAT GATCGACACC CGCCGCGCCC TGGAAGTGGC CGATGCCGCC GACGCGGTGG ACGTGGCCGA GTACGTCAAC TCCTGGCGCG CGCCGGGTAC ACAAGGTTAA
|
Protein sequence | MSRQWIRFPL REGECSRQAH CDLPQGTYER EMGREGFFGP TAHLHHKHPP TGWIDWEGPL RPHAFNFNDI PSERDCPLAA PLTLHNADVK LRVWRTHGAM RHLVRNADGD ELLFVHEGAG HLYCDFGHLE YRDGDYLLIP RGTAWRIEAS TPSYFLLIEN SDGAYQLPDK GLLGPQAIFD PAVLDHPRLD EAFKAQQDEN TWQIRIKRRN QISTVTYPYN PLDVVGWHGD NTVVRLNWRD IRPLLSHRYH LPPSAHTTFV ANGFVVCTFT PRPVESDPGA LKVPFYHNND DYDEVLFYHR GNFFSRDNIE AGMVTLHPCG FPHGPHPKAL KKSQEDPATF IDEVAVMIDT RRALEVADAA DAVDVAEYVN SWRAPGTQG
|
| |
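The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch only: the helper names are hypothetical, the codon table is written out by hand using the standard amino-acid assignments (which translation table 11 shares with the standard code), and it assumes the sequence contains only A, C, G, and T plus the spacing shown above.

```python
# Codons ordered TTT, TTC, TTA, TTG, TCT, ... with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon.

    Alternative start codons (allowed by table 11) are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence, with a stop codon appended.
prefix = "ATGAGCCGCC AATGGATCCG CTTTCCCCTG" + "TAA"
print(translate(prefix))  # MSRQWIRFPL, the first ten residues of the protein
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the GC content field.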