Gene Information |
Locus tag | Sare_1268 |
Symbol | mhpA |
ID | 5704481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1468568 |
End bp | 1470160 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270783 |
Product | 3-(3-hydroxyphenyl)propionate hydroxylase |
Protein accession | YP_001536164 |
Protein GI | 159036911 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
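The coordinate, gene-length, and protein-length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Sare_1268.
length = gene_length(1468568, 1470160)
print(length, protein_length(length))  # prints: 1593 530
```

Both results match the Gene Length (1593 bp) and Protein Length (530 aa) rows of the record.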
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000150974 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAGCGACG CGCACGTCGT CATCGTCGGC AACGGGCCCG TCGGTGCCAC CCTGTCGGTG CTGCTCGCCC AACGCGGCTG GCGGGTGACC GTGATCGAAC GCCGCCCCCG GCCGTACCGG CTTCCTCGGG CGACCAGCTT CGACGGTGAG ACCGCCCGCC TCCTGGCCGA CACCGGGATC GGCCCGGACC TCGGCCGGAT CACGGCTCCG GCCAACGGCT ACCAGTGGCG CACGGCCGCC GGTGAGACCC TGCTGGATAT CGCGTTCAGC ACGCCCGGCG CGTACGGCTG GCCGGACGTG AACACGATGC ACCAGCCGGC CCTGGAGGAG CTCATCGCCA CCCGGGCGAC GTCGTTGCCC AGTCTCACCC TGCTGCGCGG ACACGAGGTC GTGGGCATCA GCGAGCACCA CGGCCGGGTG GAGGTGATCG CCACCGACGA CGACGGCACG ACGCGGCGGG TCTCCGCGCA CTGGGTCGTG GGATGCGACG GCGCCAACAG CTTCGTCCGC CACCACCTCG ACGTTCCCGT GACGGACCTC GGGTTCTCCT ACGAGTGGCT GCTCTGCGAC GTCGAACTCC ACGAGCCGCG CGAGTTCGTC CCCACCAATG TGCAGATCTG CGATCCGGCG CGGCCCACGA CCCAGGTGGG CGGCGGCCCT GGACGACGGC GCTGGGAATT CATGCGCCTG CCCGGCGAAA GCACGGCCGA GCTGAACCGG GACGAGACGG CGTGGCGTCT GCTGGCACCG TTCGGCGTCA GACCCGACAC CGCCACCCTC CTGCGCCACA CCACGTACAT CTTCCAGGCC CGGTGGGCCG AGCGGTGGCG GGTGGGTCAC GTCCTGCTGG CCGGCGACGC GGCCCACCTC ATGCCACCGT TCGCCGGTCA GGGCATGTGC GCCGGCATCC GGGACGTCGT CAACCTGGCG TGGAAGCTCG ACCTCACCCT GCGCGGACTC GCGGCAGAGT CCCTGATCGA CTCCTACCAA CAGGAACGCC GCGCACAGGC CAAGGAGGCG ATCCTGGCGT CGGTCCAGCT GGGCCGAGTG ATCTGCGTGA CCGATCCGGT GGCCGCCGCC GAGCGCGACG CCACGGTGCT GGCCAACCGC CGAGGGCAGC CCCCGGTCCG GCCGGATCCG GCGAAGCCGC TCTCCGACGG GCTGCTGCAC CGGCGGCCCG GGGCGGACGC GGCCGAGCCG CCGGCCGGTG CGGTCGCGCC GCAGGGCCGA GTGGCCCTGG GCCGCGACAT CGGGCTGTTC GACGACGTCG TCGGCCGGGG CTTCGTCCTG CTGACCACCG AGGATCCGCA CACCGCACTC GACGACGATC GGCTGTCGTT CCTCGGCACG CTGGACACCC ACATGGTGCG GCTGCTGCCC CCCGGTAACG CGGTGGAGCA GGGCGCGGTG GACGTGGACG ACGTCTACCG CCCGTATCTG ACGCGGTGCG GGGCGACCGC CCTGCTCATC CGGCCCGATC ACCACGTCTT CGGTGCTGGA AGCGGCCCGA GCGGCATCCG GGACCTCGTC GACGACCTGC GACACCAGCT ACGAGCTCCG GCGCCGGTGA GTGCGCTAGG GACCGTCGGC TGA
|
Protein sequence | MSDAHVVIVG NGPVGATLSV LLAQRGWRVT VIERRPRPYR LPRATSFDGE TARLLADTGI GPDLGRITAP ANGYQWRTAA GETLLDIAFS TPGAYGWPDV NTMHQPALEE LIATRATSLP SLTLLRGHEV VGISEHHGRV EVIATDDDGT TRRVSAHWVV GCDGANSFVR HHLDVPVTDL GFSYEWLLCD VELHEPREFV PTNVQICDPA RPTTQVGGGP GRRRWEFMRL PGESTAELNR DETAWRLLAP FGVRPDTATL LRHTTYIFQA RWAERWRVGH VLLAGDAAHL MPPFAGQGMC AGIRDVVNLA WKLDLTLRGL AAESLIDSYQ QERRAQAKEA ILASVQLGRV ICVTDPVAAA ERDATVLANR RGQPPVRPDP AKPLSDGLLH RRPGADAAEP PAGAVAPQGR VALGRDIGLF DDVVGRGFVL LTTEDPHTAL DDDRLSFLGT LDTHMVRLLP PGNAVEQGAV DVDDVYRPYL TRCGATALLI RPDHHVFGAG SGPSGIRDLV DDLRHQLRAP APVSALGTVG
|
| |
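The sequences above are displayed in space-separated blocks of 10 characters, and the gene begins with TTG while the protein begins with M. That is expected under translation table 11, where TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. Below is a minimal Python sketch, assuming the block spacing is purely cosmetic. The helper names are hypothetical, and the start-codon set is written from NCBI's description of table 11 and should be checked against the current table before being relied on.

```python
# Start codons listed for NCBI translation table 11 (assumption: verify against NCBI).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def ungroup(blocks: str) -> str:
    """Remove the cosmetic whitespace between 10-character display blocks."""
    return "".join(blocks.split()).upper()


def has_valid_start(cds: str) -> bool:
    """True if the first codon is an allowed table 11 start codon."""
    return cds[:3] in TABLE_11_STARTS


# First three display blocks of the gene sequence from the record.
prefix = ungroup("TTGAGCGACG CGCACGTCGT CATCGTCGGC")
print(len(prefix), has_valid_start(prefix))  # prints: 30 True
```

Applying `ungroup` to the full Gene sequence row should give a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost when copying the record.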