Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_57166 |
Symbol | PHH2 |
ID | 4838208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1690753 |
End bp | 1692927 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389523 |
Product | phenol 2-monooxygenase |
Protein accession | XP_001383611 |
Protein GI | 150864677 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.368445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.635098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTA CTAATTCTAA AATTAAACAC TCGAACACCG ACGTCTTGAT CGTGGGTGCT GGTCCCTCCG GTCTAATGGC AGCAACCTGG TTGGCCAGAA CTGGTGTACC CTTCAGGATA ATCGATAAGA GATCAAATGA CATTTTTTCT GGACAGGCTG ATGGTTTACA ATGTCGTTCG CTCGAAGTGT TCAAATCCTT TTCGGGCACA ACCTTTGATA TGGCTGGAAT GGATTCGGCT TGGAAGATGG GTAACCATAT GATTGAAATT TGTTTCTGGA GTCCAGACGA GAACGGCAAG TTGGTGAGAT CAGGAAGGAT TCCCGATACT ATCCCAGGAA TTTCTCGTTT TCAGCAGGTA GTTATTCACC AGGGCTACAT TGAGAATTGG TTTGAAAAAT CAATCAAGAA GTTCTCTGGA GACTCGGTCG AGGTGGAAAG GCCATTCTTG CCTTTGAACA TCAAGATCGA TGAAAGTAAA GTTGACGATC CAGACGAATA TCCCGTAGAA ATCTTAGTCA GAAACTTGAG TAAGGATAAA TTGTCTAAAC CTGAACAATA CGACAATGCT GTAGCTAATG GTCTCTACAG ACAGTTCGAA GGTGATCAAG AGAAGTTCTA TGACAACATG CAATCAGACA TTGAAAACGA TCCTACTTTG GATGTAAGCG AGTATGAAAT TATTCACTGC AAGTACTTGC TTGGTTCAGA TGGTGCCCAT AGTTGGGTGA GAAAACAATT AGGAATTGAC ATGGATGGTG AAACCACCGA TTTTGTTTGG GGTGTCTTGG ATATGGTTCC AATCACTGAC TTCCCTGATA TCAGAAGTCG TTGTGCAATC CATTCCAAAG ACTCTGGTTC AGTTATGGTT ATTCCAAGAG AGAATGATTT GGTTAGATTG TATATCCAGT TGAAGGAAGT TCCTAGAGAC CCTAGGACAA AAACAGAAGC TTCAAAATAC ACTGGAAATG TCGATGACAA GAATGCCGCT TCAAAGGGAA GAATAGATCG TTCGAAGATT ACTCCAGAAC TAATCTTAAA AAATGCACAA GAAATCATGC TGCCATTCAA GTTAGAAATG ACAGACTTGG ATTGGTTCAC TGGCTATCAA ATTGGTCAAA GAGTCAGTCC TCAATTCAAC AAGTATAACA GAGTATTTAT ATCAGGCGAT GCTTGTCATA CACATTCACC AAAAGCTGGT CAAGGTATGA ATGTCTCCAT GATGGATACT TACAATCTTG GGTTCAAGTT GGCACTTGTA TGCAAAGGTT TGGCCAAACA AGACATATTG AAGACTTACG AAAGTGAAAG ACTTCAAGTT GCTAAAGACT TGATTGCCTT TGACCATAAG CTTTCGAGAA TGTTCAGTGG TAAACCTATG ATTCCTCAAG CGGAATCTTT AGAAGAAGGA GTTGATATGG ACGAGTTCCA TAAAGCTTTC CAATTAGGTA ATGAATTTGC TTCGGGTACC ATCGTTGACT ACAACGACTC GGTATTGATC GACAAATCGA ACGTTCTACC TGCTGATGAA AAAGATAGAA CGATGTCCAA GTATGCAAAG AAGATACCTG TAGGAAGAAG ATTGAACACA GTCAAGATTA TCGCGCATGC TGACGGAAGA CCTTATCATA TAGCTGATCG TCTTCTTTCG GACGGCCGTT TCAGAGTATT GATATTTTCG GGTGATGTTA AGAAGTTCAC AAAAAACATG GACACATTGG ATGAATTCCA GACTTACCTT GAATCTGAAG AACATTTTGC TAAGAAGTTC ACACCTCCAA ATGCCTTCGA TAACTCTGTT ATCGATGTCA TTACAGTTCA CGCATCTAAC CGTTACGATG TAGAACTATT TGACTTCCCA GAATTTACTA GGCCACTTGA TTTCAAGTCT AGACGTGACT ACTGGAGACT TTATTCTGGT GTTGGCCGGA CTTACCATGA AGGTACACTT GATGCCTATG AAGAGTACGG CATTGACAAA GAAAAAGGTG CTGTCGTAGT TGTCAGACCA GACGGCTATG TTTCTTTGGT CACTGAGTTT GGTATTTCTG GATTGAAGGA AATCGATGCC TACTTTGACA AGTTCATGAT TCCTCAGTCG AAAAACGCTT TACCAGTTAA GTCACAAGGA GTTGATGACA AAAAGAGATT CGTGAAGCCA TTGTTAGCTG TTTAG
|
Protein sequence | MTATNSKIKH SNTDVLIVGA GPSGLMAATW LARTGVPFRI IDKRSNDIFS GQADGLQCRS LEVFKSFSGT TFDMAGMDSA WKMGNHMIEI CFWSPDENGK LVRSGRIPDT IPGISRFQQV VIHQGYIENW FEKSIKKFSG DSVEVERPFL PLNIKIDESK VDDPDEYPVE ILVRNLSKDK LSKPEQYDNA VANGLYRQFE GDQEKFYDNM QSDIENDPTL DVSEYEIIHC KYLLGSDGAH SWVRKQLGID MDGETTDFVW GVLDMVPITD FPDIRSRCAI HSKDSGSVMV IPRENDLVRL YIQLKEVPRD PRTKTEASKY TGNVDDKNAA SKGRIDRSKI TPELILKNAQ EIMSPFKLEM TDLDWFTGYQ IGQRVSPQFN KYNRVFISGD ACHTHSPKAG QGMNVSMMDT YNLGFKLALV CKGLAKQDIL KTYESERLQV AKDLIAFDHK LSRMFSGKPM IPQAESLEEG VDMDEFHKAF QLGNEFASGT IVDYNDSVLI DKSNVLPADE KDRTMSKYAK KIPVGRRLNT VKIIAHADGR PYHIADRLLS DGRFRVLIFS GDVKKFTKNM DTLDEFQTYL ESEEHFAKKF TPPNAFDNSV IDVITVHASN RYDVELFDFP EFTRPLDFKS RRDYWRLYSG VGRTYHEGTL DAYEEYGIDK EKGAVVVVRP DGYVSLVTEF GISGLKEIDA YFDKFMIPQS KNALPVKSQG VDDKKRFVKP LLAV
|
| |