Gene PICST_57166 details


Gene Information

Locus tag: PICST_57166
Symbol: PHH2
ID: 4838208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1690753
End bp: 1692927
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 12
GC content: 41%
IMG OID: 640389523
Product: phenol 2-monooxygenase
Protein accession: XP_001383611
Protein GI: 150864677
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
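
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, since the record does not say so) and that the unspliced CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1690753
end_bp = 1692927
gene_length_bp = 2175
protein_length_aa = 724

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
expected_aa = span_bp // 3 - 1

print(span_bp, span_bp % 3, expected_aa)  # 2175 0 724
```

Under those assumptions the span equals the listed gene length, is a whole number of codons, and gives the listed protein length.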


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.368445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.635098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCTA CTAATTCTAA AATTAAACAC TCGAACACCG ACGTCTTGAT CGTGGGTGCT 
GGTCCCTCCG GTCTAATGGC AGCAACCTGG TTGGCCAGAA CTGGTGTACC CTTCAGGATA
ATCGATAAGA GATCAAATGA CATTTTTTCT GGACAGGCTG ATGGTTTACA ATGTCGTTCG
CTCGAAGTGT TCAAATCCTT TTCGGGCACA ACCTTTGATA TGGCTGGAAT GGATTCGGCT
TGGAAGATGG GTAACCATAT GATTGAAATT TGTTTCTGGA GTCCAGACGA GAACGGCAAG
TTGGTGAGAT CAGGAAGGAT TCCCGATACT ATCCCAGGAA TTTCTCGTTT TCAGCAGGTA
GTTATTCACC AGGGCTACAT TGAGAATTGG TTTGAAAAAT CAATCAAGAA GTTCTCTGGA
GACTCGGTCG AGGTGGAAAG GCCATTCTTG CCTTTGAACA TCAAGATCGA TGAAAGTAAA
GTTGACGATC CAGACGAATA TCCCGTAGAA ATCTTAGTCA GAAACTTGAG TAAGGATAAA
TTGTCTAAAC CTGAACAATA CGACAATGCT GTAGCTAATG GTCTCTACAG ACAGTTCGAA
GGTGATCAAG AGAAGTTCTA TGACAACATG CAATCAGACA TTGAAAACGA TCCTACTTTG
GATGTAAGCG AGTATGAAAT TATTCACTGC AAGTACTTGC TTGGTTCAGA TGGTGCCCAT
AGTTGGGTGA GAAAACAATT AGGAATTGAC ATGGATGGTG AAACCACCGA TTTTGTTTGG
GGTGTCTTGG ATATGGTTCC AATCACTGAC TTCCCTGATA TCAGAAGTCG TTGTGCAATC
CATTCCAAAG ACTCTGGTTC AGTTATGGTT ATTCCAAGAG AGAATGATTT GGTTAGATTG
TATATCCAGT TGAAGGAAGT TCCTAGAGAC CCTAGGACAA AAACAGAAGC TTCAAAATAC
ACTGGAAATG TCGATGACAA GAATGCCGCT TCAAAGGGAA GAATAGATCG TTCGAAGATT
ACTCCAGAAC TAATCTTAAA AAATGCACAA GAAATCATGC TGCCATTCAA GTTAGAAATG
ACAGACTTGG ATTGGTTCAC TGGCTATCAA ATTGGTCAAA GAGTCAGTCC TCAATTCAAC
AAGTATAACA GAGTATTTAT ATCAGGCGAT GCTTGTCATA CACATTCACC AAAAGCTGGT
CAAGGTATGA ATGTCTCCAT GATGGATACT TACAATCTTG GGTTCAAGTT GGCACTTGTA
TGCAAAGGTT TGGCCAAACA AGACATATTG AAGACTTACG AAAGTGAAAG ACTTCAAGTT
GCTAAAGACT TGATTGCCTT TGACCATAAG CTTTCGAGAA TGTTCAGTGG TAAACCTATG
ATTCCTCAAG CGGAATCTTT AGAAGAAGGA GTTGATATGG ACGAGTTCCA TAAAGCTTTC
CAATTAGGTA ATGAATTTGC TTCGGGTACC ATCGTTGACT ACAACGACTC GGTATTGATC
GACAAATCGA ACGTTCTACC TGCTGATGAA AAAGATAGAA CGATGTCCAA GTATGCAAAG
AAGATACCTG TAGGAAGAAG ATTGAACACA GTCAAGATTA TCGCGCATGC TGACGGAAGA
CCTTATCATA TAGCTGATCG TCTTCTTTCG GACGGCCGTT TCAGAGTATT GATATTTTCG
GGTGATGTTA AGAAGTTCAC AAAAAACATG GACACATTGG ATGAATTCCA GACTTACCTT
GAATCTGAAG AACATTTTGC TAAGAAGTTC ACACCTCCAA ATGCCTTCGA TAACTCTGTT
ATCGATGTCA TTACAGTTCA CGCATCTAAC CGTTACGATG TAGAACTATT TGACTTCCCA
GAATTTACTA GGCCACTTGA TTTCAAGTCT AGACGTGACT ACTGGAGACT TTATTCTGGT
GTTGGCCGGA CTTACCATGA AGGTACACTT GATGCCTATG AAGAGTACGG CATTGACAAA
GAAAAAGGTG CTGTCGTAGT TGTCAGACCA GACGGCTATG TTTCTTTGGT CACTGAGTTT
GGTATTTCTG GATTGAAGGA AATCGATGCC TACTTTGACA AGTTCATGAT TCCTCAGTCG
AAAAACGCTT TACCAGTTAA GTCACAAGGA GTTGATGACA AAAAGAGATT CGTGAAGCCA
TTGTTAGCTG TTTAG
 
Protein sequence
MTATNSKIKH SNTDVLIVGA GPSGLMAATW LARTGVPFRI IDKRSNDIFS GQADGLQCRS 
LEVFKSFSGT TFDMAGMDSA WKMGNHMIEI CFWSPDENGK LVRSGRIPDT IPGISRFQQV
VIHQGYIENW FEKSIKKFSG DSVEVERPFL PLNIKIDESK VDDPDEYPVE ILVRNLSKDK
LSKPEQYDNA VANGLYRQFE GDQEKFYDNM QSDIENDPTL DVSEYEIIHC KYLLGSDGAH
SWVRKQLGID MDGETTDFVW GVLDMVPITD FPDIRSRCAI HSKDSGSVMV IPRENDLVRL
YIQLKEVPRD PRTKTEASKY TGNVDDKNAA SKGRIDRSKI TPELILKNAQ EIMSPFKLEM
TDLDWFTGYQ IGQRVSPQFN KYNRVFISGD ACHTHSPKAG QGMNVSMMDT YNLGFKLALV
CKGLAKQDIL KTYESERLQV AKDLIAFDHK LSRMFSGKPM IPQAESLEEG VDMDEFHKAF
QLGNEFASGT IVDYNDSVLI DKSNVLPADE KDRTMSKYAK KIPVGRRLNT VKIIAHADGR
PYHIADRLLS DGRFRVLIFS GDVKKFTKNM DTLDEFQTYL ESEEHFAKKF TPPNAFDNSV
IDVITVHASN RYDVELFDFP EFTRPLDFKS RRDYWRLYSG VGRTYHEGTL DAYEEYGIDK
EKGAVVVVRP DGYVSLVTEF GISGLKEIDA YFDKFMIPQS KNALPVKSQG VDDKKRFVKP
LLAV
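
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows the effect on one in-frame 30-nucleotide fragment of the gene sequence above that contains a CTG codon. The helper names `codon_table` and `translate` are hypothetical and written for this illustration; they do not come from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def codon_table(ctg_as_serine):
    """Build a codon -> amino acid map; optionally apply the table 12 change."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD))
    if ctg_as_serine:
        table["CTG"] = "S"  # the only difference modeled here
    return table

def translate(dna, table):
    """Translate from the first base, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# Fragment copied from the gene sequence above (it starts on a codon boundary).
fragment = "GAAATCATGC TGCCATTCAA GTTAGAAATG"

print(translate(fragment, codon_table(True)))   # EIMSPFKLEM
print(translate(fragment, codon_table(False)))  # EIMLPFKLEM
```

The table 12 result matches the corresponding block of the protein sequence above (EIMSPFKLEM), while the standard code would place a leucine at that position.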