Gene PICST_50409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50409 
SymbolNHG2 
ID4840879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp241978 
End bp243321 
Gene Length1344 bp 
Protein Length447 aa 
Translation table12 
GC content44% 
IMG OID640392194 
ProductSalicylate hydroxylase (Salicylate 1-monooxygenase) 
Protein accessionXP_001386445 
Protein GI150866748 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0406434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATTA CACAGTCAAG AATTCCAACT TCTTCCATTT CCCTACATTT CATTGTTGTT 
GGTGCTGGTT TGGGTGGAGT AACGGCTTCG ATTGCTCTCA CCTTAGCTGG GCATAAGGTG
ACCGTTTTAG AAGCTGCTCC CATATTAGGC GAAGTTGGTG CTGGGATACA GATTCCTCCT
CCTTCAGTTA AGATTCTTCA GGCTATTGGA GCCTTGGACG AAGTACTCTC CAATGCTACT
TTTCCTCGTG AGTTTCAGAT TCATAGTTGG AAAGAAGGCA AGATTATTTC CAAACAGAAC
TTGATCCCCT ACACTATGGA AAAGTACAAT TCTCACTACT TGCACATTCA TCGGGCCGAC
TACCATCGAG CCTTGGTTAA CCGAGCCATT GAAGTCGGAG TTGAGATTGT TTTGGATGCC
CGGGTCAACC ACATTGATTT TGAAAAGGCC ACCGTTGCTA CTGCTCGTGG AGAAGTTTAT
TCTGGGGACG TCGTAGTTGG ATTCGATGGC ATAAAGTCTC GCTTGAGATC CTTCATTTTG
GGCTACGAAG ACTTACCTTA CGACACAGGC GATCTTGCCT ATAGGGCAAT AATCAAAGTA
AGCGAAATGA AGAAGGATCC GGAGCTTGTT CCTTTCATTG AAGAACCAAA CATACACTTC
TGGTGGGGTC CTCGATGCCA TGTTGTATTA TATTTATTGC AAGGCGGTGA GAGTTGCAAT
GTTGTCATAC TTACACCAGA TACCTTGCCT AAAGATGAAG CAGTTCAGCC TGCCAAGGTA
GAGGAGTTAT TGGAGTTGTT TAAGGACTGG GATCCAAGAT TGAACTCCAT CTTTAAAAAC
ATACATAGTA CCAGCAAATG GAGATTGCAA AATTCAAGAG AATTATCCAC ATGGACACAT
GAAGAAGGAA ATGTGATCAT CTTGGGAGAT GCATCGCATG CCACTTTGCC CTACTTGGCA
TCCGGAGCCT CTCAAGCCTT GGAAGATGCT GCGGTATTGG CAGGTTTGTT CGGCAGAATT
GAACATCGGG GCCAGATTCA TGATCTTCTC AATCTAACGG AATCTTTGAG AAAATGGAGA
TCTACTCAAG TTGTCCAAGG TTCCACTCAA TGCAGAAATA TCTACCATTT ACCAGATGGT
AAAGAACAAC AGTTAAGAGA TATGAAATTG CAAATCTCTC CTCCAAAGAT CGGCTGTCCC
AACAGGTGGA GAGATCCTGT TTTCCAGGAA TTTTTATGGG GTTACGAAGC GTTTGACGAG
GCTGAAAGAG GGTGGGCAGA GTATAAGAAG GGAAAGATTG CGAAGTACAC CTTCGATCTG
CTTTACGATG AAGTGAAATT ATAG
 
Protein sequence
MTITQSRIPT SSISLHFIVV GAGLGGVTAS IALTLAGHKV TVLEAAPILG EVGAGIQIPP 
PSVKILQAIG ALDEVLSNAT FPREFQIHSW KEGKIISKQN LIPYTMEKYN SHYLHIHRAD
YHRALVNRAI EVGVEIVLDA RVNHIDFEKA TVATARGEVY SGDVVVGFDG IKSRLRSFIL
GYEDLPYDTG DLAYRAIIKV SEMKKDPELV PFIEEPNIHF WWGPRCHVVL YLLQGGESCN
VVILTPDTLP KDEAVQPAKV EELLELFKDW DPRLNSIFKN IHSTSKWRLQ NSRELSTWTH
EEGNVIILGD ASHATLPYLA SGASQALEDA AVLAGLFGRI EHRGQIHDLL NLTESLRKWR
STQVVQGSTQ CRNIYHLPDG KEQQLRDMKL QISPPKIGCP NRWRDPVFQE FLWGYEAFDE
AERGWAEYKK GKIAKYTFDS LYDEVKL