Gene PICST_31033 details


Gene Information

Locus tag: PICST_31033
Symbol: NHG3
ID: 4838214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1569924
End bp: 1571162
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 12
GC content: 40%
IMG OID: 640389529
Product: Salicylate hydroxylase (Salicylate 1-monooxygenase)
Protein accession: XP_001383923
Protein GI: 126134797
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
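
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1569924, 1571162)
print(length)                           # 1239
print(expected_protein_length(length))  # 412
```

Under these assumptions the computed values agree with the listed gene length (1239 bp) and protein length (412 aa).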


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.261149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.11275
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAGG AATGGAAACC ATTAAACGTC GCAGTATTAG GTGCTGGATT AGGTGGTCTT 
GCTGCAGCCA TTGCCATGAG AAGAAATGGT CATACTGTTA CAGTTTATGA AAGATATCAC
TTTGCTGGTG AAGTTGGTGC CTCATTATCG TGCGCATCAA ATGGTGGTAA ACATTTGAAG
GAATGGGGAA TTGACTTTGA TGCCGCCAAA CCAATCATCT TGAAGGATTT AATCAGGCAC
GATTGGAAGA CGGGCGAAAT CGAGGGTGTT TACAGCTTGG GAGACTATGA GAAGGCTTTT
GGAACTCCAT ACTATAACTT TCACAGAATA GACATTCATA ATGTTTTGAT GGATACTGCA
ACACAAGAAA AGGGGGAGGG TACACCATGT AAGTTGTTAG TTGATTATAA AGTCATTGAT
GTCAACCATG AAAGTGGGCA CATGGTCTTT GAAAACGGCA AAGAAGCATA TGCTGACTTG
ATTATTGCAG CAGATGGCAT CAGATCTACA ACAAGAGAGA AAATTGGTGT CATACCAGAA
TTTGGGATTT CAACCTCATG CTGTTACAGA TGCTTATTTA GGACAGAAGA CGTTCATAAG
TTGGGATTAA AGGATTTTTC AAAGAATGAA GCCATTGAAT TTTGGGGTGG TAACGACAAA
AATAAGATTG TTTTGTCTCC ATGTTCAGAT GGTGAAATTG TTTCATGTTA CTGCTTCTAT
CCGGCTGAGA TCAATGACTT GCGTGAAGAT GGTTGGAACA ATGAAGCTAC TCCTGAGCAA
TTATTAGCCA CATTTCCAGA GTTAGATGGT GCCTTGAAGG AACTATTCAA GATTGCGTTT
GATATCAAAC AATGGAGACT ATATGTTCAC AAACAGTATC CATATTGGGT CAAGGGAAAA
GTCGGCTTAT TAGGTGACGC TGCCCACCCT CAAATGCCAG ATCAATCTCA AGGTGCAGTG
ATGGCATTTG AAGATGCAGC TGCCTTTGGT TACATTTTCA GCAAAATGTT TAATTTTTCC
CCACAAGATG GATTAAAAGT GTATCAATCT GTTAGACAAC CAAGAGCCAA CAAAATTCAA
GCCGCTTCAT TGAGAGCTAG AGAAAACTTG AATGAAAGAA TCGGTTGGTC TTCGGGAGCT
GCTGATTTGA AAGATGAAAA CAGGCTTACA ATCGAAGAAG TTTGTTCGTA TAATATCAAG
GCTGATATTG ATCAAATTGT CAAGAACATG GGTCTTTGA
 
Protein sequence
MTKEWKPLNV AVLGAGLGGL AAAIAMRRNG HTVTVYERYH FAGEVGASLS CASNGGKHLK 
EWGIDFDAAK PIILKDLIRH DWKTGEIEGV YSLGDYEKAF GTPYYNFHRI DIHNVLMDTA
TQEKGEGTPC KLLVDYKVID VNHESGHMVF ENGKEAYADL IIAADGIRST TREKIGVIPE
FGISTSCCYR CLFRTEDVHK LGLKDFSKNE AIEFWGGNDK NKIVLSPCSD GEIVSCYCFY
PAEINDLRED GWNNEATPEQ LLATFPELDG ALKELFKIAF DIKQWRLYVH KQYPYWVKGK
VGLLGDAAHP QMPDQSQGAV MAFEDAAAFG YIFSKMFNFS PQDGLKVYQS VRQPRANKIQ
AASLRARENL NERIGWSSGA ADLKDENRLT IEEVCSYNIK ADIDQIVKNM GL
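
The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in that CTG encodes serine rather than leucine. The sketch below shows one way to translate the gene sequence under that table using only the Python standard library. The `translate` helper and the variable names are hypothetical, and only the first 60 bases of the gene sequence above are used as input for brevity.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order (NCBI layout).
# Table 12 is the standard code with CTG (index 19) changed from L to S.
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TO_AA = {
    b1 + b2 + b3: TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TO_AA[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
first_60 = "ATGACAAAGG AATGGAAACC ATTAAACGTC GCAGTATTAG GTGCTGGATT AGGTGGTCTT"
print(translate(first_60))  # MTKEWKPLNVAVLGAGLGGL
```

The printed residues match the first 20 residues of the protein sequence above. The helper does not handle ambiguity codes such as N; a production pipeline would more likely rely on an established library.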