Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_50409 |
Symbol | NHG2 |
ID | 4840879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 241978 |
End bp | 243321 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640392194 |
Product | Salicylate hydroxylase (Salicylate 1-monooxygenase) |
Protein accession | XP_001386445 |
Protein GI | 150866748 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0406434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATTA CACAGTCAAG AATTCCAACT TCTTCCATTT CCCTACATTT CATTGTTGTT GGTGCTGGTT TGGGTGGAGT AACGGCTTCG ATTGCTCTCA CCTTAGCTGG GCATAAGGTG ACCGTTTTAG AAGCTGCTCC CATATTAGGC GAAGTTGGTG CTGGGATACA GATTCCTCCT CCTTCAGTTA AGATTCTTCA GGCTATTGGA GCCTTGGACG AAGTACTCTC CAATGCTACT TTTCCTCGTG AGTTTCAGAT TCATAGTTGG AAAGAAGGCA AGATTATTTC CAAACAGAAC TTGATCCCCT ACACTATGGA AAAGTACAAT TCTCACTACT TGCACATTCA TCGGGCCGAC TACCATCGAG CCTTGGTTAA CCGAGCCATT GAAGTCGGAG TTGAGATTGT TTTGGATGCC CGGGTCAACC ACATTGATTT TGAAAAGGCC ACCGTTGCTA CTGCTCGTGG AGAAGTTTAT TCTGGGGACG TCGTAGTTGG ATTCGATGGC ATAAAGTCTC GCTTGAGATC CTTCATTTTG GGCTACGAAG ACTTACCTTA CGACACAGGC GATCTTGCCT ATAGGGCAAT AATCAAAGTA AGCGAAATGA AGAAGGATCC GGAGCTTGTT CCTTTCATTG AAGAACCAAA CATACACTTC TGGTGGGGTC CTCGATGCCA TGTTGTATTA TATTTATTGC AAGGCGGTGA GAGTTGCAAT GTTGTCATAC TTACACCAGA TACCTTGCCT AAAGATGAAG CAGTTCAGCC TGCCAAGGTA GAGGAGTTAT TGGAGTTGTT TAAGGACTGG GATCCAAGAT TGAACTCCAT CTTTAAAAAC ATACATAGTA CCAGCAAATG GAGATTGCAA AATTCAAGAG AATTATCCAC ATGGACACAT GAAGAAGGAA ATGTGATCAT CTTGGGAGAT GCATCGCATG CCACTTTGCC CTACTTGGCA TCCGGAGCCT CTCAAGCCTT GGAAGATGCT GCGGTATTGG CAGGTTTGTT CGGCAGAATT GAACATCGGG GCCAGATTCA TGATCTTCTC AATCTAACGG AATCTTTGAG AAAATGGAGA TCTACTCAAG TTGTCCAAGG TTCCACTCAA TGCAGAAATA TCTACCATTT ACCAGATGGT AAAGAACAAC AGTTAAGAGA TATGAAATTG CAAATCTCTC CTCCAAAGAT CGGCTGTCCC AACAGGTGGA GAGATCCTGT TTTCCAGGAA TTTTTATGGG GTTACGAAGC GTTTGACGAG GCTGAAAGAG GGTGGGCAGA GTATAAGAAG GGAAAGATTG CGAAGTACAC CTTCGATCTG CTTTACGATG AAGTGAAATT ATAG
|
Protein sequence | MTITQSRIPT SSISLHFIVV GAGLGGVTAS IALTLAGHKV TVLEAAPILG EVGAGIQIPP PSVKILQAIG ALDEVLSNAT FPREFQIHSW KEGKIISKQN LIPYTMEKYN SHYLHIHRAD YHRALVNRAI EVGVEIVLDA RVNHIDFEKA TVATARGEVY SGDVVVGFDG IKSRLRSFIL GYEDLPYDTG DLAYRAIIKV SEMKKDPELV PFIEEPNIHF WWGPRCHVVL YLLQGGESCN VVILTPDTLP KDEAVQPAKV EELLELFKDW DPRLNSIFKN IHSTSKWRLQ NSRELSTWTH EEGNVIILGD ASHATLPYLA SGASQALEDA AVLAGLFGRI EHRGQIHDLL NLTESLRKWR STQVVQGSTQ CRNIYHLPDG KEQQLRDMKL QISPPKIGCP NRWRDPVFQE FLWGYEAFDE AERGWAEYKK GKIAKYTFDS LYDEVKL
|
| |