Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31033 |
Symbol | NHG3 |
ID | 4838214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1569924 |
End bp | 1571162 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640389529 |
Product | Salicylate hydroxylase (Salicylate 1-monooxygenase) |
Protein accession | XP_001383923 |
Protein GI | 126134797 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.261149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.11275 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGG AATGGAAACC ATTAAACGTC GCAGTATTAG GTGCTGGATT AGGTGGTCTT GCTGCAGCCA TTGCCATGAG AAGAAATGGT CATACTGTTA CAGTTTATGA AAGATATCAC TTTGCTGGTG AAGTTGGTGC CTCATTATCG TGCGCATCAA ATGGTGGTAA ACATTTGAAG GAATGGGGAA TTGACTTTGA TGCCGCCAAA CCAATCATCT TGAAGGATTT AATCAGGCAC GATTGGAAGA CGGGCGAAAT CGAGGGTGTT TACAGCTTGG GAGACTATGA GAAGGCTTTT GGAACTCCAT ACTATAACTT TCACAGAATA GACATTCATA ATGTTTTGAT GGATACTGCA ACACAAGAAA AGGGGGAGGG TACACCATGT AAGTTGTTAG TTGATTATAA AGTCATTGAT GTCAACCATG AAAGTGGGCA CATGGTCTTT GAAAACGGCA AAGAAGCATA TGCTGACTTG ATTATTGCAG CAGATGGCAT CAGATCTACA ACAAGAGAGA AAATTGGTGT CATACCAGAA TTTGGGATTT CAACCTCATG CTGTTACAGA TGCTTATTTA GGACAGAAGA CGTTCATAAG TTGGGATTAA AGGATTTTTC AAAGAATGAA GCCATTGAAT TTTGGGGTGG TAACGACAAA AATAAGATTG TTTTGTCTCC ATGTTCAGAT GGTGAAATTG TTTCATGTTA CTGCTTCTAT CCGGCTGAGA TCAATGACTT GCGTGAAGAT GGTTGGAACA ATGAAGCTAC TCCTGAGCAA TTATTAGCCA CATTTCCAGA GTTAGATGGT GCCTTGAAGG AACTATTCAA GATTGCGTTT GATATCAAAC AATGGAGACT ATATGTTCAC AAACAGTATC CATATTGGGT CAAGGGAAAA GTCGGCTTAT TAGGTGACGC TGCCCACCCT CAAATGCCAG ATCAATCTCA AGGTGCAGTG ATGGCATTTG AAGATGCAGC TGCCTTTGGT TACATTTTCA GCAAAATGTT TAATTTTTCC CCACAAGATG GATTAAAAGT GTATCAATCT GTTAGACAAC CAAGAGCCAA CAAAATTCAA GCCGCTTCAT TGAGAGCTAG AGAAAACTTG AATGAAAGAA TCGGTTGGTC TTCGGGAGCT GCTGATTTGA AAGATGAAAA CAGGCTTACA ATCGAAGAAG TTTGTTCGTA TAATATCAAG GCTGATATTG ATCAAATTGT CAAGAACATG GGTCTTTGA
|
Protein sequence | MTKEWKPLNV AVLGAGLGGL AAAIAMRRNG HTVTVYERYH FAGEVGASLS CASNGGKHLK EWGIDFDAAK PIILKDLIRH DWKTGEIEGV YSLGDYEKAF GTPYYNFHRI DIHNVLMDTA TQEKGEGTPC KLLVDYKVID VNHESGHMVF ENGKEAYADL IIAADGIRST TREKIGVIPE FGISTSCCYR CLFRTEDVHK LGLKDFSKNE AIEFWGGNDK NKIVLSPCSD GEIVSCYCFY PAEINDLRED GWNNEATPEQ LLATFPELDG ALKELFKIAF DIKQWRLYVH KQYPYWVKGK VGLLGDAAHP QMPDQSQGAV MAFEDAAAFG YIFSKMFNFS PQDGLKVYQS VRQPRANKIQ AASLRARENL NERIGWSSGA ADLKDENRLT IEEVCSYNIK ADIDQIVKNM GL
|
| |