Gene Information |
Locus tag | PICST_56402 |
Symbol | NHG1.2 |
ID | 4837441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1732573 |
End bp | 1733853 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388756 |
Product | Salicylate hydroxylase (Salicylate 1-monooxygenase) |
Protein accession | XP_001382556 |
Protein GI | 126132062 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.476468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.198075 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCAAACAAAC CATCAATTAT TCAATTTAGA ACGCTTCAAC AAAAAATGAC TGCAACTCCT TTCAACATTA TTATCTGCGG TGGTGGTATC GGTGGTATTG CTGCTGCCAT TGGCTTAAGA AAGAAGGGCC ACAACGTAAC CATCTTAGAA GGTTCTTCAG CCTTAAACGA AGTTGGTGCT GGTATCCAAA TGCCTCCTAA TTCTGTTCGT GTCCTTAAGG AATATGGTAT CTTTGACAAA TTCTTGCCTT ACATTACCAG ACCTCAAAAT ATTTGTCTTA GAAGGTATGA CAATGGAAAC ATTCTCTCCA TGACTCCATT GGATCCAGAG ATGACAAAGT CTTACGGCAA CCCATACGTT TTGATCCACC GTGCTGATTA CCAGAGAATC TTGTATGAAT CTGCTGTGGA GCTTGGAGTT GACTACAAAG TTAACTCAAG AATCTCGTCG GTTGACCAGG AAGCAGTGAC TGTTACTTTA GTTGATGGCA CTGTACACCA TGCTGACTTC ATTGTCGGTG CTGATGGAAT TCGTTCTAAG GTTAGAGACA CTGCTGTGGT TCCCGAAGAA AAGGTTTTGC CAGTAAAGAG TTCCAACTGT GCCTACAGAG CTACCATACC AAAGGAAGTC ATGTTGGCAG ACCCCGAAGT TGCTTATCTT ATGTCTGATG TCAACTCCAA CTGCTGGATC GGTTACAGAA GACACGTCAT GGCTTACCCA ATAAGAAATG GTGAATTGTA CAACATGGTG TTATGCCACC CTGGCGAGGC TACTGTTGGT GTTTGGAACG AACCAGGCAA TCTTGAGGAA ATGAGAAACC ACTACAAAAA TTTCGATCCT GTTGTGGTCA AGCTTTTGTC CAAGGTTCAA TCTGTTCTCA AATGGGTTCT TGCAGACTTG CCTACTCTTC CTCGTTTTGT CAGCGAGAGC GGAAAGGTTG TTTTGATCGG AGACGCTGCC CATGCAATGT TACCTTACTT GGCTCAAGGC GCTGCACAAG CAATCGAAGA CGGAGCTACA TTAGCTGACG AAATCAGCAA GTGCAGCTCC ACCAAGGAAA TTCCTCAAGC TCTCCAAAAC TATCAAAAGA GAAGAAAGAG AAGAGTGCAA GCTGTTCAAG CTGGTGCTCA AAACAATGGT AAAGTCTGGC ACTTGCCAGA TGGCGAAGAA CAAGAAGAAA GAGATGCCAA AATGAAGAAG AGAGATGACA ACAACCCAGA TCAATGGTCT GACATTGAAT ACCAACGCTG GCTCTTCGGT TGGAACGCTT TTACTGATTA G
|
Protein sequence | PNKPSIIQFR TLQQKMTATP FNIIICGGGI GGIAAAIGLR KKGHNVTILE GSSALNEVGA GIQMPPNSVR VLKEYGIFDK FLPYITRPQN ICLRRYDNGN ILSMTPLDPE MTKSYGNPYV LIHRADYQRI LYESAVELGV DYKVNSRISS VDQEAVTVTL VDGTVHHADF IVGADGIRSK VRDTAVVPEE KVLPVKSSNC AYRATIPKEV MLADPEVAYL MSDVNSNCWI GYRRHVMAYP IRNGELYNMV LCHPGEATVG VWNEPGNLEE MRNHYKNFDP VVVKLLSKVQ SVLKWVLADL PTLPRFVSES GKVVLIGDAA HAMLPYLAQG AAQAIEDGAT LADEISKCSS TKEIPQALQN YQKRRKRRVQ AVQAGAQNNG KVWHLPDGEE QEERDAKMKK RDDNNPDQWS DIEYQRWLFG WNAFTD
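The record lists translation table 12 (the alternative yeast nuclear code, in which CTG is read as Ser rather than Leu), and its coordinates and lengths can be cross-checked with simple arithmetic. The following is a minimal Python sketch of both checks, not the database's own method. The helper names (`translate`, `gene_length`, `protein_length`) are hypothetical and introduced only for illustration. The protein-length formula assumes an unspliced CDS whose final codon is a stop codon.

```python
# NCBI translation table 12 (alternative yeast nuclear code), written in the
# usual TCAG codon order. It differs from the standard code only at CTG,
# which is read as Ser (S) instead of Leu (L).
BASES = "TCAG"
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TO_AA = {
    a + b + c: TABLE_12[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1 under table 12, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used in the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TO_AA[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS that ends in a stop codon (assumption)."""
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates taken from the record above.
    n_bp = gene_length(1732573, 1733853)
    print(n_bp, protein_length(n_bp))
    # First three 10-base blocks of the gene sequence in the record.
    print(translate("CCAAACAAAC CATCAATTAT TCAATTTAGA"))
```

With the values in this record, the coordinate arithmetic gives 1281 bp and 426 aa, matching the Gene Length and Protein Length fields, and the first 30 bases translate to PNKPSIIQFR, the first ten residues of the listed protein sequence.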