Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31126 |
Symbol | NHG1.1 |
ID | 4838309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1830970 |
End bp | 1832205 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389624 |
Product | Salicylate hydroxylase (Salicylate 1-monooxygenase) |
Protein accession | XP_001383962 |
Protein GI | 126134875 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCAA CTCATTTCAA CATTATTATC TGCGGTGGTG GTATTGGAGG CATTGCTGCT TCCATCGGCT TGAGGAAGAA AGGTCACAAT GTTACTATTT TAGAAGGTTC ATCATCTTTA AGTGAAGTTG GTGCTGGTAT CCAAATGCCT CCCAACTCCG TTCATGTCCT TAAGGAGTAT GGTATCTTCG ACAAATTCTT GCCTTACATT ACCAGGCCAA AAAACATTTG TCTCAGAAGA TATGACAATG GTAAGGTTCT CTCCATGACT CCATTGGATC CAGAGATGAC AAAGTCCTAT GGTAACCCAT ACGTTTTGAT TCATCGTGCT GATTACCAAA GAATCTTGCA CGAATCTGCA CTAGAACTTG GAGTTGAATA CAAACTCAAC TCTAGAATAG CTTCCGTTGA CCAAGAAGCA GTTACTGTTA CTATGGTTGA TGGTAATATC CACCAAGCTG ACATCATTGT TGGAGCTGAT GGTATTCGTT CGAAGGTTAG AGATAGTGCA GTAGTTACCG AAGAAAAAGT ACTACCACTT AAATGTTCAA ACTGTGCCTA CAGAGCTACT ATTCCAAGGG AAGTAATGTT GTCTGACCCA GAAATTGCTC ATCTTATGAC TGATGTCAAC TCCAACTGCT GGATAGGTTA CAGACGACAC ATTATGGCAT ACCCAATAAG AAATGGAGAG CTATACAATA TGGTGTTGTG TCATCCTGGT GAAGCCTCTG TTGGTGTCTG GAATGAACCT GGCGATGTAG AAGAAATGAG ACACCACTAC AGAAACTTCG ATCCCATTGT TGTTAGACTT TTATCAAAGG TTCAATCTGT TCTCAAGTGG GTTCTTGCCG ATTTGCCTAT GCTTCCACGT TATGTCAGTG AGAGCGGCAA AGTCGTTTTG ATTGGAGATG CAGCTCATGC TATGTTGCCA TATTTGGCTC AAGGGGCTGC TCAAGCTATT GAAGACGGCG CTACCTTGGC AGACGAAATC AACATGTGCA AGTCCAGCAG CGACATCCCA GCTGCTCTTA AAAACTACCA AAAGAGAAGA AAGAGAAGAG TTGAGGCTGT TCAAGCAGGT GCTCACAAAA ATGGTCACGT CTGGCACTTA CCAGATGGTG AAGAGCAAGA AGAAAGAGAT ACTAAAATGA TGAAGAGAGA TGATAATAAC CCAGATCAAT GGTCTGATAT TGAATACCAA CGCTGGCTCT TTGGATGGAA CGCTTTTATT GATTAG
|
Protein sequence | MTATHFNIII CGGGIGGIAA SIGLRKKGHN VTILEGSSSL SEVGAGIQMP PNSVHVLKEY GIFDKFLPYI TRPKNICLRR YDNGKVLSMT PLDPEMTKSY GNPYVLIHRA DYQRILHESA LELGVEYKLN SRIASVDQEA VTVTMVDGNI HQADIIVGAD GIRSKVRDSA VVTEEKVLPL KCSNCAYRAT IPREVMLSDP EIAHLMTDVN SNCWIGYRRH IMAYPIRNGE LYNMVLCHPG EASVGVWNEP GDVEEMRHHY RNFDPIVVRL LSKVQSVLKW VLADLPMLPR YVSESGKVVL IGDAAHAMLP YLAQGAAQAI EDGATLADEI NMCKSSSDIP AALKNYQKRR KRRVEAVQAG AHKNGHVWHL PDGEEQEERD TKMMKRDDNN PDQWSDIEYQ RWLFGWNAFI D
|
| |