Gene Information |
Locus tag | PICST_28907 |
Symbol | NHG4 |
ID | 4851646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2439460 |
End bp | 2440800 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393354 |
Product | Salicylate hydroxylase (Salicylate 1-monooxygenase) |
Protein accession | XP_001386800 |
Protein GI | 126275143 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.283926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.292215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTG GAGAAGAGCC TTTCGATATT GCTGTTATCG GCTCTGGTCT AGCTGGAACT TTCGCAACTA TTGCCTTATC GCATTTGCCC AATGTGAAAA TCACTTCCTA TGAGAAAACG GATGCCCCTA AAGAAGTGGG AGCCTGGATT TCGCTTACCA ATGCTACGTT CGATGTACTC TCTAACTTTG TAGACATAAA CAGCCTCAAT CGCATTGCTT TCAAGGGGGA CACCAACAAC GAGTATCTCA CTCGACACTG GAAAACAGGT GAAGTCATCT TCCGCCAACC GACCTTCAAC CGTCCTAGAC CTTTTGTAGA AGCCAGAACT CATAGAATCC CTTTGCATGA TCTTCTTCTC AGCTATGTTC CACCAGATGT CATTCATTAT AGCCATGACG TCAAGAATCT AAGTTTGCAA TCTGACGGTA CCATTATCAA CTTCACTGAT GGCACTTCCA GCAGGAAGCA CGATTTGGTA GTTGTTGCTG ATGGGATCTA TTCGAGAATC AGACGTCAGT TTTATCCAGA TGGCAAGATC AAATACAAAG GCTTGGTGGC CTACAGAAGT GTGTTTCCAG CATCATTGGT ATCGCACCTC GAAGTCAAGG AAGACACTTC AGTGTGGGTC AAAGATGGAA CGGTAATTTT CCTTTCCAGG CTTGGACTAG ACCAGTATGG AATCGTGGCT ATTCTAAGTG AACCAGAGTC TACGGCTTCT CAATTGAGCT GGGACAAGTC TACTGGGAAC TGGGGCAAAC ATCGCCTTGT AGAGCATTTC GCAGAGTGGG ACCCCTACAT CAACGATGTC ATCAAGTCGA TTCCAGAAAT CAGGGCTTAT CCTTTGGAAC AGGCTCCTTG GCTCAGCAAT TTGGTCATAG AAGATAAAAT TGTCTTTATC GGAGATGCTG GTCATCCAAC GTCTGGTATC TACGGTTCTG GGGCTTCGTT TGGATTCTCT GATGTTTGGG CGCTTTATAG AGCCTTGCAA GAAACATCGT CCAACTATTG GATTAAGAAC AATACCCCTA CGTTTAAATA CAATGCCAAA TTGGCCCTAT TCCTCTTTAA TGAGACGAGA AGATATTTCT TGCAACGGGT AGAACAGCAG GTGTCAATTG ACTCTCAGGT CAAAAGGGAG AATCTAACTG AAATAGACGA CAAGGAATGG ACAGACCGTT ATCTTATCGT CAGAGCAGGG GGTGATTGGA TCCGGTCTCA TAATGTAGAA CTAGAATTCC AGAAAGTTAG AGATCAATAT TTGAACTTGC TTCAAAAGAA TGTCACCAGC TTCAAAACAT CTGGGAAGGG ATTGTCTACG TTGGATTTAC CTAAACTCTA G
|
Protein sequence | MTAGEEPFDI AVIGSGLAGT FATIALSHLP NVKITSYEKT DAPKEVGAWI SLTNATFDVL SNFVDINSLN RIAFKGDTNN EYLTRHWKTG EVIFRQPTFN RPRPFVEART HRIPLHDLLL SYVPPDVIHY SHDVKNLSLQ SDGTIINFTD GTSSRKHDLV VVADGIYSRI RRQFYPDGKI KYKGLVAYRS VFPASLVSHL EVKEDTSVWV KDGTVIFLSR LGLDQYGIVA ILSEPESTAS QLSWDKSTGN WGKHRLVEHF AEWDPYINDV IKSIPEIRAY PLEQAPWLSN LVIEDKIVFI GDAGHPTSGI YGSGASFGFS DVWALYRALQ ETSSNYWIKN NTPTFKYNAK LALFLFNETR RYFLQRVEQQ VSIDSQVKRE NLTEIDDKEW TDRYLIVRAG GDWIRSHNVE LEFQKVRDQY LNLLQKNVTS FKTSGKGLST LDLPKL
|
| |
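The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene sequence includes a single terminal stop codon (the listed sequence ends in TAG). The function names are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 2439460
END_BP = 2440800
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; spaces and newlines are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the Gene sequence string (with its spacing left in place) to `gc_percent` gives a value that can be compared with the GC content reported in the table.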