Gene PICST_56402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_56402 
SymbolNHG1.2 
ID4837441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1732573 
End bp1733853 
Gene Length1281 bp 
Protein Length426 aa 
Translation table12 
GC content44% 
IMG OID640388756 
ProductSalicylate hydroxylase (Salicylate 1-monooxygenase) 
Protein accessionXP_001382556 
Protein GI126132062 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.476468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.198075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCAAACAAAC CATCAATTAT TCAATTTAGA ACGCTTCAAC AAAAAATGAC TGCAACTCCT 
TTCAACATTA TTATCTGCGG TGGTGGTATC GGTGGTATTG CTGCTGCCAT TGGCTTAAGA
AAGAAGGGCC ACAACGTAAC CATCTTAGAA GGTTCTTCAG CCTTAAACGA AGTTGGTGCT
GGTATCCAAA TGCCTCCTAA TTCTGTTCGT GTCCTTAAGG AATATGGTAT CTTTGACAAA
TTCTTGCCTT ACATTACCAG ACCTCAAAAT ATTTGTCTTA GAAGGTATGA CAATGGAAAC
ATTCTCTCCA TGACTCCATT GGATCCAGAG ATGACAAAGT CTTACGGCAA CCCATACGTT
TTGATCCACC GTGCTGATTA CCAGAGAATC TTGTATGAAT CTGCTGTGGA GCTTGGAGTT
GACTACAAAG TTAACTCAAG AATCTCGTCG GTTGACCAGG AAGCAGTGAC TGTTACTTTA
GTTGATGGCA CTGTACACCA TGCTGACTTC ATTGTCGGTG CTGATGGAAT TCGTTCTAAG
GTTAGAGACA CTGCTGTGGT TCCCGAAGAA AAGGTTTTGC CAGTAAAGAG TTCCAACTGT
GCCTACAGAG CTACCATACC AAAGGAAGTC ATGTTGGCAG ACCCCGAAGT TGCTTATCTT
ATGTCTGATG TCAACTCCAA CTGCTGGATC GGTTACAGAA GACACGTCAT GGCTTACCCA
ATAAGAAATG GTGAATTGTA CAACATGGTG TTATGCCACC CTGGCGAGGC TACTGTTGGT
GTTTGGAACG AACCAGGCAA TCTTGAGGAA ATGAGAAACC ACTACAAAAA TTTCGATCCT
GTTGTGGTCA AGCTTTTGTC CAAGGTTCAA TCTGTTCTCA AATGGGTTCT TGCAGACTTG
CCTACTCTTC CTCGTTTTGT CAGCGAGAGC GGAAAGGTTG TTTTGATCGG AGACGCTGCC
CATGCAATGT TACCTTACTT GGCTCAAGGC GCTGCACAAG CAATCGAAGA CGGAGCTACA
TTAGCTGACG AAATCAGCAA GTGCAGCTCC ACCAAGGAAA TTCCTCAAGC TCTCCAAAAC
TATCAAAAGA GAAGAAAGAG AAGAGTGCAA GCTGTTCAAG CTGGTGCTCA AAACAATGGT
AAAGTCTGGC ACTTGCCAGA TGGCGAAGAA CAAGAAGAAA GAGATGCCAA AATGAAGAAG
AGAGATGACA ACAACCCAGA TCAATGGTCT GACATTGAAT ACCAACGCTG GCTCTTCGGT
TGGAACGCTT TTACTGATTA G
 
Protein sequence
PNKPSIIQFR TLQQKMTATP FNIIICGGGI GGIAAAIGLR KKGHNVTILE GSSALNEVGA 
GIQMPPNSVR VLKEYGIFDK FLPYITRPQN ICLRRYDNGN ILSMTPLDPE MTKSYGNPYV
LIHRADYQRI LYESAVELGV DYKVNSRISS VDQEAVTVTL VDGTVHHADF IVGADGIRSK
VRDTAVVPEE KVLPVKSSNC AYRATIPKEV MLADPEVAYL MSDVNSNCWI GYRRHVMAYP
IRNGELYNMV LCHPGEATVG VWNEPGNLEE MRNHYKNFDP VVVKLLSKVQ SVLKWVLADL
PTLPRFVSES GKVVLIGDAA HAMLPYLAQG AAQAIEDGAT LADEISKCSS TKEIPQALQN
YQKRRKRRVQ AVQAGAQNNG KVWHLPDGEE QEERDAKMKK RDDNNPDQWS DIEYQRWLFG
WNAFTD