Gene PICST_28907 details


Gene Information

Locus tag: PICST_28907
Symbol: NHG4
ID: 4851646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2439460
End bp: 2440800
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table:
GC content: 44%
IMG OID: 640393354
Product: Salicylate hydroxylase (Salicylate 1-monooxygenase)
Protein accession: XP_001386800
Protein GI: 126275143
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
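
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2439460
end_bp = 2440800

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
assert gene_length % 3 == 0
codons = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1341 bp) and Protein Length (446 aa).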


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.283926
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.292215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCTG GAGAAGAGCC TTTCGATATT GCTGTTATCG GCTCTGGTCT AGCTGGAACT 
TTCGCAACTA TTGCCTTATC GCATTTGCCC AATGTGAAAA TCACTTCCTA TGAGAAAACG
GATGCCCCTA AAGAAGTGGG AGCCTGGATT TCGCTTACCA ATGCTACGTT CGATGTACTC
TCTAACTTTG TAGACATAAA CAGCCTCAAT CGCATTGCTT TCAAGGGGGA CACCAACAAC
GAGTATCTCA CTCGACACTG GAAAACAGGT GAAGTCATCT TCCGCCAACC GACCTTCAAC
CGTCCTAGAC CTTTTGTAGA AGCCAGAACT CATAGAATCC CTTTGCATGA TCTTCTTCTC
AGCTATGTTC CACCAGATGT CATTCATTAT AGCCATGACG TCAAGAATCT AAGTTTGCAA
TCTGACGGTA CCATTATCAA CTTCACTGAT GGCACTTCCA GCAGGAAGCA CGATTTGGTA
GTTGTTGCTG ATGGGATCTA TTCGAGAATC AGACGTCAGT TTTATCCAGA TGGCAAGATC
AAATACAAAG GCTTGGTGGC CTACAGAAGT GTGTTTCCAG CATCATTGGT ATCGCACCTC
GAAGTCAAGG AAGACACTTC AGTGTGGGTC AAAGATGGAA CGGTAATTTT CCTTTCCAGG
CTTGGACTAG ACCAGTATGG AATCGTGGCT ATTCTAAGTG AACCAGAGTC TACGGCTTCT
CAATTGAGCT GGGACAAGTC TACTGGGAAC TGGGGCAAAC ATCGCCTTGT AGAGCATTTC
GCAGAGTGGG ACCCCTACAT CAACGATGTC ATCAAGTCGA TTCCAGAAAT CAGGGCTTAT
CCTTTGGAAC AGGCTCCTTG GCTCAGCAAT TTGGTCATAG AAGATAAAAT TGTCTTTATC
GGAGATGCTG GTCATCCAAC GTCTGGTATC TACGGTTCTG GGGCTTCGTT TGGATTCTCT
GATGTTTGGG CGCTTTATAG AGCCTTGCAA GAAACATCGT CCAACTATTG GATTAAGAAC
AATACCCCTA CGTTTAAATA CAATGCCAAA TTGGCCCTAT TCCTCTTTAA TGAGACGAGA
AGATATTTCT TGCAACGGGT AGAACAGCAG GTGTCAATTG ACTCTCAGGT CAAAAGGGAG
AATCTAACTG AAATAGACGA CAAGGAATGG ACAGACCGTT ATCTTATCGT CAGAGCAGGG
GGTGATTGGA TCCGGTCTCA TAATGTAGAA CTAGAATTCC AGAAAGTTAG AGATCAATAT
TTGAACTTGC TTCAAAAGAA TGTCACCAGC TTCAAAACAT CTGGGAAGGG ATTGTCTACG
TTGGATTTAC CTAAACTCTA G
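
The gene sequence above is laid out in space-separated blocks of 10 bases, which must be removed before any computation. The following is a minimal sketch of how the layout could be stripped and the length and GC percentage computed. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, and the sample string is only the first line of the sequence, so its GC value is not expected to match the GC content reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small sample only.
sample = "ATGACTGCTG GAGAAGAGCC TTTCGATATT GCTGTTATCG GCTCTGGTCT AGCTGGAACT"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `sample` would allow the listed Gene Length and GC content fields to be checked in the same way.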
 
Protein sequence
MTAGEEPFDI AVIGSGLAGT FATIALSHLP NVKITSYEKT DAPKEVGAWI SLTNATFDVL 
SNFVDINSLN RIAFKGDTNN EYLTRHWKTG EVIFRQPTFN RPRPFVEART HRIPLHDLLL
SYVPPDVIHY SHDVKNLSLQ SDGTIINFTD GTSSRKHDLV VVADGIYSRI RRQFYPDGKI
KYKGLVAYRS VFPASLVSHL EVKEDTSVWV KDGTVIFLSR LGLDQYGIVA ILSEPESTAS
QLSWDKSTGN WGKHRLVEHF AEWDPYINDV IKSIPEIRAY PLEQAPWLSN LVIEDKIVFI
GDAGHPTSGI YGSGASFGFS DVWALYRALQ ETSSNYWIKN NTPTFKYNAK LALFLFNETR
RYFLQRVEQQ VSIDSQVKRE NLTEIDDKEW TDRYLIVRAG GDWIRSHNVE LEFQKVRDQY
LNLLQKNVTS FKTSGKGLST LDLPKL