Gene PICST_31126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31126 
SymbolNHG1.1 
ID4838309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1830970 
End bp1832205 
Gene Length1236 bp 
Protein Length411 aa 
Translation table12 
GC content42% 
IMG OID640389624 
ProductSalicylate hydroxylase (Salicylate 1-monooxygenase) 
Protein accessionXP_001383962 
Protein GI126134875 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCAA CTCATTTCAA CATTATTATC TGCGGTGGTG GTATTGGAGG CATTGCTGCT 
TCCATCGGCT TGAGGAAGAA AGGTCACAAT GTTACTATTT TAGAAGGTTC ATCATCTTTA
AGTGAAGTTG GTGCTGGTAT CCAAATGCCT CCCAACTCCG TTCATGTCCT TAAGGAGTAT
GGTATCTTCG ACAAATTCTT GCCTTACATT ACCAGGCCAA AAAACATTTG TCTCAGAAGA
TATGACAATG GTAAGGTTCT CTCCATGACT CCATTGGATC CAGAGATGAC AAAGTCCTAT
GGTAACCCAT ACGTTTTGAT TCATCGTGCT GATTACCAAA GAATCTTGCA CGAATCTGCA
CTAGAACTTG GAGTTGAATA CAAACTCAAC TCTAGAATAG CTTCCGTTGA CCAAGAAGCA
GTTACTGTTA CTATGGTTGA TGGTAATATC CACCAAGCTG ACATCATTGT TGGAGCTGAT
GGTATTCGTT CGAAGGTTAG AGATAGTGCA GTAGTTACCG AAGAAAAAGT ACTACCACTT
AAATGTTCAA ACTGTGCCTA CAGAGCTACT ATTCCAAGGG AAGTAATGTT GTCTGACCCA
GAAATTGCTC ATCTTATGAC TGATGTCAAC TCCAACTGCT GGATAGGTTA CAGACGACAC
ATTATGGCAT ACCCAATAAG AAATGGAGAG CTATACAATA TGGTGTTGTG TCATCCTGGT
GAAGCCTCTG TTGGTGTCTG GAATGAACCT GGCGATGTAG AAGAAATGAG ACACCACTAC
AGAAACTTCG ATCCCATTGT TGTTAGACTT TTATCAAAGG TTCAATCTGT TCTCAAGTGG
GTTCTTGCCG ATTTGCCTAT GCTTCCACGT TATGTCAGTG AGAGCGGCAA AGTCGTTTTG
ATTGGAGATG CAGCTCATGC TATGTTGCCA TATTTGGCTC AAGGGGCTGC TCAAGCTATT
GAAGACGGCG CTACCTTGGC AGACGAAATC AACATGTGCA AGTCCAGCAG CGACATCCCA
GCTGCTCTTA AAAACTACCA AAAGAGAAGA AAGAGAAGAG TTGAGGCTGT TCAAGCAGGT
GCTCACAAAA ATGGTCACGT CTGGCACTTA CCAGATGGTG AAGAGCAAGA AGAAAGAGAT
ACTAAAATGA TGAAGAGAGA TGATAATAAC CCAGATCAAT GGTCTGATAT TGAATACCAA
CGCTGGCTCT TTGGATGGAA CGCTTTTATT GATTAG
 
Protein sequence
MTATHFNIII CGGGIGGIAA SIGLRKKGHN VTILEGSSSL SEVGAGIQMP PNSVHVLKEY 
GIFDKFLPYI TRPKNICLRR YDNGKVLSMT PLDPEMTKSY GNPYVLIHRA DYQRILHESA
LELGVEYKLN SRIASVDQEA VTVTMVDGNI HQADIIVGAD GIRSKVRDSA VVTEEKVLPL
KCSNCAYRAT IPREVMLSDP EIAHLMTDVN SNCWIGYRRH IMAYPIRNGE LYNMVLCHPG
EASVGVWNEP GDVEEMRHHY RNFDPIVVRL LSKVQSVLKW VLADLPMLPR YVSESGKVVL
IGDAAHAMLP YLAQGAAQAI EDGATLADEI NMCKSSSDIP AALKNYQKRR KRRVEAVQAG
AHKNGHVWHL PDGEEQEERD TKMMKRDDNN PDQWSDIEYQ RWLFGWNAFI D