Gene PICST_43564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_43564 
SymbolSAL1 
ID4838216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1576826 
End bp1577983 
Gene Length1158 bp 
Protein Length386 aa 
Translation table12 
GC content38% 
IMG OID640389531 
ProductAromatic-ring hydroxylase 
Protein accessionXP_001383586 
Protein GI150864661 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.193664 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAT GGGACCAAGG TAAGATATTA ATTTCAGGAG CAGGAGTTGT TGGTTTGTTG 
CTAGCACAAT CATTAAAGAA ACTAAACATT CCATACGAAA TATTCGATAG AGATGAGTCA
ATAAGCGCTA GAGGTCAAGG TTGGGGTATC ACAATTCATT GGGCATTGAA CGATATGTTG
AGCATGCTAC CAGAAGATTT AATCAAATCT GTGTACGATG CTCAAGTTTA CGAAAATTTC
CATGAGAATG ACAATGGTAA TTTCATTTAT ATCAATGGCT CAAACGGTAT CCCAGTTGTT
AATATCCCAC CTGCTCCAAG ATTAAGGGTC AGAAGAGAAG AGTTAAGAGT AATCTTGTCA
ACTGGTATTG ATGTCAATTG GGGTTGTCAG ACCATTAACA TTGAGACTGA TGACGATGCA
ACTGATATCA ACAAGAGAGT TACAGTCACA TGTAAGAATG GCAAAGTATT TCAAGGTGGA
ATATTGATGG GTATCGAAGG TTCCAAATCG GTCACAAGAT CAATTACAAA TCCTACCAAT
CATGAATTGC AATACTTACC GATTAGATTT ATTGGTGGTA CCATTGAGTT ATCGGAAGAT
GAATATAGGA AGATGGCAAC AACTTTCAGT CCATTGTTGT TTCAGGGCAC AATTCCACAA
ACTGAATCGT TCTTTTGGTA TTCCCTCTTA GCAACACCAA AATACACTAA AAACGGAACT
TACAAATCAC AGATCATGAT GTCATGGAAA AATAATCCGG ATGAACCATT TGATACACCA
GAAGAAAAAT ATGCCCTGAT TAAAATCCAT TCAAAAGGAT TAGACCCTCG ATTACAATAC
ATGATTGATT ACTTGGACAA GAGCACCGGG TCTTTCATGG AGTTGCAATT GGCCGATTGG
CCAATATGCG ATTGGAACGA TTTCAATAAT AAGATCTTAT TGATGGGTGA TGCATGCCAT
GCGATGACGA TGTATAGGGG TGAAGCTTGC AATCATGGCA TTGCTGATGT TAAACTATTC
ACAAACTTGT GTGAAAGCTT GTTGAATGGT AAAATAGATT GGAATGCGGT TGTTGGGAAG
TATAAAACTT CGATAAAAGA TAGATGTTCA GAAGCTGTTC TTCTATCGAG ACAGGCTTGT
ATCGATGCTC ATGATTGG
 
Protein sequence
MTAWDQGKIL ISGAGVVGLL LAQSLKKLNI PYEIFDRDES ISARGQGWGI TIHWALNDML 
SMLPEDLIKS VYDAQVYENF HENDNGNFIY INGSNGIPVV NIPPAPRLRV RREELRVILS
TGIDVNWGCQ TINIETDDDA TDINKRVTVT CKNGKVFQGG ILMGIEGSKS VTRSITNPTN
HELQYLPIRF IGGTIELSED EYRKMATTFS PLLFQGTIPQ TESFFWYSLL ATPKYTKNGT
YKSQIMMSWK NNPDEPFDTP EEKYASIKIH SKGLDPRLQY MIDYLDKSTG SFMELQLADW
PICDWNDFNN KILLMGDACH AMTMYRGEAC NHGIADVKLF TNLCESLLNG KIDWNAVVGK
YKTSIKDRCS EAVLLSRQAC IDAHDW