Gene PICST_65833 details


Gene Information

Locus tag: PICST_65833
Symbol: YOX1
ID: 4839280
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1562693
End bp: 1564198
Gene Length: 1506 bp
Protein Length: 440 aa
Translation table: 12
GC content: 45%
IMG OID: 640390595
Product: homeobox-domain containing protein
Protein accession: XP_001385298
Protein GI: 150865898
COG category: [K] Transcription
COG ID: [COG5576] Homeodomain-containing transcription factor
TIGRFAM ID:
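
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of the record. Under those assumptions, the difference between the gene span and the coding length is consistent with the gene being marked as spliced.

```python
# Values copied from the record above.
START_BP = 1562693
END_BP = 1564198
PROTEIN_LENGTH_AA = 440


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)    # matches "Gene Length"
cds_len = coding_length(PROTEIN_LENGTH_AA)  # coding nucleotides incl. stop
non_coding = gene_len - cds_len             # presumably intronic sequence

print(gene_len)    # 1506
print(cds_len)     # 1323
print(non_coding)  # 183
```

Attributing the 183 nt remainder to introns is an inference from the "Is gene spliced" field, not something the record states directly.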


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCTAC AGACACCAAA GAGGACAAGC ACCTTCCCTC CTACACCCTC TTCTTACGGT 
GCCAACAAGC CTAACTTGCC TCCATTGTCA TCGTTGTTGT CGTCGACTCC TTCCAGCAAG
GACCCTGCCA TAACTTCGAC TCCATACATG CCAACAAAGT TGCCTTCCAT CGACTCCTTC
AGCACTACGC CAACTTCCTC GAGATACTTC CAGTCGAATC AAAGACCATT TGGTTCCTAT
GCCTCTTTTC CTACACCTCC AACCTCAAGA GGTTCGATAA CAGGAAACTA CTCTGTCGAC
ACCAGCGAAG CCGACATCAG CATCACTGAC TTGAGAAGAA CTTCTTCTTT TGCTCCCCCT
CCTTCCACCC ATCACATTCT TAAGAGCATC AACACGTCTG CACATTCTGC CCCAGCTACT
GCGGCTGTCA CACCTGTCAG TGCATCTGCT TCTTTGCCTC CTCCACCTCC ACAACCTGTT
ACGTCTTCGA ACGACTCCAA GTCATATGCA TTCATTTCTC ATTCTCCTGC CACTTTCCCT
CTGCAAGAGC CATCCATTGA CAATGCTCCA TTGGCCAGAA GGAAGAGGAG AAGAACATCG
CCCAACGAAT TGTCGATCTT GAACAAGGAG TTCCTTGTCG GCTCCACTCC TAACAAGATG
AGAAGAATTG AGATTGCGGC CAAGGTCAAC ATGACAGAGA AGGCTGTGCA GATTTGGTTC
CAGAATAAGA GACAGAGTTT GAGAAAGCAG CTGAACCATG AGAAAGAAGT CACGGAGTTG
CCTCCTACTC CTGTGGCAAT GGTTCCTCAT CCTCCAATGC CTGCAATGGT TGTTGCAGTA
CCAAATGCCC ATTCTATTCC ACAAAATACT ACACTTCCAC CTCTTACTAG AAATCCATCT
GGATCTTATT TGCCAGCTCC ACTTACATCC AACCCTCCAT TAATTTCATC TACACCAACA
AAACCATTGA TCAAGTCTCA TTCCTACGTT GGCTCTCCAT CTTTAACTAC ATCTTCTCCA
ATCAAGCCAA GATCTAGTTC GATTCCCAAC TTTGGAAAGA TTCCTGAAGC ATCCAGCACA
CCTTTTGTAT CCAAGATCAT AAATGCTCAA ACTACAGCTA CTTCGACTAC AACTACAAAT
GTCGAAGACG ATTCCAACAC TTCCATGGAC GACTCCATGA TTGCACACTC TAATAAACGT
CAAAAGTTGG TTCTTAATGA AACCAGGAAA AAGCAACCTT TGCAGCTCAA CTCTGGCAGT
TCCAGCACCA TGACCTTCAA GTTGATTCCA AGCAGCACTA AGGTCAACCA GAAGCTCTCC
AGTGTACAAA ATGAGGATAA GAAGTCTTCC ATTCAGAGCA TTTTGAACTC TACATCTACA
TCTACTAGAA AGCCATTGGG CGAAATCAGC AGCAACAATT TGAACAGCAA GCCTATTATA
AAAAATGACA AAAAGGACGC TGCTGCCGAA AATTTATTGA GCTTGAAGGC TGGCCTCTGG
AAGTAA
 
Protein sequence
MYLQTPKRTS TFPPTPSSYG ANKPNLPPLS SLLSSTPSSK DPAITSTPYM PTKLPSIDSF 
STTPTSSRYF QTSSFAPPPS THHILKSINT SAHSAPATAA VTPVSASASL PPPPPQPVTS
SNDSKSYAFI SHSPATFPSQ EPSIDNAPLA RRKRRRTSPN ELSILNKEFL VGSTPNKMRR
IEIAAKVNMT EKAVQIWFQN KRQSLRKQSN HEKEVTELPP TPVAMVPHPP MPAMVVAVPN
AHSIPQNTTL PPLTRNPSGS YLPAPLTSNP PLISSTPTKP LIKSSSIPNF GKIPEASSTP
FVSKIINAQT TATSTTTTNV EDDSNTSMDD SMIAHSNKRQ KLVLNETRKK QPLQLNSGSS
STMTFKLIPS STKVNQKLSS VQNEDKKSSI QSILNSTSTS TRKPLGEISS NNLNSKPIIK
NDKKDAAAEN LLSLKAGLWK
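
The record lists translation table 12, the alternative yeast nuclear code in NCBI's numbering, in which CTG encodes serine rather than leucine. The following is a minimal sketch of how short exon fragments of the gene sequence map to the protein sequence under that table. The `translate` helper and the constant names are illustrative assumptions, not part of the record. Because the gene is spliced, the sketch translates only short in-frame fragments rather than the full genomic sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order under NCBI translation table 12.
# It differs from the standard code only at CTG (Ser instead of Leu).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGTATCTAC AGACACCAAA GAGGACAAGC"))  # MYLQTPKRTS

# An in-frame fragment of the gene sequence containing a CTG codon.
print(translate("GCCACTTTCCCTCTGCAAGAGCCA"))  # ATFPSQEP
```

The second fragment yields ATFPSQEP, which appears in the protein sequence above; under the standard code the CTG codon would give leucine (ATFPLQEP) instead.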