Gene PICST_57167 details


Gene Information

Locus tag: PICST_57167
Symbol: YIN1
ID: 4838189
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1825323
End bp: 1827605
Gene Length: 2283 bp
Protein Length: 713 aa
Translation table: 12
GC content: 43%
IMG OID: 640389504
Product: Fungal specific transcription factor
Protein accession: XP_001383639
Protein GI: 150864701
COG category:
COG ID:
TIGRFAM ID:
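The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, that Protein Length excludes the stop codon, and that Gene Length covers the full genomic span including any introns.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def noncoding_bases(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not explained by codons.

    Counts one codon per residue plus one stop codon (an assumption
    about how Protein Length is reported).
    """
    return gene_len - 3 * (protein_len + 1)


gene_len = span_length(1825323, 1827605)
print(gene_len)                        # 2283, matching Gene Length
print(noncoding_bases(gene_len, 713))  # 141
```

Under these assumptions, 141 bp of the span is not accounted for by the 713-residue protein, which is consistent with the record marking the gene as spliced.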


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.256805
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACTG TGGTTAAAGA ATCCACACCA AAGAGAGGGC GTACCAATTT GGCTTGCGAC 
AGGTGTCGCC AAAGGAAAAC ACGTTGCGAC GGAGTGCAAC CGGTTTGTGG AACATGTATG
AAGCGCAAAA TTCCATGTAA ATTTGAGCGT AAATACCCTC GAGCTCAGGT TTCTGTTCAG
TATGTCAAGA CACTCGAAGA GAAGTTGAAA ATCCTAAAAT CTCAAGGCAG AGACAAATTT
AACGATAACG TCAATTTGAA TGTTAATAAC AACGGCTCGA CAGCAAATCC AATAGAAGGT
CTCGGAATTG ATTCCTCTTC TAGTGGTGAA CTTCTTCACA GCCAAAATCA AAATCAGATT
AACCAACAAA ATCAAATACA TTACCACACT CAGTATTCAA TACAACAACT ACCTAACTCA
ATCAATAACC GCATTACGCC TCAACCTATC CGTAACTCGG GGTTCTCCGC CATGTCCGTG
GGCTCCCTGA ATGATATCCA TGCTCCTTCA TTTGCCTACA TCAACTCCAA TAACAAAAAA
TCGTTGATGT TGAACTACGA CGATAAGCAT TCTGTGGAAT ATGGTGTAAA TGGCCTGGTC
ATTTCGGGAA CGTATAAGGA TGAAGAGACC AGTGCAGATG CCATGGGAGC TGGCTCTCTT
TCTGATGACA GGAGCAAAGA CAAGAACTTC TACGGATCTT CAGCAGCAGT TTCGTTCATG
AAAGAACTTG CTTTAACTGT AGATGGAGGT CCCCAAAGGT CGGACTCGGA AAACAAGAAA
TTTGTGGAAC GAGCCAAGTA CAAGATGTCT CGTAACGATG CTAGAAGCAC CAAAGGGTTA
TCAGATATGT TGGTTCCTCC ACGGAGTATC GCTGATCGTT ACATAAGAAA CTACTTCGAC
TTCACCTACA ATCTCTACCC GTTTGTACAC AAGCCAACAT TTATGGCAGC CTATGACGAA
ATCTGGTCTG CTGATGCTGG TTCCTATGAG GTGGATGAGC TTTTCTACTC TATTCTTAAT
ATCATTTTTG CATTTGGTTG TCGACTCCTG TTAGACATTG ACCATCAGGA AAGCTTTGCC
AATGCTGACA TGTACTTTGA GCGATCGCAG GAGTTGCTCC GGTTTCATCT CATGGATGCC
GGGTCACTCT TGTTGGTACA AGCGTTGATA CTTACGGGCC AGTATTTACA AGCTACAACC
AGACTGGCTG GGTGTTGGAA CATCATAGGA TTGGCCATCC GAATGGCGCA GGGTTTGGGG
TTACACGAGG AACAGAATCT TGAGTCTACT AACAGTTACA TTGAAAAAGA GATGAGAGAG
AGATTATGGC ATGGTTGTTT AATGATGGAT AGGATAGTAT CAATGACTTT AGGAAGACCC
ATGATGGTAA TGCATGAGTC CCGTATGTCT TTACCTTCAG CGATTGATGA TGAGAATATC
TTAGACGATT CCTACGTACC CTCTTCTAAT CCATCTTACA TGTGTTTTTT TAATGAAACG
GTTAAGCTCT ACGACATACT TGCAGACATT TTGAAGATTT TCTATTCTAG CAGTGAACCT
GAATTTTTTG ATCTCTTTCT CAGTATTTTT AAAATCGAAG AGAGACTCCA TAAATTCCAT
GAAAATGTAC CTAAGCATAT TAAGTTCGGG TTTGAGTTGC ACGAAAAGCC TTTCCGTCGT
CAGAGTATCA TTCTCCACTT GCGGTACTTG CATTTGAAGA TGGTACTATA TCGCCGTGTA
TTATTTCCTA AAAAATATGC TGCAAACAGG CGGGAAATTC ACTCCGAATT AATTTCCCAT
ACCACAAATT CGATCTCCTT GCTTTGTGTG GAAGCTGCGA TCGAGTTGAT TCAGGTGGTT
AAGAAATATA GAGCGGAGGA CATAGAGATA TTGCCAGCTA GTTGGTACAC TGTGTTCTAC
TTGTACAGTG CGGAGACTGT TATGTTGGCA GCTAAACTTA AGCCCACCCT ACAAGAAGAA
ACCAGCACGG AAGTCTTTAC CACTGCATGG ACCAACGGGT TAGAAATGTT GGCCGGCTAC
CAAGATGAAT CAGAGTCGGC AGTGCGGTGT TTGAAGATAT TGGAGATCAT GGGAGAAAGA
GTTCACGCTT CCGCCTTCCG ACACAAGCGG CATATTGACA TTCCTACGGT CGGCAGCGAC
TCGGCCCAAA ACAGTCCTTC GCATATTCCT TCCGATATTC TCTATTCATT ATTGTATGAT
ACAGCAGGGC CATTTGGAGG TCCTTTCTTC TACCGAGAAG AAATGAATCG GTTTTCCAGC
TAG
 
Protein sequence
MTTVVKESTP KRGRTNLACD RCRQRKTRCD GVQPVCGTCM KRKIPCKFER KYPRAQVSVQ 
YVKTLEEKLK ILKSQGRDKF NDNVNLNVNN NGSTANPIEG LGIDSSSSGE LLHSQNQNQI
NQQNQIHYHT QYSIQQLPNS INNRITPQPI RNSGFSAMSD EETSADAMGA GSLSDDRSKD
KNFYGSSAAV SFMKELALTV DGGPQRSDSE NKKFVERAKY KMSRNDARST KGLSDMLVPP
RSIADRYIRN YFDFTYNLYP FVHKPTFMAA YDEIWSADAG SYEVDELFYS ILNIIFAFGC
RLSLDIDHQE SFANADMYFE RSQELLRFHL MDAGSLLLVQ ALILTGQYLQ ATTRSAGCWN
IIGLAIRMAQ GLGLHEEQNL ESTNSYIEKE MRERLWHGCL MMDRIVSMTL GRPMMVMHES
RMSLPSAIDD ENILDDSYVP SSNPSYMCFF NETVKLYDIL ADILKIFYSS SEPEFFDLFL
SIFKIEERLH KFHENVPKHI KFGFELHEKP FRRQSIILHL RYLHLKMVLY RRVLFPKKYA
ANRREIHSEL ISHTTNSISL LCVEAAIELI QVVKKYRAED IEILPASWYT VFYLYSAETV
MLAAKLKPTL QEETSTEVFT TAWTNGLEML AGYQDESESA VRCLKILEIM GERVHASAFR
HKRHIDIPTV GSDSAQNSPS HIPSDILYSL LYDTAGPFGG PFFYREEMNR FSS
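
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. A minimal sketch of how such a block could be turned back into a contiguous string and summarized follows; the function names are hypothetical, and the GC calculation is a plain G+C fraction, which may not match exactly how the record's GC content value was computed or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Strip spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGACTACTG TGGTTAAAGA ATCCACACCA AAGAGAGGGC GTACCAATTT GGCTTGCGAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applying `clean_sequence` to the full gene and protein blocks and taking `len` of each result gives values that can be compared against the Gene Length and Protein Length fields.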