Gene PICST_29498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29498 
SymbolSHE4 
ID4836756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp411172 
End bp413658 
Gene Length2487 bp 
Protein Length828 aa 
Translation table12 
GC content38% 
IMG OID640388071 
Productprotein required for mother cell-specific HO expression 
Protein accessionXP_001382314 
Protein GI150863742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAGT TGGCTCAGCA AGTTGAAAAC TTATCAATTA CTGAAGATGT TTCAGAAGAG 
ATCGTTAGGC TTTTCAGCGT GAGAGCCGGA CTTCTGCAGC CAGCATTAGA TGAATGTAAC
AGTACTCAAC TACTTCAAAA GGCAATAAAG CTCTCTCAAA AGAAAGAGTA TTCTACGAGA
TTGTACGAAG CAGTGCTCAA TGACACCGAA ACTTTTCTAA ACGGTCTACA AACTATAGGT
GATGACCAGG CCGTTGTACT CACTAATGTG CTATCGAATT CTAGCTCTGA GCCTAAACCT
CCAGTAGTTC AGAACTTGTT GAATTCTATC AAGACAATTA TAAAGCCCTT GATATTAGAT
AAAGACACAG TATCGAAAGT CAGATTGTTC TTGTCAATCT ATTTTTCCAT AGTATCTCAC
TTTGGGAACG AAAGTGCCCA ACATCTTCAC GTCTTGCTCA AATTCATCAA CCCTATTCCA
GTCATTCTCT CTGAATCTGG GGAATCGGAC TACAAGACAA TCGTTTCTCT AGTTATGATG
ATATTGGTCA AGAATTTGCA AGCTAACAAA AAAACTACAT GCGAAGTTAT CACCGAGTAT
CTCGAGTTGA TCAAAGAAGA CGAAAGTCCC ACTGCAGCCG AAGTGTTGAA CTACATCGAA
TTGTTAGAAA ACTTATACCC ATTGATCCCA GAAGTAATTG AACCAATTTA TACATGCGAT
AAATCCAAGA ATTTGATAAC CGCTGAAGTT GATCGCGTTC TTGAATCTGG TGGCAGGGAA
TTGGAGTATC CGAACAGACG TAGGGTCTGT ATCAGTATCT TGAATCTTAT AAGCAGTTCT
TGTATTTCTG AAACTGGCCG TAACTACAAT GTTTCCACTT TCTTGCAATT ATTAAAAGCA
GGAACAATGC TTAAAGATAT CGAAATTAAA TTATTGTCTA CATTGAATCT TATCAAGTTA
TGGAACTTTA TAGAATTGGA GAAGAAATCG GAATCTTCCG AGAATGTGAC GATAACAAAT
CTTGAAACAA ACTTAACCAG CTACTTGCGA AATTCAAATG TCGCCGAAGA TGCACGAAAT
ATTGAAGTTT GTGTAGAAGG TTTGGCATAC TTGAGTTTGA ATACAAGCGT AAAGCAGCAC
CTTAGACAGG ACGAAGTCTT AATAGAACTA TTGTTGAAGA TATTGAAAGA TAGCTCAAGC
GGGGCAAAGA AGCAAAGAAA TGACTCGTCC TTGGTTTACG GAATTTTGGT TTTGATTTCT
AATCTTGCAA AGTTGAAAGA TCCCAATGAT AAGGGTTCAG ATAGAAGTAC TGCTTCCTTT
TTGAAGGGAT TCGCAACTCC TAGCGGTTCA AGAACGAAAG ACAAAGAAGA AGACCAAGAT
GCCATCCAGT TGTTCAACAG ATCTCTATTA AAGGACCACA AGATTATTGA GATTATCTCA
GTTTTAAAGA TTTACAAGGA AGAGGACGGA ACGGCACCAC AGATACAACA GAACAACTTA
CTAAGACAGT TCATTTTTAT ATTGCATACG CTTTCCATGA ACCCCCAAAG AGCAGTCAAG
GAAGAAATAG TCAAACAGGG AGGATTAAAC GTAATATTGG GCTATCTCAT CAAATATAGC
ACAGTTAGTA AAGCCACAGG AGAAACTCGT CCAATTTCTA GTTCTGCAGA ACTTATTGAT
ACGAGAATGC TCGCGATTCG AGCTTTGGCC AAAATCTTGA TATCTGTAAA CCCTTCATTA
TCATTCAAAA AATATGATAT CAAAACATCG GTTCCATTTC TCGTCGAGCT ATTGGGTCCT
GATATCTCTG TATATACTGG CTCACTTGAT ACCCAATCGG CAAACGAAAA GTACCTCTTT
GACTTTACTA ATCTTGACAA GTACGAGTCG TTGATGGCAT TGACAAATCT TTCGTCCAAT
GAGGATACTC AATTACAGGG GTTGATACTT CGCAGAACTT ACGATACGTA TTTGAACAAC
TTTATAATTG ATTCAGATAT CCCACCTGTC CAAAAAGCTT CTTGGGAGCT CATATCCAAC
TTGATAACGC AAACATCTTT ACTTGCTAAA TTTTTCAATC TAGAAGATAA AGACAGCTAC
AAAAGGTTAG ACTTGTTAAT TAAATTGTTA AACTCCAAAG ACGAAGAACT ACAGATAGTG
ATTGCTGGTT TGTTGGCTAA TGCTACCTCT GAATTTGATA TGATTTCCGA GATCCTTGTG
AAAGATACGA AAATTTTTAA GGACATCACC AACACTCTTA GTTTCATTTT TCAACATCAA
AATAGTATCG ATAACTTGAT ATTGCGGTGC AGCTACGTAT TGATCAACTT GGTCTATGCG
GCTGCCAACT TGGGAGAGGA GAAGCTACAA GAATTTGCTG ATAATCAAAA GCTTAAACTG
GCGGTCAACG AGACCTTGAA AGCAACAAAA AACCAGGGTA TCCTAGAAGT ACTTATTGAA
GTCATTAAGA TGGTGCGTTT TAAGTAG
 
Protein sequence
MKELAQQVEN LSITEDVSEE IVRLFSVRAG LSQPALDECN STQLLQKAIK LSQKKEYSTR 
LYEAVLNDTE TFLNGLQTIG DDQAVVLTNV LSNSSSEPKP PVVQNLLNSI KTIIKPLILD
KDTVSKVRLF LSIYFSIVSH FGNESAQHLH VLLKFINPIP VILSESGESD YKTIVSLVMM
ILVKNLQANK KTTCEVITEY LELIKEDESP TAAEVLNYIE LLENLYPLIP EVIEPIYTCD
KSKNLITAEV DRVLESGGRE LEYPNRRRVC ISILNLISSS CISETGRNYN VSTFLQLLKA
GTMLKDIEIK LLSTLNLIKL WNFIELEKKS ESSENVTITN LETNLTSYLR NSNVAEDARN
IEVCVEGLAY LSLNTSVKQH LRQDEVLIEL LLKILKDSSS GAKKQRNDSS LVYGILVLIS
NLAKLKDPND KGSDRSTASF LKGFATPSGS RTKDKEEDQD AIQLFNRSLL KDHKIIEIIS
VLKIYKEEDG TAPQIQQNNL LRQFIFILHT LSMNPQRAVK EEIVKQGGLN VILGYLIKYS
TVSKATGETR PISSSAELID TRMLAIRALA KILISVNPSL SFKKYDIKTS VPFLVELLGP
DISVYTGSLD TQSANEKYLF DFTNLDKYES LMALTNLSSN EDTQLQGLIL RRTYDTYLNN
FIIDSDIPPV QKASWELISN LITQTSLLAK FFNLEDKDSY KRLDLLIKLL NSKDEELQIV
IAGLLANATS EFDMISEILV KDTKIFKDIT NTLSFIFQHQ NSIDNLILRC SYVLINLVYA
AANLGEEKLQ EFADNQKLKS AVNETLKATK NQGILEVLIE VIKMVRFK