Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29498 |
Symbol | SHE4 |
ID | 4836756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 411172 |
End bp | 413658 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640388071 |
Product | protein required for mother cell-specific HO expression |
Protein accession | XP_001382314 |
Protein GI | 150863742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAGT TGGCTCAGCA AGTTGAAAAC TTATCAATTA CTGAAGATGT TTCAGAAGAG ATCGTTAGGC TTTTCAGCGT GAGAGCCGGA CTTCTGCAGC CAGCATTAGA TGAATGTAAC AGTACTCAAC TACTTCAAAA GGCAATAAAG CTCTCTCAAA AGAAAGAGTA TTCTACGAGA TTGTACGAAG CAGTGCTCAA TGACACCGAA ACTTTTCTAA ACGGTCTACA AACTATAGGT GATGACCAGG CCGTTGTACT CACTAATGTG CTATCGAATT CTAGCTCTGA GCCTAAACCT CCAGTAGTTC AGAACTTGTT GAATTCTATC AAGACAATTA TAAAGCCCTT GATATTAGAT AAAGACACAG TATCGAAAGT CAGATTGTTC TTGTCAATCT ATTTTTCCAT AGTATCTCAC TTTGGGAACG AAAGTGCCCA ACATCTTCAC GTCTTGCTCA AATTCATCAA CCCTATTCCA GTCATTCTCT CTGAATCTGG GGAATCGGAC TACAAGACAA TCGTTTCTCT AGTTATGATG ATATTGGTCA AGAATTTGCA AGCTAACAAA AAAACTACAT GCGAAGTTAT CACCGAGTAT CTCGAGTTGA TCAAAGAAGA CGAAAGTCCC ACTGCAGCCG AAGTGTTGAA CTACATCGAA TTGTTAGAAA ACTTATACCC ATTGATCCCA GAAGTAATTG AACCAATTTA TACATGCGAT AAATCCAAGA ATTTGATAAC CGCTGAAGTT GATCGCGTTC TTGAATCTGG TGGCAGGGAA TTGGAGTATC CGAACAGACG TAGGGTCTGT ATCAGTATCT TGAATCTTAT AAGCAGTTCT TGTATTTCTG AAACTGGCCG TAACTACAAT GTTTCCACTT TCTTGCAATT ATTAAAAGCA GGAACAATGC TTAAAGATAT CGAAATTAAA TTATTGTCTA CATTGAATCT TATCAAGTTA TGGAACTTTA TAGAATTGGA GAAGAAATCG GAATCTTCCG AGAATGTGAC GATAACAAAT CTTGAAACAA ACTTAACCAG CTACTTGCGA AATTCAAATG TCGCCGAAGA TGCACGAAAT ATTGAAGTTT GTGTAGAAGG TTTGGCATAC TTGAGTTTGA ATACAAGCGT AAAGCAGCAC CTTAGACAGG ACGAAGTCTT AATAGAACTA TTGTTGAAGA TATTGAAAGA TAGCTCAAGC GGGGCAAAGA AGCAAAGAAA TGACTCGTCC TTGGTTTACG GAATTTTGGT TTTGATTTCT AATCTTGCAA AGTTGAAAGA TCCCAATGAT AAGGGTTCAG ATAGAAGTAC TGCTTCCTTT TTGAAGGGAT TCGCAACTCC TAGCGGTTCA AGAACGAAAG ACAAAGAAGA AGACCAAGAT GCCATCCAGT TGTTCAACAG ATCTCTATTA AAGGACCACA AGATTATTGA GATTATCTCA GTTTTAAAGA TTTACAAGGA AGAGGACGGA ACGGCACCAC AGATACAACA GAACAACTTA CTAAGACAGT TCATTTTTAT ATTGCATACG CTTTCCATGA ACCCCCAAAG AGCAGTCAAG GAAGAAATAG TCAAACAGGG AGGATTAAAC GTAATATTGG GCTATCTCAT CAAATATAGC ACAGTTAGTA AAGCCACAGG AGAAACTCGT CCAATTTCTA GTTCTGCAGA ACTTATTGAT ACGAGAATGC TCGCGATTCG AGCTTTGGCC AAAATCTTGA TATCTGTAAA CCCTTCATTA TCATTCAAAA AATATGATAT CAAAACATCG GTTCCATTTC TCGTCGAGCT ATTGGGTCCT GATATCTCTG TATATACTGG CTCACTTGAT ACCCAATCGG CAAACGAAAA GTACCTCTTT GACTTTACTA ATCTTGACAA GTACGAGTCG TTGATGGCAT TGACAAATCT TTCGTCCAAT GAGGATACTC AATTACAGGG GTTGATACTT CGCAGAACTT ACGATACGTA TTTGAACAAC TTTATAATTG ATTCAGATAT CCCACCTGTC CAAAAAGCTT CTTGGGAGCT CATATCCAAC TTGATAACGC AAACATCTTT ACTTGCTAAA TTTTTCAATC TAGAAGATAA AGACAGCTAC AAAAGGTTAG ACTTGTTAAT TAAATTGTTA AACTCCAAAG ACGAAGAACT ACAGATAGTG ATTGCTGGTT TGTTGGCTAA TGCTACCTCT GAATTTGATA TGATTTCCGA GATCCTTGTG AAAGATACGA AAATTTTTAA GGACATCACC AACACTCTTA GTTTCATTTT TCAACATCAA AATAGTATCG ATAACTTGAT ATTGCGGTGC AGCTACGTAT TGATCAACTT GGTCTATGCG GCTGCCAACT TGGGAGAGGA GAAGCTACAA GAATTTGCTG ATAATCAAAA GCTTAAACTG GCGGTCAACG AGACCTTGAA AGCAACAAAA AACCAGGGTA TCCTAGAAGT ACTTATTGAA GTCATTAAGA TGGTGCGTTT TAAGTAG
|
Protein sequence | MKELAQQVEN LSITEDVSEE IVRLFSVRAG LSQPALDECN STQLLQKAIK LSQKKEYSTR LYEAVLNDTE TFLNGLQTIG DDQAVVLTNV LSNSSSEPKP PVVQNLLNSI KTIIKPLILD KDTVSKVRLF LSIYFSIVSH FGNESAQHLH VLLKFINPIP VILSESGESD YKTIVSLVMM ILVKNLQANK KTTCEVITEY LELIKEDESP TAAEVLNYIE LLENLYPLIP EVIEPIYTCD KSKNLITAEV DRVLESGGRE LEYPNRRRVC ISILNLISSS CISETGRNYN VSTFLQLLKA GTMLKDIEIK LLSTLNLIKL WNFIELEKKS ESSENVTITN LETNLTSYLR NSNVAEDARN IEVCVEGLAY LSLNTSVKQH LRQDEVLIEL LLKILKDSSS GAKKQRNDSS LVYGILVLIS NLAKLKDPND KGSDRSTASF LKGFATPSGS RTKDKEEDQD AIQLFNRSLL KDHKIIEIIS VLKIYKEEDG TAPQIQQNNL LRQFIFILHT LSMNPQRAVK EEIVKQGGLN VILGYLIKYS TVSKATGETR PISSSAELID TRMLAIRALA KILISVNPSL SFKKYDIKTS VPFLVELLGP DISVYTGSLD TQSANEKYLF DFTNLDKYES LMALTNLSSN EDTQLQGLIL RRTYDTYLNN FIIDSDIPPV QKASWELISN LITQTSLLAK FFNLEDKDSY KRLDLLIKLL NSKDEELQIV IAGLLANATS EFDMISEILV KDTKIFKDIT NTLSFIFQHQ NSIDNLILRC SYVLINLVYA AANLGEEKLQ EFADNQKLKS AVNETLKATK NQGILEVLIE VIKMVRFK
|
| |