Gene Information |
Locus tag | PICST_35246 |
Symbol | |
ID | 4837521 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2297299 |
End bp | 2299389 |
Gene Length | 2091 bp |
Protein Length | 668 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388836 |
Product | predicted protein |
Protein accession | XP_001383196 |
Protein GI | 150864405 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
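The reported gene length follows from the coordinates above if Start bp and End bp are read as 1-based, inclusive positions. The sketch below checks that arithmetic. The function name `span_length` is hypothetical, and the comparison with the protein length assumes the gene span holds only coding sequence plus introns (no UTRs), which the record does not state.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end_bp - start_bp + 1

# Values copied from the record above.
gene_len = span_length(2297299, 2299389)

# 668 codons for the protein plus one stop codon.
coding_len = 668 * 3 + 3

# Bases in the span not accounted for by coding sequence; under the
# stated assumption these would be intronic ("Is gene spliced | Yes").
non_coding = gene_len - coding_len

print(gene_len, coding_len, non_coding)  # 2091 2007 84
```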
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0592899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCAC TCTATGTCAA AGGTGGAGTC TGGACCAACG TCGAGGACGA GATCCTCAAG GCTGCGGTGT CCAAATACGG ACTCAACCAG TGGTCTCGTG TAGCATCGCT ATTGACGAAG AAACTGGCCA AACAAGCCAA AGCTCGATGG AACGAATGGC TAAATCCTGC AATTGATAAA ACAGAATGGA CCCTCGAAGA GGACGAAAAG CTCTTGAACT TGGCTAAATT GCTACCCAAC CAATGGAGAA CTATCGCTCC TATAGTGGGC CGAACCGCTA CCCATTGTGT AGAACGATAT CAGAAGTTGC TAGATGCTGC AGCCAACGAA GGAGAAGTAG ACGAGGAAGT AAACGAGTTG GCACTAGCAG GTGCTGGAAT CGAGTCAATG GCAGCGACTG GTCCATCTGT AGGCGAGTTA AATATCAATC CCGAAAGCAA GCCAGCAAAA CCAGACAACG TAGATATGGA CGACGAAGAA AGAGAGATGC TTTCAGAAGC CAAAGCCCGG TTAGCCAACA CCCAGGGGAA GAAGGCCAAG AGAAAAGCCA GAGATCGAAT GTTGGAAGAA AGCAGAAGAA TCGCATTGCT CCAGAAACGT AGGGAGCTCA AGGCTGCTGG AATAAATGTC AGTCTAGACT CTAAGAATAA GAAGAAGCGA AAGGAGTTTG ACTATAATGC TGATATTCCC CATGAACATG TACCGCAAGC TGGCTTGTAT GATACCACCG AGGAAGATAA GAAGAACGAC TATGATAAAG TTGGATTCAA TAGACAGATA GCAAAAGACG GAATGAGTTT CCAAGAAGTA GACGATACCC ACAAAAGAGA TAAGGAAAGA CAGTCGCGAG AAGCAGCCAG GGAATCGAAG AAACACAAGA TGGGGTTAGA GTCTGCACTT GAAGTTCTTA ACGAGAATGA ACGAGAGAAC TTGAAGAAAA GAAAACTCGA ACTTCCAGCT CCCATACCCT CCGAAGCTTC TTATCAGACT TCTGTGGTGG TTGACGAAGA CTCTAGTACG AGATTATTGA AAGGATTATT TGAAAGGAGT AATCCAACTG TAGAAGAAAG TGAAGAACAA AAAGAAGACG ATGATGCAGA CGTCAGAATC AAAAACAAGG CTAGAGAATT AATTGCTAGA CAAGCTATTC CATCTACCCT TATTGTTCAA CAGGAAGACA ACAAAGAAGA TTTAAGAACT TCTGCAGATG TTTCTGTGAA AAGAGAGCCT ACCCAGAAAG AAAAACTTAG AGCTACGGCT CAACTAAAGA AGGCAACCAT AGAATTCATA CGAAGCAAGT TCCGTGCCTT GCCTAGACCA CACTCTTCAT CCGGAATTAT TCTACCAAGT TTCGATGTAA ATGAAGAAAC TATCAATTTG AATGTTGAAA ATGAAGGTAT AGGCAACAAT GACGTAGATC AAGGAGAAAG ATTGCACAAC TTGCGGATAT TACAACAGAT TGACGAGGAG AAAGCAAAAT CCAGAAGATC GCAAGCAATT CAACGCGATT TACCGATACC AAATCCTAGC AAGTTACGGA CTCCGGATTT GAAACAAGTT TCTGAAATCG AAAAATTAGT GCTAGCTGAG TTCTCAAGTT TGATCAAGTC TGATTACAGA AAATATGTGG ATCAAAGCTT CCGCGCACCT CTTGTAGAAG ACCTCGAAGA GGAGATACTC ACCAAAGTGA ATGAAGAAAT TCAAAAGGAA TTGAAGGATA GGAAAGTTCC GAAAGTCGAA ATCAAGAAAT TGGAGTTAGA ATTGGACTCT TCCTTTGAAA CAGCAGAACA GGTAATCATG 
AAACTTCACG AGTATCGGAG GAAATGTTCT GAAAATGAAG ACAAACTCAA TTCGAAGCTC AATATGAAAG CATATGAAGA AGAGGAAGAG AGACTCAATA ACGATCTTTA CGAGGTGTAC TCGCAGTTGT ACAATACTTC GTTGGAATTG ACCGAGGTAG AGAAGTTGTT GGAAGCAGAG GAGGATGCTA TTGATAGACG TACCAAGAGA TTGAATGAAT TGGTGGGCCC ATTGGCACAA GTTGAAGCTG AAGCAAGAGA AAGAGTCCGT GAATTGTATT TGAAGAGATA A
|
Protein sequence | MPPLYVKGGV WTNVEDEILK AAVSKYGLNQ WSRVASLLTK KSAKQAKARW NEWLNPAIDK TEWTLEEDEK LLNLAKLLPN QWRTIAPIVG RTATHCVERY QKLLDAAANE GEVDEEVNEL ALAGAGIESM AATGPSVGEL NINPESKPAK PDNVDMDDEE REMLSEAKAR LANTQGKKAK RKARDRMLEE SRRIALLQKR RELKAAGINV SLDSKNKKKR KEFDYNADIP HEHVPQAGLY DTTEEDKKND YDKVGFNRQI AKDGMSFQEV DDTHKRDKER QSREAARESK KHKMGLESAL EVLNENEREN LKKRKLELPA PIPSEASYQT SVVVDEDSNV RIKNKARELI ARQAIPSTLI VQQEDNKEDL RTSADVSVKR EPTQKEKLRA TAQLKKATIE FIRSKFRALP RPHSSSGIIL PSFDVNEETI NLNVENEGIG NNDVDQGERL HNLRILQQID EEKAKSRRSQ AIQRDLPIPN PSKLRTPDLK QVSEIEKLVL AEFSSLIKSD YRKYVDQSFR APLVEDLEEE ILTKVNEEIQ KELKDRKVPK VEIKKLELEL DSSFETAEQV IMKLHEYRRK CSENEDKLNS KLNMKAYEEE EERLNNDLYE VYSQLYNTSL ELTEVEKLLE AEEDAIDRRT KRLNELVGPL AQVEAEARER VRELYLKR
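The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. A minimal sketch of the effect, using only the first 150 bases of the gene sequence above (which precede any splice junction, an observation made by comparing them with the protein sequence): the names `STANDARD`, `TABLE12`, and `translate` are hypothetical helpers written for this illustration, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE12 = dict(STANDARD, CTG="S")

def translate(dna: str, table: dict) -> str:
    """Translate complete codons of an unspliced DNA string, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# First 150 bases of the gene sequence, copied from the record.
first_150 = (
    "ATGCCACCAC TCTATGTCAA AGGTGGAGTC TGGACCAACG TCGAGGACGA "
    "GATCCTCAAG GCTGCGGTGT CCAAATACGG ACTCAACCAG TGGTCTCGTG "
    "TAGCATCGCT ATTGACGAAG AAACTGGCCA AACAAGCCAA AGCTCGATGG"
)

print(translate(first_150, TABLE12))   # ends ...LLTKKSAKQAKARW, as in the record
print(translate(first_150, STANDARD))  # ends ...LLTKKLAKQAKARW
```

With table 12 the output matches the first 50 residues of the protein sequence; with the standard code, residue 42 would be L instead of S.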