Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67483 |
Symbol | |
ID | 4838695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 336703 |
End bp | 339886 |
Gene Length | 3184 bp |
Protein Length | 657 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640390010 |
Product | predicted protein |
Protein accession | XP_001384019 |
Protein GI | 150864981 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACCAATAAG TTATAAGAGC AAGCAACGAG CTGTCCGGCC CGACGCATCT AAAGCACGAC ATCTCTCGAG GTACGACATT CCCCTCGACG CCCCAAAACC GCAACCCACA CCCAAGAGGC CCACATCCAC TGGCTAGCTT AACTAGACGT CGCACGACAT TGTACCGCGA CATCCCACGA CACGACAACC CCCACGGCCG CCTATCACTG ACCCAGCGTT TCTGTCAGTG ACTTGGTCCA GAATCAGATT ATTATCTGCT CCATATAGTA ATCGTCCTTC CTAGGCTTCA AAAGTCAAGG AGTAAACGAC GACATTGGAG GCCCCAGAAA ACCCTTCCAA ATATAAGGGG CACCAGCACA ACAAAGTGGG ACCCACCCAG ACAAGTTTTA GACTCCACCA AAGGTCTAGC CGTGACAATT TGTAGATCTC CAGTCACTTT ACGACATTAC TTTGACTAGG CTCTAGCACC AACTCTTTCT CGTCCCCACA TTGCCACCAA TCTCATTGAT AGCCACTAGC ATATTCAGTC ATCTGGTTTG AAAATTTCTC CACTACGTCA AATTTTCCAG CGAAGCAGAC TAAAGTCACC CTATCCAGAA GTCTAGCAGG GAAAATATAT AAGGAGCCCT CTTTATATTA TCGAGACATC CGTAATTTCC ATACCCCGTT CTTGACACTA TAGGAGAGTT TGTTCTTGAT CCTTTTGACA TTTATTGATA AATCTTACTT TTCCTTGATC TTCCAGATCT TTATCCATAC TTGGTAGTCT CATTTATATA GTAGTCCATT TGGCTCGCAT ACCGTTTTAC GCTACCTTTC CTCATACAAA ATTAATCCCT GTTTTGGACC CAAAAATTAA CTAAAAAGCA TCTGCCATTC CACCATTTAG CAAGTAGCTA ATCCAAATTA CCCCCTACAG CGATATCCTT CAGTGATATC TGCAGAGCCC CCACAAGAGC CCCCCATCAT TCAGTCCCAG ATATATATTC GCACACCGGT GCTCACCATG AATTACCATA ACTCAAACCC CGGCGGGCCC GACGCTACGT CGGCTTCCTC AGCACCAGCA TCCACAGCGT CGTCAGCATC GGCAGCCGCT TCTGCATCCT ATTACTACGC TCCGGTACAA CAGTACCAGC CGCAACCGCT CCACCACACC ATCCAGCCTA ATGGCCCGAA CTCGGTTCCA ACCACCCCAG GAGCCCCCAC AAGAAGAGGT CCATGGTCTC CTATGGAAGA CAAGAAGCTT CTTGATCTCA TCAACATCTT TGGTCCTACC AATTGGGTGC GCATCTCTAA TAGCATTGGA ACGAGAACGC CGAAGCAATG CCGCGAGCGC TACCACCAGA ACTTGAAGCC GCTGCTAAAC CGTCTGCCAA TTACAGTAGA AGAAGGCGAG TTGATCGAGC TGTTGGTAGC AAAATACGGT AAGAAATGGG CCGAAATTTC TCGTCACTTA AATGGCCGTT CCGACAACGC CATCAAGAAT TGGTGGAACG GAGGAGCCAA CCGTAGAAGA AGAGCCTCGT TGGTGCACGA ACCGAATGTG GCTGGTAACA GCAACAGCAA TAATAATAAC AACTCGATGA GCAGCTCTAA CGGTAGTAAC GTCAATGCCA ATGGCCTCAA TAGTAGCACT AGCATCTCGA CGATGTCGGC ATCCACTTCT GCATCCACCA ACTCCACACG CTCGCAAAAC GGGTCGCTCT CCAGTCCTTC AGGACTTCCC ACGCTTACCC AGAACAAGAG TTCTGCTAGC TTGCCTGAAG CAGTGTTGTC TGTCAGCTCG GCACAGGTCC ACCAGTCAAA TTCGCTGCGC TCATCACTCA ACGAGCCTTC GCTCTCGGCA AACTCATCTG CACTCAACTT ATCTGCTGCA AACCCGAACA GTCTTTCATC CTACGCCATC ACCAATAACC ATAACACCAA CATCAACAAT ACCAGTAACA ACTCCACAAT ATTACCACCT CCTATCGGAG CTTCGCAGCC TTCGACTTTC CCCCAGATTC CTCAGCTTCC CCAGATTTCA TTTAACACAT CCATGTTTGG TAAGCCTGAT GCGCTGTTCA AAGCCCATAC ACCTCCTCCT GGGTCTATGG CTGCAGCCGT CCCTCATACC ATGACTTCAC CTGTAAAAGC GACTTCACTT AGATCTGCTA GTTTTGACGT AACCTCGGCT ACTGGAGCCA CGAACGCATC CACTTCAACT CTTACATCCA CCACTCTTCC TCCAATTTCT TCATCTAACA AGAGAAGACT CTTGGACGAC CCCATCAGTA GAAGACATTC CACTGCAAAC TACCACTATG CCCATCCCAA CGGAAACACA AATAACAATA ATAATAATAA TAATAATTTC GCAGTTCCGA CTTCTTCGGC TCCTGGTTCG GCATCTGCTG CTACAGGTTC AATTATCGGA GGTGCTGGCA CCGTTTCTCC CTCGTACTAT GGCTCGCCAC TACTTCTCAG TACTCAAGTA TCGAGAAACA ACTCGATCTC ACACTTTGAG TTTCTGACGT TGAACTCAAC CTCGCATTCT CTGAGAAGAT CGAGTTCGAT AGCACCAGAC TTCTTTCCAA ATCCATTAAA GGAGCTACAG GCAGCCGCAT CTTCCTTGAA CAAGGAGGGA AACGTGAACC ACAAACGTAA CATGTCGCAG AACTCGTCGT TCAACTCTCC TTCTTTGACT CCTTCTACCC GTTTCTCCAT CTCATCAACT ACCTCCCTTT TAAATAACAC TTCTACCAAC TTGACAATGC CATCAGCCAC AACCCTGCCT TCCAGCAATT ACAACGGTCT CAAGAACGAT CATTCTTCTA GCAGTGGGTC CATTCCAGCA CTCAAAGAAG AAGTCGAGTT GAAGTTGAAG CACAAGAACG ACTTGGACGA TGTAGACATG GACGACTCCC ATAACCACTT GCAAAATCCC AGGACAACCA TGGTGAAAAC CAAGATCTCG GTTCTGAGCC TCATTGATTG AATGAGCGGT GGATCCGTTC TCTCGGACTG TTTTCGTACT TCCATCTGTA ACAATACTTG TTTTCCTTAT AGACCTCCAA AAACAGTTTC TTCAATTCTT GGTTATCTAT TTTTCAATTC TCGGGAGTTT ACTCTGTAAC TAATAATTTG ACTATTACTT CAAGAAAATG AAATACATAT TTAAGAATAC ATGCATCGTT CTAC
|
Protein sequence | MNYHNSNPGG PDATSASSAP ASTASSASAA ASASYYYAPV QQYQPQPLHH TIQPNGPNSV PTTPGAPTRR GPWSPMEDKK LLDLINIFGP TNWVRISNSI GTRTPKQCRE RYHQNLKPSL NRSPITVEEG ELIESLVAKY GKKWAEISRH LNGRSDNAIK NWWNGGANRR RRASLVHEPN VAGNSNSNNN NNSMSSSNGS NVNANGLNSS TSISTMSAST SASTNSTRSQ NGSLSSPSGL PTLTQNKSSA SLPEAVLSVS SAQVHQSNSS RSSLNEPSLS ANSSALNLSA ANPNSLSSYA ITNNHNTNIN NTSNNSTILP PPIGASQPST FPQIPQLPQI SFNTSMFGKP DASFKAHTPP PGSMAAAVPH TMTSPVKATS LRSASFDVTS ATGATNASTS TLTSTTLPPI SSSNKRRLLD DPISRRHSTA NYHYAHPNGN TNNNNNNNNN FAVPTSSAPG SASAATGSII GGAGTVSPSY YGSPLLLSTQ VSRNNSISHF EFSTLNSTSH SSRRSSSIAP DFFPNPLKEL QAAASSLNKE GNVNHKRNMS QNSSFNSPSL TPSTRFSISS TTSLLNNTST NLTMPSATTS PSSNYNGLKN DHSSSSGSIP ALKEEVELKL KHKNDLDDVD MDDSHNHLQN PRTTMVKTKI SVSSLID
|
| |