Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_68096 |
Symbol | |
ID | 4840057 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 823937 |
End bp | 825898 |
Gene Length | 1962 bp |
Protein Length | 578 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391372 |
Product | predicted protein |
Protein accession | XP_001385858 |
Protein GI | 150866309 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0298848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAACCATCTA CCATAACGGC TGCAAAAAAT ACCAATTGGC TGCAACATGG ACGAGCAGCA GGAAATCGGA GGAGCACATG CTCTCCTCCA ACTAGGCTCA AAGCAGAAGG ATGATGGAAT AGACGACAGC ATTGATGCAA TCGGCAATAA TGAAAACAGC AATGTCAATG AAGATGTAGA TGACGAAGGA GACCTTTCAG ACTCGAAGAA CCAGCAGATT AATGATGCAG TAGAGGCTGC TGTCATGAGG TATGTGGGAG GAACTTTGGA CTCTGCAGAG CATGAGAGCA AAAGAAGCAA ACGGAAGATC CACGATGAGA TCATCAACAA TATCCACGAG TTCAACCAAT GGACTGGATT CTTAGAAGAA AATATCAGCG ACCATGGCAA CGAGGACTAC GATGCTTATA CGTCTCAACA GCAGTCTCAG CCGCTCCATC TGAACAAAAG AAGCAAAGGA AAGAAAAGAA GAACTCAACT GGGAACAAGC GACATCGATC CAGAACTCGA GGCTTTAGGC ACAACTGAAC ACGACCAATT AGTAGAGGCA GCAATAATTG ATGCCAGAGA ACTTGCCAGA CACATCAATG AACAGGGAGC CGGAACTGGG AACCTAAATC TCAGCCACCA CCAACAGCAC GCTGGATCTG ACTCAATCAA TGCGATTACT CAACTTGCTC AGGCTGCTAC ATCGTTATCC GAAACAAAGA AAGCAAAACT CAAGAGAAAA GATGGGGAGT CATACCAGAT GAAGAACATT GCCTTACGAC CCAAGTTCAA CAACTTGACC AGTGTAGAAA CTCTCATAGA GGAAGCCTCG GCGCAAGCCT GTGAATGGTT CAATTCCCTA CCTGACACTA CGGGTAAAGG TCCACGTATG TTTTCGGCAG AAGAGATGAG TGCAGTAGAC CATTTCGTAG CAGGTTATTG CCATTTGAAC AAATGGACAA GAGAAGATGT GTGTAACAGA GTGTGGTCCA ACGAGAGAAA GAAAGACAAT TTCTGGGAGT CTTTAGTACG AGTACTACCA TACAGATCCA GAGCTTCTGT GTACAAGCAT GTGAGGAGAA TCTACCATGT GTTTGACGTC AGAGCAAAGT GGACAGAGGA AGATGATGCC TTGCTTAAGA AACTTGCACT TACACATGAG GGTAAATGGA AACAAATTGG AGAAGCTATG GGTAGAATGC CGGAAGATTG CCGTGACAGG TGGAGAAATT ACGTCAAGTG TGGAGACAAC AGAACATCAA ACCAATGGTC ACAAGATGAA GAGAACGCCC TTAAGCAGAT CGTCACTGAT ATGTTTCAAC AATCTGGAAA CAAAGAGTAT GCGTCTATTA ATTGGACCGT TGTTAGTGAA AGAATGAATG GAACCAGATC TCGTATTCAG TGTCGTTACA AATGGAACAA GTTGGTAAAG AGGGAAACTG CTCTTCGTGC AACGTACATG AATTCCGACA CAAAACTCTG GATGCTCAGA AAACTCCAGA GCTCTGGCTG GGATTCTGTT GATAGTGTCG ATTGGACTGA AGTAGCCCGT TTGCATAGGG AAGAGAATGT TAAGCAGGAT GAGAATGGCT ACCAGTGGGA TGCGCCAGAT TTCAAGGCTA GTTTTGAGAA GATGAGGTCA GAAGTGAGAG ACCACAAGAG GCTTTCATTT GTTACCATTT TGATGCGTTT GATTGAGGAT TTGGAAGGAC ATCCAAAGCT TATAGCTCAA CATTTGAGAG AAAACAAGGA CAACAGTAAC AAGCTCTACT ACGACCGCCA AAACAAGACC AAGAACGACA AAATAGTTGA TCCAAACGAT CCTGAGTCAA TAGCAACAGC AGCTGTTGCT GCGGTCTCGT CAGGTGTAGA CGGTGTTGAT GCCCAGCAAC AGGCATATAG CTTATGGCGA TAGAGGCTAA TTTCTTAATC TAGTTGTACG TATTGATATA CTTGTAAATA TAGAGTTACA TT
|
Protein sequence | MDEQQEIGGA HALLQLGSKQ KDDGIDDSID AIGNNENSNV NEDVDDEGDL SDSKNQQIND AVEAAVMRYV GGTLDSAEHE SKRSKRKIHD EIINNIHEFN QWTGFLEENI SDHGNEDYDA YTSQQQSQPL HSNKRSKGKK RRTQSGTSDI DPELEALGTT EHDQLVEAAI IDARELARHI NEQGAGTGNL NLSHHQQHAG SDSINAITQL AQAATSLSET KKAKLKRKDG ESYQMKNIAL RPKFNNLTSV ETLIEEASAQ ACEWFNSLPD TTGKGPRMFS AEEMSAVDHF VAGYCHLNKW TREDVCNRVW SNERKKDNFW ESLVRVLPYR SRASVYKHVR RIYHVFDVRA KWTEEDDALL KKLALTHEGK WKQIGEAMGR MPEDCRDRWR NYVKCGDNRT SNQWSQDEEN ALKQIVTDMF QQSGNKEYAS INWTVVSERM NGTRSRIQCR YKWNKLVKRE TALRATYMNS DTKLWMLRKL QSSGWDSVDS VDWTEVARLH REENVKQDEN GYQWDAPDFK ASFEKMRSEV RDHKRLSFVT ILMLDPNDPE SIATAAVAAV SSGVDGVDAQ QQAYSLWR
|
| |