Gene Information |
Locus tag | PICST_28766 |
Symbol | |
ID | 4851516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2034293 |
End bp | 2035804 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | |
GC content | 42% |
IMG OID | 640393224 |
Product | predicted protein |
Protein accession | XP_001388017 |
Protein GI | 126274720 |
COG category | [A] RNA processing and modification |
COG ID | [COG5182] Splicing factor 3b, subunit 2 |
TIGRFAM ID | |
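The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence in this record ends in TAG). The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp, End bp.
length = gene_length(2034293, 2035804)
print(length)                  # 1512
print(protein_length(length))  # 503
```

Both results agree with the Gene Length (1512 bp) and Protein Length (503 aa) fields. The strand does not affect this calculation because the start and end positions are given in replicon coordinates.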
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.194827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCCA AGAGAAGCAA GAACCAGCTT CGTAGAGAAC GGGTGAAGCT CCGAAAGCTA GAAGCTGACA AGAAAGAAGA TACATTGGAG AGCACTGAAA TACAAAAGCC AGACAAAGTA AAAGATTCAG AAACGACAAA TAATGACAAT AAAGAACATA GAAATGACAA AGCTCACGAC ATAGATACCG CTGAAGAGCA GAAAGATTCA GTCGAAAAGT CTGTTGAGAA TCTTGAAGAT TTTATAGCGA TTGCCAAAGA TTCTTTTCAA TCTGTTCCGG TTAATACTTC AATAGATGAG TCTCTTTATC AGCAGTTCCA GGGAGTGTTC AGTAAGTTTC AAGGAGCTAC TGTTGCTGAA GAAGAAGTAG AATCTGTTCC AGAATCCAAA GGTGATGTTC TCTATAATAG TGGGTCTGAT GAAGAGTCAG AATTGGAGTC CCTGGATTCT GAAGAAGAGG AAGAGCTTTC TAAACGACAA CTTCGTAAAC GTAACAAAGT GCCATTGGCC TCGCTCAAGG CATCAACTAT ACGGCCACAA CTCGTTGAAT GGTATGATGT AGATGCTCTG GACCCGTTCT TCCTCGTAGC ATTAAAGACG AGCCCCAATG CGGTTCAGGT ACCGAGCCAC TGGTCAGCAA AGAGAGAGTA TCTTTCATCG AAGAAAGGAA TAGAACGATT ACCGTTTCAG TTGCCAAAAT TTATCACCGA TACAGGTATT CAAGATATGA GACATAGTGA TGATCAAACG TTGAGGCAAC AGCAAAGAGA CAGAGTACAG CCCAAGATGG GTCGATTGGA TATCGATTAC CAGCGCCTTC ATGATGCTTT TTTTAAATAT CAGGAAAAGC CACGATTGCT TGGTTTTGGA GATGTATACT TTGAAGGTAG AGAAGCAGCT GATGAATATA GCAATGATCT CTCTAGCATA AGACCTGGCA AGGTGTCATC TGAGCTACGT AAAGCTTTAG GTATTCCTGA AGGTGCACCT CCTTGGATTT CCATTATGAA GGATATAGGT AAGCCTCCTG CCTATTCCAG TCTAGCAATA CCTGGATTAG ATACTTCATA CGATAATGAC GGCTACAGAG ATAGTAAATC TGTGAACACT AGTAAATTGC ACGAAACAGA ACATTGGGGG AAGCTAGAGG ACTACGAAGA ATCTGAAGAA GAAGAAGTGG ATGGTGAAGA GGAAGATGAA GAGCTGGATG CTGACGATGA CGAAATGATT GCATATGAAC AGGAAGAGGA ACAAGCTGAT GATGAACCTG TGAAGGTGCA GATTTCAGAA TATGGTGGAA TAAAGTCAAG ACCACACAAG CCTGTTGATG AAAGCAATGA ACACAAGTCG CTTTACACTG TAATCAAAGA GAAACAACCT CTGGAAGGAG CTGGTCTATT ACAGAGTGGC TTTTCCTACG ACCTCTCCAA AGATTCACAA GTTGATGAGA CACCTGTTAA ACAGGATACG AAGGTTGAAA CTCTTGAACC TAAGAAGAAG TTCAAGTTCT AG
|
Protein sequence | MPPKRSKNQL RRERVKLRKL EADKKEDTLE STEIQKPDKV KDSETTNNDN KEHRNDKAHD IDTAEEQKDS VEKSVENLED FIAIAKDSFQ SVPVNTSIDE SLYQQFQGVF SKFQGATVAE EEVESVPESK GDVLYNSGSD EESELESLDS EEEEELSKRQ LRKRNKVPLA SLKASTIRPQ LVEWYDVDAL DPFFLVALKT SPNAVQVPSH WSAKREYLSS KKGIERLPFQ LPKFITDTGI QDMRHSDDQT LRQQQRDRVQ PKMGRLDIDY QRLHDAFFKY QEKPRLLGFG DVYFEGREAA DEYSNDLSSI RPGKVSSELR KALGIPEGAP PWISIMKDIG KPPAYSSLAI PGLDTSYDND GYRDSKSVNT SKLHETEHWG KLEDYEESEE EEVDGEEEDE ELDADDDEMI AYEQEEEQAD DEPVKVQISE YGGIKSRPHK PVDESNEHKS LYTVIKEKQP LEGAGLLQSG FSYDLSKDSQ VDETPVKQDT KVETLEPKKK FKF
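The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, with hypothetical helper names, of how such a field might be normalized before computing its length or GC content. The short string used here is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
example = "ATGCCACCCA AGAGAAGCAA"
seq = clean_sequence(example)
print(len(seq))         # 20
print(gc_percent(seq))  # 50.0
```

Applied to the full Gene sequence field, the same two helpers would give values to compare against the Gene Length and GC content fields of the record.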