Gene Information |
Locus tag | PICST_72477 |
Symbol | NOP58 |
ID | 4839579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 108195 |
End bp | 110116 |
Gene Length | 1922 bp |
Protein Length | 515 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390894 |
Product | part of small (ribosomal) subunit (SSU) processosome; U3 snoRNP protein |
Protein accession | XP_001384674 |
Protein GI | 126136301 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
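The Gene Length value above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic follows; the function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Inclusive coordinates: both endpoints count, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
gene_length = span_length(108195, 110116)
print(gene_length)  # 1922
```

The result matches the listed Gene Length of 1922 bp.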
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.158216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGTTCATATA TTGTTTGTAG CAACCGTTAT TCGGAACTGA ATCATCTGTT GTAAGAAGGT TCTATAAGAG GATCTCCCTA GCATCAACTA AGAAGAATCG ATAAAACAGG TATTTTCATT CCAGTAATTT TCCTACCTTG ATACCATCTA TATTTACCTA CAACAGTACA AAATGGCCTA CGTATTGACC GAAACTGCTG CCGGATACGC CCTCTTGAAG GCTTCCGATA AGAAGATCCA CAAGTCATCT TCTTTAATCG AAGATTTGAA CACTGCCGAC AAGGTTGCTG AGCAGTTCAA GATCCACCGT TTTGAAAAGT TCCAATCAGC TGCTAATGCC TTAGAAGAGG CTAACGCTGT CATCGAGGGC CGTGTCTCTG ACTCCTTGAA GAAGATGTTG GAAGATGCCA AGTCCGACAA GAAGGCTACT TTGATCGTTT CTGAAGCCAA ATTGGGTAAT GCCATCAACA AGTTGGGCTT GAACTTCTCG GTTGTATCAG ATGCTGCCTC GTTGGACTTG CACAGAGCAA TCAGAGAATT CTTGCCTGAA TTGTTGCCAG GCTTGGATGA CTCCATGTTG AAGCAAATGT CATTGGGTTT GGCCCATTCC ATTGGCCGTC ACAAGTTGAA ATTCTCTGCC GACAAGGTCG ACACTATGAT TGTACAAGCC ATTGCTTTGT TGGACGATTT AGACAAGGAA TTGAACACCT ACGCCATGAG ATGTAAGGAA TGGTATGGCT GGCACTTTCC AGAGTTGGCC AAGATGATTG TCGACTCCGT AGCTTATGCC AGAATCATCT TGACCATGGG TGTTAGATCC AATGCCTCGG AAACTGATTT GTCGGAAATT CTTCCAGAGG AATTGGAAGA ACAAGTCAAA TCTGCCGCTG AAGTTTCTAT GGGTACTGAA ATCACTGCAA TTGATTTGGA AAACATCAGA GCTCTTGCCG AACAGATTGT TGACTTTGCT GCTTACAGAG AGCAATTGTC CAACTACTTG TCTTCGCGTA TGAAGGCTAT CGCACCAAAC TTGACTGCCT TGGTTGGTGA GTTAGTCGGA GCCAGATTGA TTGCCCACGC TGGTTCATTG ACCTCTTTGG CTAAGGCTCC AGCTTCGACC ATCCAAATCT TGGGTGCTGA AAAGGCTTTG TTCAGAGCCT TGAAGACCAA GCACGACACA CCAAAGTACG GTTTGTTGTA CCATGCCTCA TTGGTTGGTC AAGCATCCGG TAAGAACAAG GGTAAGATTG CCAGAGTGTT GGCTGCTAAG GCTGCCGTGG CCTTGAGATA CGACTCTCTT GCTGAAGAAA GAGACGACTC CGGTGACTTT GGTTTGGAAG TCAGAGCCAA GGTAGAATCT AGATTGAGTG CTCTCGAAGG CCGTGACTTG AGAACCACCT CGAAGGTTGT CAGAGAACAA CCAAAGGTTG ACATAACGGA AGCCCGTGCA TACAACGCTG ACGCCGACGC TCCAACTGCT GAAGTTGACT CGGACGACGA ATCGGACACC GAAGAAATCG ACACCAAGAA GGAAAAGAAG GAAAAGAAAG AAAAGAAAGA AAAGAAGGAC AAGAAGAGAA AGAGAGAAGA TGACGACGAA GAAGAAGAAG ACAAGAAGTC CAAGAAGGAG AAGAAGGAGA AGAAGGAAAA GAAGGAGAAG AAGGAAAAGA AGGAAAAGAA AGATAAGAAA GATAAGAAGT CCAAGAAAGA AAAGAAGTAG ACAGCATCAC CAGTACGCTT TCTTCTGTAA CTTTTACTTT ACTTTTTGCA TTTACTACAT TCATGACTTT GATTCTCAAG 
TTCACCAACC AACCATCTTA GAAGTTTACT TGCCTTGGAG CCCATGACTT CTTATTTTGT TCTATAGTAC GATGTTGTCA CCCAGCATTT TTGTGTTTCA TTAATATATA TATACATATA TC
|
Protein sequence | MAYVLTETAA GYALLKASDK KIHKSSSLIE DLNTADKVAE QFKIHRFEKF QSAANALEEA NAVIEGRVSD SLKKMLEDAK SDKKATLIVS EAKLGNAINK LGLNFSVVSD AASLDLHRAI REFLPELLPG LDDSMLKQMS LGLAHSIGRH KLKFSADKVD TMIVQAIALL DDLDKELNTY AMRCKEWYGW HFPELAKMIV DSVAYARIIL TMGVRSNASE TDLSEILPEE LEEQVKSAAE VSMGTEITAI DLENIRALAE QIVDFAAYRE QLSNYLSSRM KAIAPNLTAL VGELVGARLI AHAGSLTSLA KAPASTIQIL GAEKALFRAL KTKHDTPKYG LLYHASLVGQ ASGKNKGKIA RVLAAKAAVA LRYDSLAEER DDSGDFGLEV RAKVESRLSA LEGRDLRTTS KVVREQPKVD ITEARAYNAD ADAPTAEVDS DDESDTEEID TKKEKKEKKE KKEKKDKKRK REDDDEEEED KKSKKEKKEK KEKKEKKEKK EKKDKKDKKS KKEKK
|
| |
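The sequences above are displayed in space-separated blocks of 10 characters, so they need whitespace removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction like the GC content field in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so it does not reproduce the record's 44% figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to group residues into blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; 0.0 for an empty sequence."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


# First two blocks of the gene sequence shown above (illustration only).
example = "TGTTCATATA TTGTTTGTAG"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))  # 20 0.25
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives the GC fraction of the displayed sequence, which can then be compared with the listed value.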