Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_68431 |
Symbol | |
ID | 4840839 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 146682 |
End bp | 148822 |
Gene Length | 2141 bp |
Protein Length | 481 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392154 |
Product | predicted protein |
Protein accession | XP_001386627 |
Protein GI | 150866885 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.326918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GAAGATCAAA TCAAATCGTG TTGGAAACGG ATCCCATCGA AAGAAACTAC AGACCGCGGT TGATTGATTG ATTTGTATTT AACCACTACA TAGTTTCTTG TTTAGTTGTA TACATTTCCA TACAATATTT CAATAAACTG ATCTATATAA CACTTGCACC ACAAGTTGTA CTCCGTATCT ATAGCTAGAT AGATAACAAC GCCATAAAGC GAAGGAAAAC GAAACTAAAC CAGCAAAGAC AAAACTTCTA TCTTTTTCTA TTCTAATCCA TTAAAAAAAT GTCCAGCTAC GAAGACGACG AAGCTCTTTT CGAAGACATC TAGTATGTAT TAATTTGATT TTTAGGCAAA TTTAATTTGT GCATTTTATT CGGTAGAAGC GAATTACGAG ATTCCACACC CAGCATATGA GCAGGAAGGT CTAGAAGGCA CAATCAGATA TCCGGCTACT GGCGGGGATT GCCGCCAGCT CAAGATCCAC AAGTTAAAAT TGCCAACGGA GTTTAGAAAA ATTCACATGC TAACTTACGA ATCTATAGTG ACGATGATAA TGAAGTCGCA AAGGAAAAGT CGACAGAGAA AAACAACGAA AACAAAGACG CTGAGACTGA GGCTGAAAAA CAGGATCAGA AGCCTGAGGC CGAAACTTCT GCTGCTTCCG TCCCTGAAGG TCTGACTTCT TCTTCCAATC AAGCCGCTGC TACAGCCACT ACATCTACTT CTGTGTCTGA AGATCAGCAG CAACCAAATT TGCAACCTCC TGCAGCTGCT TCTCAACAAG CAGACCAATT GCAGCAGACT CAACAGAACC AACCGATTGG CCAGCACCCT TTCAATTCTT CTCTCTCTTC CTATACTGGC CAACCAGCAT ATCCTGGCGC TGATCCCAAC CTCCACCAAC AGTATCCACC TGTTCCTCCT CCTCCACAGC CTCCTGTTCA ACACCAGCCT TCTAGCATGG GCAGGGAAAC AGGAAAAATG TTCATTGGTG GCCTCAATTG GGACACCACA GAGGAAGGCT TGGTTCTGTA CTTTTCCAAG TTTGGAGAAG TTGTAGACTA CACCATCATG AAAGACAACA ACACCGGAAA ATCACGTGGG TTTGGTTTCT TGACCTTCAG AGACCCAAAG TCTGTAGACG AAGTCATCAA GACTGACCAC ATCTTGGACG GGAAGTTAAT TGATCCCAAG AGAGCCATTG CCAGAGAAGA GCAGGACAAG GTAGGCAAGA TTTTTGTAGG TGGTATCGAT CCAATGGTCA ACGAAAAGGA GTTCCATGAC TTCTTTTCGC AATTCGGAAG TATCATCGAC GCCCAGTTGA TGATTGATAA AGATTCCGGT AGATCCAGAG GGTTTGGATT CATTACGTTT GACTCGCCTG ATGCCGTGGA CAGAGTCACT GTCAACAAGT TTTTGACGTT GAAGGGCAAG GCCATGGAGG TCAAGAGAGC GGAACCTAGA GGCCAACATC AGCAAAACCA AATGCAACAG CAGCAACAGC AGCAGCAGTA CAATTACAAC TACGGTAACC AATACGGTCA AGCTTATCCT CAAATGGCTT ATGGTCAACA GCAGATGGCC CCTGAAATGG TCGAATATTG GCAAAGAATG CAACAGTGGT TTATGTTCCA GCAACAGGCC CAGGGTGCTG GTGGTGAACA AAAAGATGTG GAACAACCTG GCCAACCATT GAACCCACAA CAACAAGCCA ACGAACCTGA AGGTGATCCA GAGCAAAATT CCGACAGAAA CAATGAAAAT GCTAATGATG ACTACAGAGG TGGCTATGAC AACGGCCAAA GGGACCAAAG GAGAATGAAC TTGCCAAAGG GCCCAAGAAG GGCTCCACCT TCCGGTCCTT CTGGCAGCAG AGGAGGAAGA GGAGGGTACC ACAAGAGAAG TAGAGGCTAC CATCCTTACA ACAGAGGTGG TAGAAGATAG TTTATACACC GCACAGAAAG TAAAATACAG GCACATCACG GTTATATTTT TATTAAGATT ACACCTATTT ATCCGTTAAG TTATCGTTAT TTAGTAATAC TATTGCTATT AATGTTGATT TCAAGTTGCA ATGTCATCTA TTTATCTATT TTTTCATGTC ACCGTTATAT AGTACAGCCA TTATCAAAAA TACAATCCTA A
|
Protein sequence | MSSYEDDEAL FEDIYDDDNE VAKEKSTEKN NENKDAETEA EKQDQKPEAE TSAASVPEGS TSSSNQAAAT ATTSTSVSED QQQPNLQPPA AASQQADQLQ QTQQNQPIGQ HPFNSSLSSY TGQPAYPGAD PNLHQQYPPV PPPPQPPVQH QPSSMGRETG KMFIGGLNWD TTEEGLVSYF SKFGEVVDYT IMKDNNTGKS RGFGFLTFRD PKSVDEVIKT DHILDGKLID PKRAIAREEQ DKVGKIFVGG IDPMVNEKEF HDFFSQFGSI IDAQLMIDKD SGRSRGFGFI TFDSPDAVDR VTVNKFLTLK GKAMEVKRAE PRGQHQQNQM QQQQQQQQYN YNYGNQYGQA YPQMAYGQQQ MAPEMVEYWQ RMQQWFMFQQ QAQGAGGEQK DVEQPGQPLN PQQQANEPEG DPEQNSDRNN ENANDDYRGG YDNGQRDQRR MNLPKGPRRA PPSGPSGSRG GRGGYHKRSR GYHPYNRGGR R
|
| |