Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65683 |
Symbol | UTP21 |
ID | 4838834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1712278 |
End bp | 1715208 |
Gene Length | 2931 bp |
Protein Length | 951 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390149 |
Product | U3 snoRNP protein |
Protein accession | XP_001384636 |
Protein GI | 126136224 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGAAC CCGTAGATAA GAAGAGGAAG GTTTTGGATG CAGATAGTTC GATCTCAGTC TTGCCTGGCT CAAAAACGGT ACAGAAGCCA AAACCATCCA AGATTTTCAG TCCTTTCAGA GTTCTAGGAA ATGTTACGAA CGAGGTTCCA TTTGCTGTAG GAACGTTAGG GTCAACTTTT TACATTGTAA CTTCCGTGGG AAGATCTTTC CAAATATATG ATGCTGCTAC TTTGCATCTT TTGTTTGTTT CACAATCGCA GACTCCTGCC AAGATTACCT GTTTGGAAGC ACATCACCAT TATGTTTATG CTGGATTTGG AAACAAGATT GGAATTTTCA AGAGAGGCAG ATTGGAGCAT ACTTTGGAAT GTACAACCAG TGCTAGTGTG ACACATGTAT TATCTTTTGG TGATTATGTC ATAGCTGCTG CTTCAGATGG AGAAATTTCT GTCTTTAAGA AGTTACCGGG AGCAAAGTAT GCAAATTCGT TGTACACCAT CTTGAAAGCC ATTAACGCTG CCATTGAGGG TGAAATCGTC GGTTTGATTC ATCCACCTAC TTATTTGAAC AAAATTGTGG TTTCTACGAC TTCTGGGTTA TTCATCTTCA ATGTTCGTAC TGGAAAGCTT CTATTCAGAT CTCCAGCTAG TCAATTCACA GAAGCGATTT CTTGTATTGA GGCTGCTCCC GTCTTGGATA TCATTGCTGT AGGAACTACT ACTGGTAGTG TCTATTTGTA TAATTTGAAA AAGGGCAAGA TTTTGGGACA GAAAATCGTC ACAGCTGCCG AAGACGCTTC GGCCAAAGTC GTTTCGTTAT CTTTCAGAAC TGATGGCTCG CCTCATTTAG TAGCGGGCTT GAACACAGGA GACTTGTTCT TCTACGACTT GGCCAAGAAA GCTAGAGTCC ATGTGTTAAG GCATGCTCAT AAAGAAACCC ACGGAGGTAT TTCCAACGCC AAATTCTTGA ATGGACAGCC TATTGTAGTC AGTAATGGTG GTGACAACCA TTTGAAAGAA TATGTATTTG ACCCTACATT GTCGACTTCT AACTCATCTA TTGTTTCTCC TCCACGTCAC TTGAGATCCA GAGGAGGACA TTCTGCCCCA CCCGTAACTA TTGAGTTTCC TGACGAAGAA AAATCTCACT TTATCTACAG TGCATCAGGT GATAGATCAT TTTGGTCCTT CTCGTTGAGA AAGGATGCTC AGGCTCAAGA AATGTCTCAG AGACCGCAGA AGCAAAAGAA CGGGAAGAGA CAGGCTGGTC AGGTTCAGTC CATGAAGGAA AAATTCAACG AAATTATCGC AATCTCATCG TCTCAGACTC GTGAAGGCGA TTGGGAGAAT ATCTTGACAG CTCACAAGGA TGAGCCTTTT GCCAGAACTT GGGAATCAAA GAACAAGAGA GTGGGTAGAT TCAATTTGAA CACTATCGAC AATGGAATGG TCAAATCTGT TTGCATTTCT CATTGTGGTA ACTTCGGTTT GGTAGGTTCT GCTCAAGGTG GTATTGGAGT GTACAATTTG CAATCAGGAT TGCTTCGTAA GAAGTACGTT TTGCACAAAA AGGCTGTGAC CGGTTTATCT ATTGATGGCA TGAATAGGAA AATGGTTAGT TGTGGTTTGG ACGGAATCAT TGGCTTCTAC GATTTTAGTC AGTCCAAGTA CTTGGGAAAG TTGCAATTGG AAGCTCCTAT TACCAGTATG GTTTATCACA AATCTTCGGA CTTGATTGCC TGTGCGTTAG ACGACTTGAG TATTGTCATT ATCGATGTCA CCACTCAAAA GGTTGTCAGA GTCTTGATTG GGCACACTAA CAGAATCACT AGTTTGGATT TCTCTCCAGA TGGTAGATGG ATTGTGTCTG TTGGTTTGGA TGCAACAATG AGAACTTGGG ATTTGCCTAC TGGTGGATGT ATCGATGGTG TCAGATTACC TGTTGTTGCA ACGGGGATCA AGTTTTCCCC AATTGGTGAT GTCTTGGCCA CCACCCATGT TTCTGGAAAT GGGATTTCAT TATGGACTAA TCGTGCCCAG TTCAGACCCA TTTCTACCAG ACACGTCGAA GAGGAAGATT TTGCAACTAT ATTATTGCCA ACGGCCTCTG GAGATGGTGG ATCATCCATG TTGGATGGTG CTCTAGAGGG AGATACCGAT GAGGATGACG TTCTTGCGCA AACATACACC ACTTTGGACC AAATAGACGA ATCGCTTATC ACCTTGTCCT TGGGAGCCAG AAGCAAATTC AGCAACTTGG TTCACTTGGA TGTCATCAAA CAAAGAAGCA AACCCAAGGA AGCTCCAAAG AAACCAGAAA ATGCTCCTTT CTTCTTGTCT TTGTCTGGAC AAGCTGTTGG AGACCAAGCT TCTGTAGCTG AAGGGAAGCC AGGTCAATCT TCTGCAGACA ATAATGATGA TACAGCCGAG GGTAGATTAC ATAAATTGAA GTCAGACCAA GGACACAACT TTGAGTCTAA ATTCACTACA TTATTAAGAG AAGGTTCTAG TAACGGTGAT TACTCAGAGT TTTTGAAGTT CTTGGTTGGT GCTTCTCCTT CTCTTGTCGA TTTGGAAATC AGATCATTGA ACTCATTCCC TCCCTTGAAT GAAATGGCCA ATTTTGTTGA AGCTCTTAAC CAGGGTCTCA AGTCTAACAC CAATTTCGAT TTATATCAGG CTTTCTTCTC CATGTATTTA AAGAGCCACG GTGATGTCAT TCACAACAAT GCAGACGAGC AGAGATTGAA TTCGGCCCTT GAGCAATGGA GTGAATTGGA TAGACAAAAA GGAGAAAAGT TAGACGAGCT TGTTAAGTAC TGTTCTGGAG TGATCAGTTT CTTGAGCACT GTTTAGGTAT TATTGCACAT TAAAGATATA TATGTATTTA TTTCGTATAA TCATAGAGAT GAAAAATAGA CCACGATCTA A
|
Protein sequence | MVEPVDKKRK VLDADSSISV LPGSKTVQKP KPSKIFSPFR VLGNVTNEVP FAVGTLGSTF YIVTSVGRSF QIYDAATLHL LFVSQSQTPA KITCLEAHHH YVYAGFGNKI GIFKRGRLEH TLECTTSASV THVLSFGDYV IAAASDGEIS VFKKLPGAKY ANSLYTILKA INAAIEGEIV GLIHPPTYLN KIVVSTTSGL FIFNVRTGKL LFRSPASQFT EAISCIEAAP VLDIIAVGTT TGSVYLYNLK KGKILGQKIV TAAEDASAKV VSLSFRTDGS PHLVAGLNTG DLFFYDLAKK ARVHVLRHAH KETHGGISNA KFLNGQPIVV SNGGDNHLKE YVFDPTLSTS NSSIVSPPRH LRSRGGHSAP PVTIEFPDEE KSHFIYSASG DRSFWSFSLR KDAQAQEMSQ RPQKQKNGKR QAGQVQSMKE KFNEIIAISS SQTREGDWEN ILTAHKDEPF ARTWESKNKR VGRFNLNTID NGMVKSVCIS HCGNFGLVGS AQGGIGVYNL QSGLLRKKYV LHKKAVTGLS IDGMNRKMVS CGLDGIIGFY DFSQSKYLGK LQLEAPITSM VYHKSSDLIA CALDDLSIVI IDVTTQKVVR VLIGHTNRIT SLDFSPDGRW IVSVGLDATM RTWDLPTGGC IDGVRLPVVA TGIKFSPIGD VLATTHVSGN GISLWTNRAQ FRPISTRHVE EEDFATILLP TASGDGGSSM LDGALEGDTD EDDVLAQTYT TLDQIDESLI TLSLGARSKF SNLVHLDVIK QRSKPKEAPK KPENAPFFLS LSGQAVGDQA SVAEGKPGQS SADNNDDTAE GRLHKLKSDQ GHNFESKFTT LLREGSSNGD YSEFLKFLVG ASPSLVDLEI RSLNSFPPLN EMANFVEALN QGLKSNTNFD LYQAFFSMYL KSHGDVIHNN ADEQRLNSAL EQWSELDRQK GEKLDELVKY CSGVISFLST V
|
| |