Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_74252 |
Symbol | UTP20 |
ID | 4841101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 275617 |
End bp | 283215 |
Gene Length | 7599 bp |
Protein Length | 2507 aa |
Translation table | 12 |
GC content | 36% |
IMG OID | 640392416 |
Product | U3 snoRNP protein |
Protein accession | XP_001386454 |
Protein GI | 150866754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.117257 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAA CGGCGAAAAC CAAGTCTACG GAGTCTTCAA GAAGACATGC GTTTTCTTCG TTTCGTGAGA GAATTGATTC CATTAAGATT GAGCCCAACT TGAAACTTAC CAAACGGGTA CATGATTATG TGGAAACTTC TCATTTTTTG GCTACACTCG ACCACTGGAA GGAGGTAAAC TTAAGTGGAA ACTTCAGTGA ATTTGTTGAT AAGGTGGAAA TTTACTCACA GACTTTGCCG CAGATCTTGC ACCACCAAAG CTTCATCTTT GATGCCCTTT GCACCCACAT AAAAGTGAAC GATATTAATT CCATTCAGCC ATTATTAGAA CTCGTAGCAC AGTTTATTCA CGATTTAGGA GATGATTTTT TAGTCTACTA TTCTAAGTTT CTACAATTAT TGACGGAAAT CGCACTTGAG ACCATTCCAA ATGACAGTCA AAATCAGCGG AACACTTCGA ATGTACTTGA GTGGACCTTT AATTGTTTAG CATTCGCATT CAAATACTTG TCCAGAAGTT TAACATCGGA TTTACAGCCA ACATTCGAAA TTTTCTTGCC CATTCTCCAA ATGAAAAAGA AAACTTATAT TTCTAGATTT TGTGCAGAGT CACTAGCATT CCTTGTCAGA AAGCTGAAGG CAGACGCATT GACTTCAATT ATGATGTTCT GCTTTTCAAA AATAGATTCA ATGGAAGAGT CTGATTCATA CCCAGAGACT TTGGCTATTT TGTTTGCGGA GTCCATCAAA CACACTAAAG GAACATTTCA TTCGAAAACT AATTTGATAT TGCTGAAGAT CTTGGAAGTT TCTTTCCACA AGTTAAATAT CCAGAAGAAG AGTATTTCAA TACTTTCCGA CATAATTTTG GATTTAATTA ATCATGGTAC TCTGGACACA TGCAAGCAAG TCTACAAGGT GACAATGGAG TTTTTATTAG GGGAGTTAAA AAATGAAAAT TCTATACCTT CTGTATCACA ACTCTTGACT GTCCTCATTT TTGCTGAATC TGGTAAAAAG ATTACTGACT GGATATCGTT CACAGAAATA TTAAATGCTT TGTTGGTATA TTTGAGCAAT TGCCAAGGTT CTACAGAAAA TGATTCACAA TTGGCTGAAC TAACCTCCTA TTTGTTTGTA ATTGTTCTTA GAAACGGGGA CCTTCAAGTT TTAACAAAGT TGCATAGACA AATTTTTATT ACAATGTCAT CGTTTTGCGA TGGTAAATAT TTTTTGGCTT TTGCTGAATC TTGCGTTGAT TTGACAAGAG AAAGAGCGAA GAATTTTGGC CTCCTACATT TCATTCAAAA ATATGTAAAT CAATTACCTG AAGAGAGCAA TGAAATAAAG AAGGTTGCAT ATCTATTGAC CAAATTGAGT AAGAATCACT CTGAACTATT ACATCAGATA GAGCTTCCCC TACATATTCA CAATTACATT CGAAAAGATA TCAAGACACA AGCAGATTTG ATTAGAAATT CCAAGGTTCC CCTTGATTTT TACTGGTTAT TGCAATTGTC CAAATTTGTT AAAGAAGATG CTGACTGGAA CAATGATTTA ATTGATCTTT TGTTGCTCCT TACTGAGAAG TTCGACGTTT TACCTCGAAA GCTTTGTTGT GATTTGATTG CAGCAATTAT CATTTCTTTG TATTCTTCCT CAATTCCAAG ACACGAGAAA CTTGTAAAGG GAATGAAAAC AGTGTTGTCA ATGATCCTGA AAATTAAGGA GTCTTCGGAT TTGATTCGTG CTTTGCAGAC TTATATTCTA CGTGAAAGAG ATCAAGCAAA GTTATTTATA GAGGCCGAAG AAAACTTCTT GATGGAAATA GCTCAGAATT TGAGTTTGCC CAGTAGTCAA TTACGTTGCA TTTCAGCAGA ATTTCTTTTA ACAGTTCTTG ATACATTGGG TCATGAAAAT ACTTCGTATG TATCGCAGAT TAGGATTATT GAGCAAATTC CATTATCAAT TACTACGGGT AGGGATATCA CATTACGAAT TAGGAACTTA GCATCCGATT TTAAGAATGA TCAATCTCCC AGTGAATTTG ATTGTGTTGT TGTTACAAAT TTCCTTTTCG GGATGTTGAG CAACAGATTT CAACCTTGTT GGGATGCAGT TTATGAAACT TTACCTTCGT ATGTGCCAAT TTGTTCGTCT ATACTTTGGG ATATGGCCAG TAAATTTATT CTTACACTGT ATTCTTCGAC TGAAGATACT TACATGGATT TGGGTTACTC TCCACTTGAA GACAACGAAA TTGTCGATTG GCATGCTAGG AACCCTCGGT TGAGAGATAA TTTTGTTAAC TACCATGAGA AGTACTTTCT CTCGTACCAG AATATCACAG AATCATTGTA CGAATTTGCC AAAGAATCAT GGGCAGACAA TTCGTACAAT GAGTTCATGA GATCTCATAC TTTGAAAGCT CTCTCTTCCA TTCCTTCAAT CGCAGAGGCT CATTCACAAA GTTTAGTTGA CTTGGTTTTA AGTCTCGAAG ATATCGACAA TGAAGATGAT CCCTTAGCTA AGAAAGAAAG GTGGCAATGG AAAGATCGTA ATGACCTTCT TGCCTTGTTC GGTAAATTCA AAAATTTGAA AAAGGTTCAC AAAGCAGACA TTTTGTATGA TTACTTTTTG AGGCTCCTAT GTAGTAATCA GTTACAGGTC CAAAAGATGG CCCTAGAAGT CTTGCTTCAT TGGGGAAAGG GCCCAATTAG AAAATACAGG GACAACCTTC AAAATTTGCT AGATGATAAT ATTTTCCGCG ACGAGCTTTC AAATCTTGCT GTCGGCTCGC AGTCTACAAT CGAGGACCAC GAAAGAGAGG CAATAATGCC GTTTGTCTTG AGAATTCTCT TCGGACGTGT CCAGGGGTCT CCAAGAAGCA ATAGCCAAGT CGGACGGAAA TTTGCTATTG TAACAATTCT TCCTAACTTT TCCGATACTC ACATCATTGA TTTTATACGA TTGGGAGCTA ATAGAATAGG ATATGAAAAA TTTTTCAGTG GGAAGCAGTT ACCGAATTTA GATAGATTTC TTGTTAGAAG ATTAACAGGT TTCATAAATC TTTTGAGCGA AATCTACGAT ACTTTGGGAA CAAAGTATGC GTTTGCGTTA GAATCTACCA TAGAGCCATT GATTTATTCA TTGGTAGTCA CCCAACATTA TATTGATACA GGCATTAATG AAGTGAATGA AAGGGCTGCT GGTTTAGGAA AGGCTATAAG AAATATTCGA CTGAATGCCA TGCGATGCTT GGGTAGCTTA TTCAAGATTT TGGATGAAGA ATTTGATTGG GAACCATATG TTGTGATGAT TTATGAGAAT ATCGTGTCCC CAAGAATAGA GAATTTTGTC TCCGAAAATT TGCAGCAGAC ATCCTCGTTG TTGAAAGTCA TCACCTGTTG GATTGAAAAG AAGAATACTA TTCAATTGTT ACTTACGGAT AACTTGGCCC CAGCCTCAGC GGCTATTTCA TTGCTTAGTC ATGAAAAAAC AAAGGAAGGT GTATTGCTAA CTGTACTTGA CTTCGCAAGA AAAGTCTTGA AACAAAAAGG TGTTCAAACA GATGAATACT TTTCCTTGCT TGCACTTGTG GTGTCTACTT TGTTAGACAA TTTGCCTCGC ATAATCGACG GAATCACAGA CCGTGAACTT GGAAGCATTA CGATTGATGT GTTGATACTT TTGGTGGAAG GCAACTATAT TGACGATTAC CAGACGAAAA ATTCCATATT GCAATCATTA ACTAGTGCAA TAGATAAGCC AAGAAGTCAA ATAGAGCTTA AGGACAAGGT TCTTATATTG AGAGTCTTGT CGTCTTTAGT TGACAATTTC GAGTGCTCCT TCGACAATAT CAAGCCATTG TATGAAACTG TTTCCAAACT GTTCAGAGTA TATCCCGAGA AAAACGTTCG TCAAATGCTT GTCTCCGTGT TATTGAGCAT TGGGAATAGG TTTTCAGAAG TACACGCAAT ATCATTGCTT ATTGCAGACT TGAATGCTTT CTCAACAACT AGATTACAGG AACTAGATTT TGAAAAGAGA CTTGAGGCCT TTAGGAAAAT CAATGAAGAA GAGTTTTGTT CTTTTGATCC TAATTCATGG TTACCAATAT TGTATTGTTG TTTATTCTTC ATTAACGATC AGAATGAAAT GGCTATCAGA AGTAATGCTT CCTATTCTTT AAGGCGGTTT ATTGATTGTT TTTCTTCCAA GGATGAAGAG ACATCACAAA ATTACATTGG GATGCTAAAG AACATAATCC TTCCACAATT AAGACTCGGG CTCCGTAAAG ATAACGAAGA TGTTCAAAAT GAATACATTG CTGTCTTGGA ACATCTTGTC TGTTCATCGA AACATTACAA CGAACTTGAT GATATGAGGA TTCTTACTTT TTCTGATGAT GACGAATCTA ACTTTTTCAA AAATGTTAAT CACATACAAC TCCATAGAAG ACAAAGAGCA ATCAGGAGAT TGATAGACTA TAGAAGTGAA CTATCTTCAA ACAGTATTGC TCACTATATT TTGCCAATTG TTGAACGTTA TGCATTGTCA TCGGAAGAAA AGTTCAAAAA TATTGGGCAT GAAGCATTTG AAACTATTAG TATCTTAGTA AGGTCAATCT CATGGAATCA GTACAAGGCG TTGTTCAGAA GGTATATTTC AAATTTGAAA TCAAACAAAG AGGATTTATT AAGAAATCAT GTGCATTTGA TAATTTCTAT TTCTAAAAGT TTAATGTTGT CTCATTTGGA AAGTCAAGAA TCAGAGATTC TGGATCGTGG GCACCCTTCG GATCAAGAAG CCATAGACGC ATACATTCTT CAAGAAGTAT TTCCACCTAT TTCAAAAGTT TTGGTTGTAC GTAATGATGA AACCATTGTC GCTCGTGCTC CTTTGGCAGA AGCTCTTTCC TACTTAGTTA CTTGCATCAC AGAAGAGAGA ATTGCGTCTG AGTTACCAGG AATTTTGACA AGTACTTGTC AAGTTATGCG TTCTCGTTCC GAGGAGCTTA GAGACGCGGT CAGGAAATCT CTTGGAAATA TAGTGAAACT CTTGGGACCA AAATACCTTC GATTTGTATT GAAAGAGTTG AAGGGAGCGC TATCTAGAGG GTCTCAAATT CATGTGTTGA GTTTTACAAC ACATTATCTA TTGGCATCAA TCTCTGATAG TCTATCTCAT GGAGACTTGG ATGACACATT GGATATTGTG GTTGACATTG TTATGGAAGA CATATTTGGA GCGGCTGGCC AAGAAAAGGA TGCTGAAGGT TATAACAGCA AAATGAAAGA AGTAAAGCAT AAAAAATGTT TTGACGTTGC CGAATTATTG TCGGCAAATA TTAATTTGAA ATCATTTGGA ACCTTATTAG CCCCAATTAA ACTTTTGCTT CAAGAAGGCA TATCCTTGAA AACTCAGAAC AAGCTTGATG AATTGTTAAG GAGGTATGCT CTTGGTTTGA ATCACAATGA CGAATCATCA AATAGGGAAA TGCTCTTCTT ATGTCACGAA ATTCATCTGG AATCAGAAAA TAGTCCAAAT CAAAAGGAAG GAAAATTTCT CACTCAATCT GAAAAACATT TCATTGTAAA TTTGAATGCT AAGAAAACAC GGGCTCAGGT AAATAGCACA ATTTATGTGC AAACATTTCA GAGATTTTCA TTGGAATTGC TCAGAACAGC TATTTCTCGT CATGATAACC ACTTGACTGT TCCAAATATG GTTGCTTTTA TTCCTTTGTT GGAACAGGAT CTCAAATCAG AGAGCGAGGG AGTTGTAATC TCATGCTTGA GGATATTGAA CACGGTAGTA AGACTTCCAT TCAATAATCA GGAGGAAGCT ATATTTACAG CGTCTACAAG AAAAGCACTT GCTATCATTA AGGAAAACCC AAGCACTAAT GCAGAAATCT GCCAAGCTGC TCTAAGATTT CTTGCAACTA CAATTAGACA CAAGCCGGAT ATCAAAATAA AAGAATCTGC CATTGCCTAC ACACTCGAAA GAATTCTTCC AGATTTGCAA GAGCCAAATA AACAGGGTTT GGCGTTCAAT TTTTTGAAAG CTGTTGTTTC TCAGCATATT ATGATTCCAG AACTTTACGA TGTAATGGAT AAAGTGTCCA AGATAATGAT TGTAAACCAC TCTAAAGATA TTAGAGATAT GTCAAGAAGC GTATATTTTA TGTTTTTGAT GGAATATGAT CAAGGAAGGG GAAGACTTGA AAGGCAATTC AAATTCTTAG TGGACAACTT ATCGTTTCCA ACTGAGACTG GTCGACAATC CGTTATGGAA TTAATTCACT CGATTGTCAC CAAGGCTGGA CTAGAATTGT TAGATAAATT AGCAACATCC TTCTTTGTTG CCTTAGCAAA TGTATTAGTA TCTGACGAAG AACCCAAATG CCGTGAAATG GCATCTTCGT TGATTGGAAA TATAATTAGA AAATTGGGAG CAGGAAATGT TGACAATATT GAGAAATATT GCTTAGCATG GTCGAAGCAA TCGCAGAATC AATTATTGAA ACGTTGTGGA CTTAACATCT ACAAGATTTA TGTTGCCGAA TTTGGGGTCG AATCTAATCA AGTCTTAAAG CAAACAGCAC TTGATAGCAT CAAATTCGCA ATTGAAGCAG GGAATAGCAG CGACGGTGCT GTTGAGTGGG AATTATTGTA TTCTGCATTA AGTTTGTTTT CTACCCTTAC TTCAAAGTTG AGGGAGTCTA TATTGGATGA AGAATATGAG TCGATCTGGA ACTCAATCAT AAGGGTATTG CTATTCCCTC ATTCATGGGT GAGATTGATA TCTTCGCGTA TTATTGAGAT TTTATTATCA GGACTTGACA CAGTAAAGTT TGAAATTGAC AATTACAAAA TCCAAACCAT CGCATACAGG CAACTACATC AACTTGCAGC TCCTCAAGTT TCAGAAGATT TAGGAAACCA AATTGTTAAG AATTTGGTTT TGATATCAAT GAAGTGGGAA AAGGAAAACA CGAAGTATGA ACACGTACAA ACAAACAACA CCGCTGACCA AAAGTACGAT TATGCTAATG ACTATCTTGT TGCAAGGATA TGCTCAATTT TGCGCCTGGA CATCAATTCC AATGTATCGT TTGAATCAAG GAAAAGTGCA ATAAAATTGA GTGCAATGTT ATTGCAAATT ACGGGTGAAG ATAGATTGAG CCTGGTTTCG GAAAGTTTAT TGCTTGGACT CTTCCAATAT ACTGAACTCG AACCAAAGAA TGGTAACGAG GAATCATTAG TCACACTTTC ATTGGAATGC TTGCAATTGA TTGAAAACAA ACTTGGTGTC ACCCAATATA CTACAATTTT CACGAAAGTC AAACAGAAGG TCAATTCAAG AAGAGTGGAA AGGAAAACGA AGAGAGCACA ATTGGCAGTG AATGCCCCCG ATGTTGCAGC AAGACGAAAG TTGAGGAAGC ACGAAAGAAC AAGGGAGAAG AGAAAGCACG AAAAAGATGA AAACGGATTT TATCACAGTA AGAGAAGAAA ATAATCTACC ATTTATATGA TTATTCAAAT TTTAAATTTA TAGACGTTTA GTTAGGAATT AAATGAAATA GAACTCACT
|
Protein sequence | MAKTAKTKST ESSRRHAFSS FRERIDSIKI EPNLKLTKRV HDYVETSHFL ATLDHWKEVN LSGNFSEFVD KVEIYSQTLP QILHHQSFIF DALCTHIKVN DINSIQPLLE LVAQFIHDLG DDFLVYYSKF LQLLTEIALE TIPNDSQNQR NTSNVLEWTF NCLAFAFKYL SRSLTSDLQP TFEIFLPILQ MKKKTYISRF CAESLAFLVR KSKADALTSI MMFCFSKIDS MEESDSYPET LAILFAESIK HTKGTFHSKT NLILSKILEV SFHKLNIQKK SISILSDIIL DLINHGTSDT CKQVYKVTME FLLGELKNEN SIPSVSQLLT VLIFAESGKK ITDWISFTEI LNALLVYLSN CQGSTENDSQ LAELTSYLFV IVLRNGDLQV LTKLHRQIFI TMSSFCDGKY FLAFAESCVD LTRERAKNFG LLHFIQKYVN QLPEESNEIK KVAYLLTKLS KNHSELLHQI ELPLHIHNYI RKDIKTQADL IRNSKVPLDF YWLLQLSKFV KEDADWNNDL IDLLLLLTEK FDVLPRKLCC DLIAAIIISL YSSSIPRHEK LVKGMKTVLS MISKIKESSD LIRALQTYIL RERDQAKLFI EAEENFLMEI AQNLSLPSSQ LRCISAEFLL TVLDTLGHEN TSYVSQIRII EQIPLSITTG RDITLRIRNL ASDFKNDQSP SEFDCVVVTN FLFGMLSNRF QPCWDAVYET LPSYVPICSS ILWDMASKFI LTSYSSTEDT YMDLGYSPLE DNEIVDWHAR NPRLRDNFVN YHEKYFLSYQ NITESLYEFA KESWADNSYN EFMRSHTLKA LSSIPSIAEA HSQSLVDLVL SLEDIDNEDD PLAKKERWQW KDRNDLLALF GKFKNLKKVH KADILYDYFL RLLCSNQLQV QKMALEVLLH WGKGPIRKYR DNLQNLLDDN IFRDELSNLA VGSQSTIEDH EREAIMPFVL RILFGRVQGS PRSNSQVGRK FAIVTILPNF SDTHIIDFIR LGANRIGYEK FFSGKQLPNL DRFLVRRLTG FINLLSEIYD TLGTKYAFAL ESTIEPLIYS LVVTQHYIDT GINEVNERAA GLGKAIRNIR SNAMRCLGSL FKILDEEFDW EPYVVMIYEN IVSPRIENFV SENLQQTSSL LKVITCWIEK KNTIQLLLTD NLAPASAAIS LLSHEKTKEG VLLTVLDFAR KVLKQKGVQT DEYFSLLALV VSTLLDNLPR IIDGITDREL GSITIDVLIL LVEGNYIDDY QTKNSILQSL TSAIDKPRSQ IELKDKVLIL RVLSSLVDNF ECSFDNIKPL YETVSKSFRV YPEKNVRQML VSVLLSIGNR FSEVHAISLL IADLNAFSTT RLQELDFEKR LEAFRKINEE EFCSFDPNSW LPILYCCLFF INDQNEMAIR SNASYSLRRF IDCFSSKDEE TSQNYIGMLK NIILPQLRLG LRKDNEDVQN EYIAVLEHLV CSSKHYNELD DMRILTFSDD DESNFFKNVN HIQLHRRQRA IRRLIDYRSE LSSNSIAHYI LPIVERYALS SEEKFKNIGH EAFETISILV RSISWNQYKA LFRRYISNLK SNKEDLLRNH VHLIISISKS LMLSHLESQE SEISDRGHPS DQEAIDAYIL QEVFPPISKV LVVRNDETIV ARAPLAEALS YLVTCITEER IASELPGILT STCQVMRSRS EELRDAVRKS LGNIVKLLGP KYLRFVLKEL KGALSRGSQI HVLSFTTHYL LASISDSLSH GDLDDTLDIV VDIVMEDIFG AAGQEKDAEG YNSKMKEVKH KKCFDVAELL SANINLKSFG TLLAPIKLLL QEGISLKTQN KLDELLRRYA LGLNHNDESS NREMLFLCHE IHSESENSPN QKEGKFLTQS EKHFIVNLNA KKTRAQVNST IYVQTFQRFS LELLRTAISR HDNHLTVPNM VAFIPLLEQD LKSESEGVVI SCLRILNTVV RLPFNNQEEA IFTASTRKAL AIIKENPSTN AEICQAALRF LATTIRHKPD IKIKESAIAY TLERILPDLQ EPNKQGLAFN FLKAVVSQHI MIPELYDVMD KVSKIMIVNH SKDIRDMSRS VYFMFLMEYD QGRGRLERQF KFLVDNLSFP TETGRQSVME LIHSIVTKAG LELLDKLATS FFVALANVLV SDEEPKCREM ASSLIGNIIR KLGAGNVDNI EKYCLAWSKQ SQNQLLKRCG LNIYKIYVAE FGVESNQVLK QTALDSIKFA IEAGNSSDGA VEWELLYSAL SLFSTLTSKL RESILDEEYE SIWNSIIRVL LFPHSWVRLI SSRIIEILLS GLDTVKFEID NYKIQTIAYR QLHQLAAPQV SEDLGNQIVK NLVLISMKWE KENTKYEHVQ TNNTADQKYD YANDYLVARI CSILRSDINS NVSFESRKSA IKLSAMLLQI TGEDRLSSVS ESLLLGLFQY TELEPKNGNE ESLVTLSLEC LQLIENKLGV TQYTTIFTKV KQKVNSRRVE RKTKRAQLAV NAPDVAARRK LRKHERTREK RKHEKDENGF YHSKRRK
|
| |