Gene PICST_65683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65683 
SymbolUTP21 
ID4838834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1712278 
End bp1715208 
Gene Length2931 bp 
Protein Length951 aa 
Translation table12 
GC content42% 
IMG OID640390149 
ProductU3 snoRNP protein 
Protein accessionXP_001384636 
Protein GI126136224 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAAC CCGTAGATAA GAAGAGGAAG GTTTTGGATG CAGATAGTTC GATCTCAGTC 
TTGCCTGGCT CAAAAACGGT ACAGAAGCCA AAACCATCCA AGATTTTCAG TCCTTTCAGA
GTTCTAGGAA ATGTTACGAA CGAGGTTCCA TTTGCTGTAG GAACGTTAGG GTCAACTTTT
TACATTGTAA CTTCCGTGGG AAGATCTTTC CAAATATATG ATGCTGCTAC TTTGCATCTT
TTGTTTGTTT CACAATCGCA GACTCCTGCC AAGATTACCT GTTTGGAAGC ACATCACCAT
TATGTTTATG CTGGATTTGG AAACAAGATT GGAATTTTCA AGAGAGGCAG ATTGGAGCAT
ACTTTGGAAT GTACAACCAG TGCTAGTGTG ACACATGTAT TATCTTTTGG TGATTATGTC
ATAGCTGCTG CTTCAGATGG AGAAATTTCT GTCTTTAAGA AGTTACCGGG AGCAAAGTAT
GCAAATTCGT TGTACACCAT CTTGAAAGCC ATTAACGCTG CCATTGAGGG TGAAATCGTC
GGTTTGATTC ATCCACCTAC TTATTTGAAC AAAATTGTGG TTTCTACGAC TTCTGGGTTA
TTCATCTTCA ATGTTCGTAC TGGAAAGCTT CTATTCAGAT CTCCAGCTAG TCAATTCACA
GAAGCGATTT CTTGTATTGA GGCTGCTCCC GTCTTGGATA TCATTGCTGT AGGAACTACT
ACTGGTAGTG TCTATTTGTA TAATTTGAAA AAGGGCAAGA TTTTGGGACA GAAAATCGTC
ACAGCTGCCG AAGACGCTTC GGCCAAAGTC GTTTCGTTAT CTTTCAGAAC TGATGGCTCG
CCTCATTTAG TAGCGGGCTT GAACACAGGA GACTTGTTCT TCTACGACTT GGCCAAGAAA
GCTAGAGTCC ATGTGTTAAG GCATGCTCAT AAAGAAACCC ACGGAGGTAT TTCCAACGCC
AAATTCTTGA ATGGACAGCC TATTGTAGTC AGTAATGGTG GTGACAACCA TTTGAAAGAA
TATGTATTTG ACCCTACATT GTCGACTTCT AACTCATCTA TTGTTTCTCC TCCACGTCAC
TTGAGATCCA GAGGAGGACA TTCTGCCCCA CCCGTAACTA TTGAGTTTCC TGACGAAGAA
AAATCTCACT TTATCTACAG TGCATCAGGT GATAGATCAT TTTGGTCCTT CTCGTTGAGA
AAGGATGCTC AGGCTCAAGA AATGTCTCAG AGACCGCAGA AGCAAAAGAA CGGGAAGAGA
CAGGCTGGTC AGGTTCAGTC CATGAAGGAA AAATTCAACG AAATTATCGC AATCTCATCG
TCTCAGACTC GTGAAGGCGA TTGGGAGAAT ATCTTGACAG CTCACAAGGA TGAGCCTTTT
GCCAGAACTT GGGAATCAAA GAACAAGAGA GTGGGTAGAT TCAATTTGAA CACTATCGAC
AATGGAATGG TCAAATCTGT TTGCATTTCT CATTGTGGTA ACTTCGGTTT GGTAGGTTCT
GCTCAAGGTG GTATTGGAGT GTACAATTTG CAATCAGGAT TGCTTCGTAA GAAGTACGTT
TTGCACAAAA AGGCTGTGAC CGGTTTATCT ATTGATGGCA TGAATAGGAA AATGGTTAGT
TGTGGTTTGG ACGGAATCAT TGGCTTCTAC GATTTTAGTC AGTCCAAGTA CTTGGGAAAG
TTGCAATTGG AAGCTCCTAT TACCAGTATG GTTTATCACA AATCTTCGGA CTTGATTGCC
TGTGCGTTAG ACGACTTGAG TATTGTCATT ATCGATGTCA CCACTCAAAA GGTTGTCAGA
GTCTTGATTG GGCACACTAA CAGAATCACT AGTTTGGATT TCTCTCCAGA TGGTAGATGG
ATTGTGTCTG TTGGTTTGGA TGCAACAATG AGAACTTGGG ATTTGCCTAC TGGTGGATGT
ATCGATGGTG TCAGATTACC TGTTGTTGCA ACGGGGATCA AGTTTTCCCC AATTGGTGAT
GTCTTGGCCA CCACCCATGT TTCTGGAAAT GGGATTTCAT TATGGACTAA TCGTGCCCAG
TTCAGACCCA TTTCTACCAG ACACGTCGAA GAGGAAGATT TTGCAACTAT ATTATTGCCA
ACGGCCTCTG GAGATGGTGG ATCATCCATG TTGGATGGTG CTCTAGAGGG AGATACCGAT
GAGGATGACG TTCTTGCGCA AACATACACC ACTTTGGACC AAATAGACGA ATCGCTTATC
ACCTTGTCCT TGGGAGCCAG AAGCAAATTC AGCAACTTGG TTCACTTGGA TGTCATCAAA
CAAAGAAGCA AACCCAAGGA AGCTCCAAAG AAACCAGAAA ATGCTCCTTT CTTCTTGTCT
TTGTCTGGAC AAGCTGTTGG AGACCAAGCT TCTGTAGCTG AAGGGAAGCC AGGTCAATCT
TCTGCAGACA ATAATGATGA TACAGCCGAG GGTAGATTAC ATAAATTGAA GTCAGACCAA
GGACACAACT TTGAGTCTAA ATTCACTACA TTATTAAGAG AAGGTTCTAG TAACGGTGAT
TACTCAGAGT TTTTGAAGTT CTTGGTTGGT GCTTCTCCTT CTCTTGTCGA TTTGGAAATC
AGATCATTGA ACTCATTCCC TCCCTTGAAT GAAATGGCCA ATTTTGTTGA AGCTCTTAAC
CAGGGTCTCA AGTCTAACAC CAATTTCGAT TTATATCAGG CTTTCTTCTC CATGTATTTA
AAGAGCCACG GTGATGTCAT TCACAACAAT GCAGACGAGC AGAGATTGAA TTCGGCCCTT
GAGCAATGGA GTGAATTGGA TAGACAAAAA GGAGAAAAGT TAGACGAGCT TGTTAAGTAC
TGTTCTGGAG TGATCAGTTT CTTGAGCACT GTTTAGGTAT TATTGCACAT TAAAGATATA
TATGTATTTA TTTCGTATAA TCATAGAGAT GAAAAATAGA CCACGATCTA A
 
Protein sequence
MVEPVDKKRK VLDADSSISV LPGSKTVQKP KPSKIFSPFR VLGNVTNEVP FAVGTLGSTF 
YIVTSVGRSF QIYDAATLHL LFVSQSQTPA KITCLEAHHH YVYAGFGNKI GIFKRGRLEH
TLECTTSASV THVLSFGDYV IAAASDGEIS VFKKLPGAKY ANSLYTILKA INAAIEGEIV
GLIHPPTYLN KIVVSTTSGL FIFNVRTGKL LFRSPASQFT EAISCIEAAP VLDIIAVGTT
TGSVYLYNLK KGKILGQKIV TAAEDASAKV VSLSFRTDGS PHLVAGLNTG DLFFYDLAKK
ARVHVLRHAH KETHGGISNA KFLNGQPIVV SNGGDNHLKE YVFDPTLSTS NSSIVSPPRH
LRSRGGHSAP PVTIEFPDEE KSHFIYSASG DRSFWSFSLR KDAQAQEMSQ RPQKQKNGKR
QAGQVQSMKE KFNEIIAISS SQTREGDWEN ILTAHKDEPF ARTWESKNKR VGRFNLNTID
NGMVKSVCIS HCGNFGLVGS AQGGIGVYNL QSGLLRKKYV LHKKAVTGLS IDGMNRKMVS
CGLDGIIGFY DFSQSKYLGK LQLEAPITSM VYHKSSDLIA CALDDLSIVI IDVTTQKVVR
VLIGHTNRIT SLDFSPDGRW IVSVGLDATM RTWDLPTGGC IDGVRLPVVA TGIKFSPIGD
VLATTHVSGN GISLWTNRAQ FRPISTRHVE EEDFATILLP TASGDGGSSM LDGALEGDTD
EDDVLAQTYT TLDQIDESLI TLSLGARSKF SNLVHLDVIK QRSKPKEAPK KPENAPFFLS
LSGQAVGDQA SVAEGKPGQS SADNNDDTAE GRLHKLKSDQ GHNFESKFTT LLREGSSNGD
YSEFLKFLVG ASPSLVDLEI RSLNSFPPLN EMANFVEALN QGLKSNTNFD LYQAFFSMYL
KSHGDVIHNN ADEQRLNSAL EQWSELDRQK GEKLDELVKY CSGVISFLST V