Gene PICST_70712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_70712 
SymbolUTP6 
ID4836711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2458463 
End bp2460498 
Gene Length2036 bp 
Protein Length434 aa 
Translation table12 
GC content40% 
IMG OID640388026 
ProductU3 snoRNP protein 
Protein accessionXP_001382698 
Protein GI150864022 
COG category[R] General function prediction only 
COG ID[COG5191] Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.555676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGA AAGTGAGATA CTATTTGGAA CAGTCTGTTC CCGAGTTAGA GGACTTGAAG 
AGCAAGGGCC TTTTTGAAAA AAATGAAATC ACCATGATTA TGAGAAGAAG AACCGACTTC
GAGCACAGAA TCCAAGGCAG GGGAAGTAAG CCTAGAGATT TCACCAAATA CGCAGAATTT
GAAGTTAACT TGGAGAAATT GAGAAAGAAG AGATACAGCA GATTATCAAA AGTAGGGGTT
ATTGACACCA AGCCCAGCAT TAGTGACTGG GCTGGAACTA GAAGAATCAT GTTCATTTTT
GAGAGAGCTA CTAGAAGATA TCCAGGTGAC TTGGACTTAT GGTCGCAATA CTTGAAGTTT
GCCAAATCAA ATGGTGCTAT AAAGGTTATT TATAAGGTTT ATTCTAGACT TTTGCAATTA
CAACCCCGTA ATATAGATGC TTGGTTATCT GCGGCAAAGT ATGAGTTCGA AACCAATGCC
AACGCAAAGG GAGCAAGAAT GCTTTTCCAA AGAGGGTTAA GATTGAATCC AGAATCGTTG
GAATTGTGGT TGAGCTATGC CCAATTCGAG TTGACGTATA TATCGAGATT ACTTGCCAGA
AGAAAAGTTT TGGGTCTCAT AACTGAAAAG CAACAGCTGG ACGAAATGAC TAGCCAGCAA
GAAAGATTAG CCAAGTCTAT TGCAGACTCA GCCATAGGTG ATGACGACAA AGACTTTAAC
GATGATAAGA TCGAATTGCC TTCAACAGAA GAAATGAAAG AACAATTGCA TCACTTACCC
GAAGCAGACA TGAACATGTT GGGTAATCCC GAAACCAATC CAGCCTTGAG AGGAGACGTA
GCGTTAACAG TTTTTGACTT GTGTGTTCCT GCAATTATCA AGAGCATTCC TGAATTTTCT
ACTGTAGTCA ATGCTCAAGA CAAGACTTTC GAGGTTGTTG ACCATTTCTT ATCCATGATC
GACTTGTTCG AGGACCTCAA CAGAGACCAC TTATACTTAC ACATCTTGAA CTTTTTACAA
AGTAATTATC CTCATGATTT ACGTACAAGT TTGATCGACA TATGTTTGCC TATCAGAAAT
GTTAAACGGA CCAGCCCTCA GTTGTCCGAG TTGTTACAAT TGGCCGTTAA TAAATTCATT
GCATACAAGT CTAAGTTGAG AGACTTACAA GAGAAGGATA CATTAACTAA CTTATTCGTT
AACCAGCTTA CAAACCAGTT CTTGAGCACT CACGAAGTCG CTGATGGTTC TGAAAAGATA
GACACTTTAT TAAGAGCCAT TATTAAGAAG TGCCGCACCA TCTAGAATAG ATAGACATTA
ATACATAAAT ATAGAACTAT AGACTGTGGT GGAACTAGAA CTACAACGAC TAATAGAAGT
GGGACTCATA AACTAGTCCA ATTTGTTTTC GCTGTCCAAA TTGAACTTTC CTAAGTTCTT
GACACGACGA GAGATGGCAG TATAACTCAA GGTAAACACC ACCAATCCAG AGAGTACGGC
TGTAAGCACC AAATAGAAGG TAAACCACTG GTAACTTATA GGAACAAAAG CCGCTGGGAA
TCTGTAGAAG CCATCTGCAT AAGGCAAATT CTCAGTCTTT CCGGCTATCG AGTTTACAGG
AACAATCTGA TCAAGCCATC TGACAGCAAA ATTGAGAGTG AGCAGGTCAC CAATGACGAT
TCTAGTGACA TTAGTTCCTG AAGAATAAGC CACGGCACTG ATGAAAGTGT TTTCTTTATG
GTTGCGGTTA GCTGGCTTCG TCAAAAGGAC ATTTAAAGAC GAATCGATAT CCAAATGAGA
GTCGTGTATT TCTCCTCTGA AATTGAAAGT CAAAGGAACA TAAGCTCCCC CCTTTTGCAA
GTAGTTGGCA TCAGCACTAA CAGGTAATCT ACAGCCAAAT GGCACGTCTT CGTGGATGTA
CAATTGGAAC AAATGGTATA AGTTGTCTGT GAATTCGATA GTACAGTTGA AATCAGCATT
GACTCCATAT TCAAAGCTCA AAGGTCTGTT GGTCTCTTTA CAAGTAGGAA AGGGAC
 
Protein sequence
MAEKVRYYLE QSVPELEDLK SKGLFEKNEI TMIMRRRTDF EHRIQGRGSK PRDFTKYAEF 
EVNLEKLRKK RYSRLSKVGV IDTKPSISDW AGTRRIMFIF ERATRRYPGD LDLWSQYLKF
AKSNGAIKVI YKVYSRLLQL QPRNIDAWLS AAKYEFETNA NAKGARMLFQ RGLRLNPESL
ELWLSYAQFE LTYISRLLAR RKVLGLITEK QQSDEMTSQQ ERLAKSIADS AIGDDDKDFN
DDKIELPSTE EMKEQLHHLP EADMNMLGNP ETNPALRGDV ALTVFDLCVP AIIKSIPEFS
TVVNAQDKTF EVVDHFLSMI DLFEDLNRDH LYLHILNFLQ SNYPHDLRTS LIDICLPIRN
VKRTSPQLSE LLQLAVNKFI AYKSKLRDLQ EKDTLTNLFV NQLTNQFLST HEVADGSEKI
DTLLRAIIKK CRTI