Gene PICST_74592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74592 
SymbolUSP1 
ID4850846 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp213593 
End bp216445 
Gene Length2853 bp 
Protein Length480 aa 
Translation table 
GC content43% 
IMG OID640392554 
Productuniversal stress protein (USP) family protein possible involvement in nucleo-mitochondrial control of maltose, galactose and raffinose utilization 
Protein accessionXP_001387280 
Protein GI126273687 
COG category[T] Signal transduction mechanisms 
COG ID[COG0589] Universal stress protein UspA and related nucleotide-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAGATCTGAC GAATTATTGT GAAACTTGAG CTCAAAACAC AAGTTCATTT CATTCCGAGA 
AAGGCTCTAA AGAGAATAGG CCCAAATTCG TGATCCAAAG AGATACTATC TACAGAATAG
CGAGATACAA CAGAGAGATT AGATATATCA GTGGTAAAAC ATAGGAACAT ACCAGTAATA
TTTGCTATAG AAAAGTGTCA TATAAGGATA GGAAATTTAT TGCAAAGCCA TAATCATCTT
TCCCAGAGAA ATTATACTGA AGAGTTCCAT TATCGGGTGC CATTACTGAA GCTAGTCAAT
TCGCTAGAGA TAGATTATTA GCATATTAGT GGTAATCGGC TTAAGTAACA CGCCTAGTTC
TAGAATTACG TCGCGGTTCC ACAGTTACCA CGATTTTATT TACGAACCCA AGCCCCTTCT
ACGAGGCATT CCAGGCGTTC CTGCATTCTG CTCCCCACGA ACAGTGACAG TACCAATTCC
GCAAGAACCA AAGTACTAAA TAGAGATGAT GTCACGGCAA TCTCGTTTGG AAGCCGAGAA
GCTTCAAATC ATCCAGAAGT CGATGTCGAA CCGAGGAAGA TCGCTTTCAC CCACGGCTCA
CGGAGCCGGC GGCAATACTA TCAACAATGT CAATTCGCGT TCTTCGTCTA CACTGAACGA
GGAGTTTTCG AAGGACTTGA AATGGTCCAT TACGAACCAC GATCCGCTGG AACGGATTAC
CGTTACACAT GATGATGGAA CTGTAGAGCA GCCAAATAAC GACTTGTGGG CCAACGAGGT
TTCGGACGAA GAACATATCT CCGACGACGA CACTAACGAG AAGTTCCAGT ACGATGACAC
TGGAAATATT CTCCCTAATT ATGCTTGTCA CGACGATAAG ATAAACGAGA TTTCGTCGAT
TTTGGAGAAC TCCAATTTGG ACGATCAGTC GACGATCAAG AAGCTCGAAG AGTTGACAGC
TAATGAAAGA GCCTTTGCCA ATGCCAAGAA CATAGGAAGT CTTATCGATA AGGAAGCACT
CAATAAGCTA GCCAATGAAA AACAGAAATT GACTGGCAAC GAAATGTTGG ACTTGAACCG
GTCAGATCAG GATGCTTTGG AAAGAAAACA GAAGCTTGAG AATTACCAGG CTTACAGAAA
GAAGATTATC GACCACGAGA ATGGTAAGGA TGGCACTGCG AAGGACAGTG CTACTACTTC
CACACTTCTG CCGGTAAAGC TGCCAGAGGG TGAAGCGGTA GCTGAAAATG ACGAAGACTT
TATGATTCCC TATACTTCAG CTGTAGAAGA CAAATTGGAT AGTGAGTTTG CCTCCCAATT
GAACGAAACC ATCAAGGAAG GCGAGATTGA CTCCAACAAG AGCCAGTCCA GAGTGATCCA
GACCATCACG AGAGGAAACT TCTTCCAATT AGTCAATCCC AAGGTCAAGC CGAAGATGTT
TTTGGTGTGT ATGGACTTCT CGCCAGAGTC GATTTTCGCA TTGGAGTGGT GTCTTGGTAC
AGTTTTGGTA GATGGCTCAG TGTTGTTTAT AGTGTACGTC ATTGAAGAGA ACGATAACAA
CCACAACTTG AAGGGCAATA CTAGCAACGA AAATACCCGT GAACAGTACC GGTTGAATAT
GTTAAACAAG GCCAAACAAC AGGTGTTGAA CTTGTTGAAG TTGACCAAAT TGCAAATCCA
TATCGTCATC GAAATCATCC ACCACCCTAT TCCCAGACAC TTGATCTTGG AGTTCATCGA
CAATTTACAG CCTACATTGG TGGTAGTGGG CTCCAAGGGC CAGAGTGCCA TCAAGGGTGT
TCTTTTGGGG TCGCTTTCCA ACTACTTGGT CACCAAGTCG TCTGTTCCGG TGATGGTGGT
GCGTGAGAAG TTGAAGAAGA TCAACAGATT CAAGTCTGGT TCTTCCGTCT TTACCAACAA
CATTAAGCCG TTGACATTGT CGGAAGCCAG AATTGACTGA ACTATGTTAT GTCAGAAATT
TCAGTTCTTC TACTACATTT ACATTACTTT AGGTTACGTT ACTTCATTCC TTAGTTAGAC
AGTTCTATCG AAGTATTATT TGTGTTACAT AAGACCTTTC CTGTAAGTAA ACCCTGTATT
CCCATTATCA TAAACCGCGT AGAGATGTAC TAAAGAAATC ATCAAATACA ACTTCTTCAA
TGCTCAATAG TACAACATCT TCTTTACTGC TTGTCCGCTG CTTCAACTAT TCTCAAAAGT
ACAACTCTGT AAATCCTTTG GCTGTGAACT TTCTTCCACA GTCCCCAGCT CCACACTTAG
TAGGAGAAGA AAGTTGAGGA TTATCAATTG TGAAACCAGA AGGAGTCGCT TTTGTTCTTC
GGGCTCTGTT TCCGATATTC TTCACACATC GTCCACAAAA CGTATGCCCA CACTTGGCGA
TAAAGATTCG TTTCGAGAGT TCCTTATCAA CCTTGGAACA CTGCTTCACA CAAAACCACG
GTGCTTGCAC CCGGTATTGT TTGGCTAACT TCTCGAAATG AAGATCGTAC CGGTGATCTG
ATTTGAAATC TTCAGGAATG CCTTCTCCTA AGATAATACC ACATAAACTA CAGCAGAGGT
TTGACTCAGG CTTGATATCG TTTACATACC CAGCGAGTTC TTCCTTGGAG ACTTTCTTCT
TCTCCTGCAA TGTCTTACGA TTGTAGATAT TTTCGTTAGC GAGTCTGGCA TCAAGTGCGT
TTTCGTTATC TCTCTCTATT CGCTGCATTA TAGACTGTTC AGCCTCCTCC AAATCAAATC
TACTTCCAGG GAAGTCATAC TGAGTAATTC CTCTAAGCGA AAACGGCACA GTACGCAAAA
TATCCATAAA CCCCACCTGC CCATCGGTTC GGC
 
Protein sequence
MMSRQSRLEA EKLQIIQKSM SNRGRSLSPT AHGAGGNTIN NVNSRSSSTL NEEFSKDLKW 
SITNHDPLER ITVTHDDGTV EQPNNDLWAN EVSDEEHISD DDTNEKFQYD DTGNILPNYA
CHDDKINEIS SILENSNLDD QSTIKKLEEL TANERAFANA KNIGSLIDKE ALNKLANEKQ
KLTGNEMLDL NRSDQDALER KQKLENYQAY RKKIIDHENG KDGTAKDSAT TSTLLPVKLP
EAENDEDFMI PYTSAVEDKL DSEFASQLNE TIKEGEIDSN KSQSRVIQTI TRGNFFQLVN
PKVKPKMFLV CMDFSPESIF ALEWCLGTVL VDGSVLFIVY VIEENDNNHN LKGNTSNENT
REQYRLNMLN KAKQQVLNLL KLTKLQIHIV IEIIHHPIPR HLILEFIDNL QPTLVVVGSK
GQSAIKGVLL GSLSNYLVTK SSVPVMVVRE KLKKINRFKS GSSVFTNNIK PLTLSEARID