Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_74592 |
Symbol | USP1 |
ID | 4850846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 213593 |
End bp | 216445 |
Gene Length | 2853 bp |
Protein Length | 480 aa |
Translation table | |
GC content | 43% |
IMG OID | 640392554 |
Product | universal stress protein (USP) family protein possible involvement in nucleo-mitochondrial control of maltose, galactose and raffinose utilization |
Protein accession | XP_001387280 |
Protein GI | 126273687 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGATCTGAC GAATTATTGT GAAACTTGAG CTCAAAACAC AAGTTCATTT CATTCCGAGA AAGGCTCTAA AGAGAATAGG CCCAAATTCG TGATCCAAAG AGATACTATC TACAGAATAG CGAGATACAA CAGAGAGATT AGATATATCA GTGGTAAAAC ATAGGAACAT ACCAGTAATA TTTGCTATAG AAAAGTGTCA TATAAGGATA GGAAATTTAT TGCAAAGCCA TAATCATCTT TCCCAGAGAA ATTATACTGA AGAGTTCCAT TATCGGGTGC CATTACTGAA GCTAGTCAAT TCGCTAGAGA TAGATTATTA GCATATTAGT GGTAATCGGC TTAAGTAACA CGCCTAGTTC TAGAATTACG TCGCGGTTCC ACAGTTACCA CGATTTTATT TACGAACCCA AGCCCCTTCT ACGAGGCATT CCAGGCGTTC CTGCATTCTG CTCCCCACGA ACAGTGACAG TACCAATTCC GCAAGAACCA AAGTACTAAA TAGAGATGAT GTCACGGCAA TCTCGTTTGG AAGCCGAGAA GCTTCAAATC ATCCAGAAGT CGATGTCGAA CCGAGGAAGA TCGCTTTCAC CCACGGCTCA CGGAGCCGGC GGCAATACTA TCAACAATGT CAATTCGCGT TCTTCGTCTA CACTGAACGA GGAGTTTTCG AAGGACTTGA AATGGTCCAT TACGAACCAC GATCCGCTGG AACGGATTAC CGTTACACAT GATGATGGAA CTGTAGAGCA GCCAAATAAC GACTTGTGGG CCAACGAGGT TTCGGACGAA GAACATATCT CCGACGACGA CACTAACGAG AAGTTCCAGT ACGATGACAC TGGAAATATT CTCCCTAATT ATGCTTGTCA CGACGATAAG ATAAACGAGA TTTCGTCGAT TTTGGAGAAC TCCAATTTGG ACGATCAGTC GACGATCAAG AAGCTCGAAG AGTTGACAGC TAATGAAAGA GCCTTTGCCA ATGCCAAGAA CATAGGAAGT CTTATCGATA AGGAAGCACT CAATAAGCTA GCCAATGAAA AACAGAAATT GACTGGCAAC GAAATGTTGG ACTTGAACCG GTCAGATCAG GATGCTTTGG AAAGAAAACA GAAGCTTGAG AATTACCAGG CTTACAGAAA GAAGATTATC GACCACGAGA ATGGTAAGGA TGGCACTGCG AAGGACAGTG CTACTACTTC CACACTTCTG CCGGTAAAGC TGCCAGAGGG TGAAGCGGTA GCTGAAAATG ACGAAGACTT TATGATTCCC TATACTTCAG CTGTAGAAGA CAAATTGGAT AGTGAGTTTG CCTCCCAATT GAACGAAACC ATCAAGGAAG GCGAGATTGA CTCCAACAAG AGCCAGTCCA GAGTGATCCA GACCATCACG AGAGGAAACT TCTTCCAATT AGTCAATCCC AAGGTCAAGC CGAAGATGTT TTTGGTGTGT ATGGACTTCT CGCCAGAGTC GATTTTCGCA TTGGAGTGGT GTCTTGGTAC AGTTTTGGTA GATGGCTCAG TGTTGTTTAT AGTGTACGTC ATTGAAGAGA ACGATAACAA CCACAACTTG AAGGGCAATA CTAGCAACGA AAATACCCGT GAACAGTACC GGTTGAATAT GTTAAACAAG GCCAAACAAC AGGTGTTGAA CTTGTTGAAG TTGACCAAAT TGCAAATCCA TATCGTCATC GAAATCATCC ACCACCCTAT TCCCAGACAC TTGATCTTGG AGTTCATCGA CAATTTACAG CCTACATTGG TGGTAGTGGG CTCCAAGGGC CAGAGTGCCA TCAAGGGTGT TCTTTTGGGG TCGCTTTCCA ACTACTTGGT CACCAAGTCG TCTGTTCCGG TGATGGTGGT GCGTGAGAAG TTGAAGAAGA TCAACAGATT CAAGTCTGGT TCTTCCGTCT TTACCAACAA CATTAAGCCG TTGACATTGT CGGAAGCCAG AATTGACTGA ACTATGTTAT GTCAGAAATT TCAGTTCTTC TACTACATTT ACATTACTTT AGGTTACGTT ACTTCATTCC TTAGTTAGAC AGTTCTATCG AAGTATTATT TGTGTTACAT AAGACCTTTC CTGTAAGTAA ACCCTGTATT CCCATTATCA TAAACCGCGT AGAGATGTAC TAAAGAAATC ATCAAATACA ACTTCTTCAA TGCTCAATAG TACAACATCT TCTTTACTGC TTGTCCGCTG CTTCAACTAT TCTCAAAAGT ACAACTCTGT AAATCCTTTG GCTGTGAACT TTCTTCCACA GTCCCCAGCT CCACACTTAG TAGGAGAAGA AAGTTGAGGA TTATCAATTG TGAAACCAGA AGGAGTCGCT TTTGTTCTTC GGGCTCTGTT TCCGATATTC TTCACACATC GTCCACAAAA CGTATGCCCA CACTTGGCGA TAAAGATTCG TTTCGAGAGT TCCTTATCAA CCTTGGAACA CTGCTTCACA CAAAACCACG GTGCTTGCAC CCGGTATTGT TTGGCTAACT TCTCGAAATG AAGATCGTAC CGGTGATCTG ATTTGAAATC TTCAGGAATG CCTTCTCCTA AGATAATACC ACATAAACTA CAGCAGAGGT TTGACTCAGG CTTGATATCG TTTACATACC CAGCGAGTTC TTCCTTGGAG ACTTTCTTCT TCTCCTGCAA TGTCTTACGA TTGTAGATAT TTTCGTTAGC GAGTCTGGCA TCAAGTGCGT TTTCGTTATC TCTCTCTATT CGCTGCATTA TAGACTGTTC AGCCTCCTCC AAATCAAATC TACTTCCAGG GAAGTCATAC TGAGTAATTC CTCTAAGCGA AAACGGCACA GTACGCAAAA TATCCATAAA CCCCACCTGC CCATCGGTTC GGC
|
Protein sequence | MMSRQSRLEA EKLQIIQKSM SNRGRSLSPT AHGAGGNTIN NVNSRSSSTL NEEFSKDLKW SITNHDPLER ITVTHDDGTV EQPNNDLWAN EVSDEEHISD DDTNEKFQYD DTGNILPNYA CHDDKINEIS SILENSNLDD QSTIKKLEEL TANERAFANA KNIGSLIDKE ALNKLANEKQ KLTGNEMLDL NRSDQDALER KQKLENYQAY RKKIIDHENG KDGTAKDSAT TSTLLPVKLP EAENDEDFMI PYTSAVEDKL DSEFASQLNE TIKEGEIDSN KSQSRVIQTI TRGNFFQLVN PKVKPKMFLV CMDFSPESIF ALEWCLGTVL VDGSVLFIVY VIEENDNNHN LKGNTSNENT REQYRLNMLN KAKQQVLNLL KLTKLQIHIV IEIIHHPIPR HLILEFIDNL QPTLVVVGSK GQSAIKGVLL GSLSNYLVTK SSVPVMVVRE KLKKINRFKS GSSVFTNNIK PLTLSEARID
|
| |