Gene Information |
Locus tag | PICST_69820 |
Symbol | |
ID | 4837492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2570599 |
End bp | 2575017 |
Gene Length | 4419 bp |
Protein Length | 1340 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388807 |
Product | predicted protein |
Protein accession | XP_001383247 |
Protein GI | 150864435 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5560] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
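
The coordinate rows above agree with each other if the span is counted inclusively: 2575017 - 2570599 + 1 = 4419 bp. The sketch below shows that arithmetic and how a minus-strand gene could be pulled out of a replicon sequence. It assumes 1-based, inclusive, forward-strand coordinates, which the record does not state outright. The helper names `span_length` and `extract_gene` are hypothetical, and the replicon string in the usage lines is a toy example, not NC_009042.

```python
# Assumption: Start bp / End bp are 1-based, inclusive, forward-strand coordinates.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive coordinate span."""
    return end_bp - start_bp + 1


def extract_gene(replicon: str, start_bp: int, end_bp: int, strand: str) -> str:
    """Slice the span out of the replicon; reverse-complement it for '-' strand genes."""
    segment = replicon[start_bp - 1:end_bp]  # convert 1-based inclusive to a Python slice
    if strand == "-":
        return segment.translate(COMPLEMENT)[::-1]
    return segment


if __name__ == "__main__":
    print(span_length(2570599, 2575017))        # 4419, matching the Gene Length row
    print(extract_gene("ATGCCGTA", 2, 5, "-"))  # GGCA (toy replicon, hypothetical)
```
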
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGCAATCAAA CCGGCCGCCA CTGTCGTCAC GAGTCGCGAT TGTTGTATCC ACCAACTGGA TTCCGTATCT GTCATCTCGT CCCTCGGTAG CGTTGGTTTT CTCGTCGCGC CCATCTCGTC TTCACTCCAA TAGTTTAGCT ACTAAGCTGA ATCTACCAGA CTATTCTCAC TAGCAATTAT ATACTGAGTC CATTAGATCG TTGCAGATTT CTAATTTATA TATTGCCAAT TTATCCTAAT CTCCCCCCTT ACGTCGAATA CTCTCAGTCA ATCACAGTGC GATGTCAGAC GACACTCCTG CTGCCACCGC ACTTTCGTCG TCGTCGTCGT CGGACTCAGT GAACCTCTCC ACCAACTCTC CCTACAAGAT CCCCAACTCC ACTGATAACT TGAAGCTGGA CTCCGAGTCG CTCTTGGAAA CTGAATTGAT GGAGAAGCGC CACTTGATAG AGGAATTGAT CCACAACAGC AATACAGGCA AAGAGGGCGA CCCCTGGTAC CTAATCTCGA CAGAGTACCT CAACAACTTC CTCCACCTCC CGGCAACATC TTTTGAAGAC CTCCAATCCA AACTTGGTCC CGTAGACAAC ACCTCCATTG TAGACCAGAA TGGCATCTTG TACCCGGAAA ACAACGAACC AGTCGAAACA TACAACGTCT CACCAGAAAT CTTCAACTAT CTTGCCGACT GGTTTGGCAC CAAAGGGCAG CCTGTGTGTA GATGTCTTAT AATCAACCCC GAAACTGGTG CCAAAGAAGT CGAAAGATTT CCTCCTGTCT TCCATATCCA CCAATTGGGC AAAAGACCCG TGCAGAACTC GTACTATACT CGCCACCATA ACCATCCTAC TCATGCTACC AGCCATCCCA CAGCTGTATC TTTGTCTCGG ACTAAGACAT TCACCGATTT GCTCGACTTG ATCCGGGTAT CGGTGTTGAA GTTGCCCAAA AGACCCATCG ATAACTTCCG TATCTGGTTC ATCGAATCAA AAAACATAGC CGAATACACT TCTACTCTTA CTATCAGTAA TCTCATCTTT GACATCGACA ACAAAAGTTT GGTCCATCTG GGCATCTTGA AAGATACTCT CAAGAGCCAG GGCATTAAAT CCGACGCTTT CCATATCCTC GTAGAGGTGA AGGATGCAAA GGCCAGCCAT GACCGCTTCC CAGTGAACAC ATACTTAAAC CAAATAGATG TGGAACAGTA CGACTTCGAC AAGCTTAGTA GTAAGAGCAG CGGGCATTTG GGCTTAAGCA ACTTAGGAAA TACATGTTAC ATGAATTCAG CTTTGCAATG CTTGTTACAT GTGGCAGAAG TAAACTACTA TTTCTTGTAC AATCTATACA AACGAGAATT GAACTTTGAC AATCCTTTGG GTAACCACGG AGATATAGCC AATGCTTTTG GAAACTTGCT TAAACAGGCT TTTGACCCTT CCTCGTCAAA GGACTCTAGC ATCACACCCA GAGAATTCAA ATCCACCATA GGAAGGTACT CGTCCATGTT TTCCGGCTAC ATGCAACAAG ATTCTCAAGA ATTTCTAAGC TGGCTCTTGG ATGCTCTTCA TGAGGATCTC AACAGAATTC ATAAAAAGCC CTACTGCGAA AAGCCGGAGT TGAAAGACGA AGAAATCAAC GATCCCAAAG CCATTATTAA ATTGTCTGAA ACTTGCTGGA ACCAGCACAA GATGAGAAAT GATTCGGTGA TTACAGATTT GTTTACCGGC TTATATCAGT CGACTTTGCT TTGTCCAGAT TGTTCAAAGA CCTCTATAAC ATTCGATCCG TTCAACGATT 
TGACGTTACC ATTGCCAATA AGCAAGAAGT GGTATCACAC ATTCACTATT GTTGATTTGT CGGATGGCGG CATCTTAAGA GGAACGAGAA TAATGAAGTT GGAAGTTGAG CTTAACAAGA CGTCCAATTT CGACGATTTA TTGACTTATT TGAGTGATTT CTTAAAAGTC GAGACTACCC ACTTGTTCCT CTACGAGATA TTTAGAAATT CATTCTACAG CGACTTTCAA TTGGATTACA ACCGAAACAA GTTTATGCCC ATTAGTGACA TCGTGAGAGA TTCAGATGAA GTACATGTCT ATTACGTCCC TTACAACCCA GAAACGGACA TTATAATTCC AGTTTACAAT ACAGTTGAGG ATCCAGACAA GTCCTACAAA GTTACAGAAA CTTTTGGAAT CCCGTTGTTT GTGACATTGA ACAAAGAAAG TGATGTATAC AGTTTTGGAA CAATCCGTCA AAAGCTTTTG GATAAGGTTT CTATTCTTTC TAAACTTGAC TTGAACGAAG AATATAACAA AATCAAGAAT GACACAGAGA GCTACGTCGA AAAGAAACAT TATGGACCTA AAGATTTTCC TTTACTTTGC AAGCAGGATG ACAATGCAGT ATTGGTAGAA AAGCCTCCAA GTGATCATAC AATTGAAGGT GATGGACAAC TTGACGAAGA AGACGAAGAT GGATACATAT CTGATATTTC TTTGGCCAAT CCTTTTATCA GCGCAGATTT TGGTTTCACA ATTAAGTCTT TAAACGGATA TAGCCACAAA CCGCATGCAA ACTTTCGTAA TAGATACAAC TTTGCTAGAA ATGCGCCCAA GACAGTCACG CCACCTCGTG TGATAAATGT TCCACTTCAC AAGCCTACTT TCAATGAATT CAAGCCTTTG TCAGACCAAT TGCCAGAACT TAAGAAAGAT TTCTATCATT ATCCAACGTA CGACAAGAAG TTCAACAAAG AAATGGAGGA CTTGGCAGAA GAAGTGCAGA ATGATTTGAT AGCTAGTGCT AGTGATGATA CATCCAAGTC CAACGATGAC TATGAATTTG TAATGCTTGA TGCTGAAGGA GGAGAAGAAG CTACGAAGAA TGGCAGTGTC CAGCCACCAG CGCTACCTCC CAGAAATTTA ATTCCACAGA TGAATCTGTC GGATGAAGAC ACAGAAGGGG AGCAGAATTT AGGAAGCTTA TTCGATTCCA CATCTACTTT GCCAATTCCA CCTCCTTCTT CGGGTTATGC TGAATCGTTG AAACCATCCA ATGAAAATTC TCCTACAAAT ACTCCATCAG AGGTTGAGCT TGGTAATCAT CCGGAGTTGA TAACCAAGGA TACAATCTTA CTCTGTGATT GGGATTACCC GATTTTTGAA CAATGTTTTG TCGAGAATCA AACTTGGGAG AATATTCCAG TACTTCCCAA CCCAGAATTG GAGAAGAATA GAGCCAAAAT TGAAAGACAG AGAAAGTCTA AGATTTCACT ATATGACTGC TTGAAGAGTT TCAGTACTCC AGAGATTCTC GGTGAACACG ATTTATGGTA TTGTCCTAGA TGTAAAGATC ATAAACGCGC TACTAAAACC ATTCAAGTTT GGTCAACTGG TGATATATTA ACTATCCACT TGAAGAGGTT CCATAGTGCC AGAGCTTTTA GTGACAAGAT TGAAATGGTT GTTGACTTCC CTATAGAGGG ACTTGATATG AGTTCATACA TTGCTAATCC AGAAGCTGAG AATAGTCTTT ATGACTTGAT TGCAGTGGAT AACCACTATG GTGGCTTAGG TGGTGGTCAC TATACTGCTT CAGTTAAAAA 
CTTCCGTGAT GGAAATTGGT ACTATTTCAA CGATAGTCGC GTTACCAGTA TCAACAATCC AGAAGAAGTT ATTACTAGTG CAGCCTACTT ATTATTCTAC CGCAAGAGGA CTACAGATCC AGACGACTAC CTTGGCGGCG GAGACTTGAA TAATATGATA CGGAGCGGGC ATGAGTTGTA CAAGAAGTCG TTAATCACCA AGAAAGCCAG CTATGATTTA GTACAACAGC AGGTCGAAAA ATACACTAAA CACGAGGAAC AGATCAAGCG AGAAATGGAA TTAGATCGTG AATTAGAAAA AAGGAAATTA CAGGAACTGG GGCTGATAGA AGCCCCAGAA AATGGAGAGG AAACTGTCTA TGAGGATGGT GAAGCTGCTT CCCCCAACAA TTCCAAAAGT GCAGATAATG AACCTGTAAG TGCAAGAAAG ACTAGATCAT TTACCAATAC CAAAGATTCG CCTAATGATT CTGTCACAAC TTCTGCCAAG GCTAAGTTCA AGTTTGACAA CGACGAAGAC GACTACGACT ACGAAGACGA CCTGGATAAC ATAAGAAAAC AGCGACTTCT TTCTAAGGAG AACAATAACA ACAAGTTGGT GCAAATTAAA AGCAACGGCA AGCAAGAAGT GGCATCGTCG CCTATTGCCA TGGAAGCAGA GTACGATAGT ATGGGCGAAG ACTGCTCAGT CTAACTGGTA CCATAATAAT CCATACATTA TAAACTATAC GCATCATTCA TTACTATAGA TTGAATAGCA ATACATACAC AATGACAGCT ATTTAGGTAG TTGCATTGAC ACAGACCAG
|
Protein sequence | MSDDTPAATA LSSSSSSDSV NLSTNSPYKI PNSTDNLKSD SESLLETELM EKRHLIEELI HNSNTGKEGD PWYLISTEYL NNFLHLPATS FEDLQSKLGP VDNTSIVDQN GILYPENNEP VETYNVSPEI FNYLADWFGT KGQPVCRCLI INPETGAKEV ERFPPVFHIH QLGKRPVQNS YYTRHHNHPT HATSHPTAVS LSRTKTFTDL LDLIRVSVLK LPKRPIDNFR IWFIESKNIA EYTSTLTISN LIFDIDNKSL VHSGILKDTL KSQGIKSDAF HILVEVKDAK ASHDRFPVNT YLNQIDVEQY DFDKLSSKSS GHLGLSNLGN TCYMNSALQC LLHVAEVNYY FLYNLYKREL NFDNPLGNHG DIANAFGNLL KQAFDPSSSK DSSITPREFK STIGRYSSMF SGYMQQDSQE FLSWLLDALH EDLNRIHKKP YCEKPELKDE EINDPKAIIK LSETCWNQHK MRNDSVITDL FTGLYQSTLL CPDCSKTSIT FDPFNDLTLP LPISKKWYHT FTIVDLSDGG ILRGTRIMKL EVELNKTSNF DDLLTYLSDF LKVETTHLFL YEIFRNSFYS DFQLDYNRNK FMPISDIVRD SDEVHVYYVP YNPETDIIIP VYNTVEDPDK SYKVTETFGI PLFVTLNKES DVYSFGTIRQ KLLDKVSILS KLDLNEEYNK IKNDTESYVE KKHYGPKDFP LLCKQDDNAV LVEKPPSDHT IEGDGQLDEE DEDGYISDIS LANPFISADF GFTIKSLNGY SHKPHANFRN RYNFARNAPK TVTPPRVINV PLHKPTFNEF KPLSDQLPEL KKDFYHYPTY DKKFNKEMED LAEEVQNDLI ASASDDTSKS NDDYEFVMLD AEGGEEATKN GSVQPPALPP RNLIPQMNSS DEDTEGEQNL GSLFDSTSTL PIPPPSSGYA ESLKPSNENS PTNTPSEVEL GNHPELITKD TILLCDWDYP IFEQCFVENQ TWENIPVLPN PELEKNRAKI ERQRKSKISL YDCLKSFSTP EILGEHDLWY CPRCKDHKRA TKTIQVWSTG DILTIHLKRF HSARAFSDKI EMVVDFPIEG LDMSSYIANP EAENSLYDLI AVDNHYGGLG GGHYTASVKN FRDGNWYYFN DSRVTSINNP EEVITSAAYL LFYRKRTTDP DDYLGGGDLN NMIRSGHELY KKSLITKKAS YDLVQQQVEK YTKHEEQIKR EMELDRELEK RKLQESGSIE APENGEETVY EDGEAASPNN SKSADNEPVS ARKTRSFTNT KDSPNDSVTT SAKAKFKFDN DEDDYDYEDD SDNIRKQRLL SKENNNNKLV QIKSNGKQEV ASSPIAMEAE YDSMGEDCSV
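
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short fragments copied from the gene sequence above. It is a minimal, self-contained translator, not the database's own pipeline; the names `translate`, `STANDARD_TABLE`, and `TABLE_12` are hypothetical, and the only difference modelled between the two tables is the CTG reassignment.

```python
# Codon tables in the conventional T, C, A, G base order.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD_TABLE = dict(zip(CODONS, STANDARD_AAS))

# Table 12 as modelled here: identical to the standard code except CTG -> Ser.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna: str, table: dict = TABLE_12) -> str:
    """Translate an in-frame DNA string, ignoring spaces and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table.get(dna[i:i + 3], "X")  # X marks codons with ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # Start of the coding region, as it appears in the gene sequence.
    print(translate("ATGTCAGACG ACACTCCTGC TGCCACCGCA"))     # MSDDTPAATA
    # A fragment containing an in-frame CTG codon.
    print(translate("AACTTGAAGCTGGAC"))                       # NLKSD
    print(translate("AACTTGAAGCTGGAC", STANDARD_TABLE))       # NLKLD
```

Both table-12 results match the listed protein sequence (`MSDDTPAATA` at the start, `NLKSD` inside `PNSTDNLKSD`), whereas the standard code would put a leucine at the CTG position. Note also that 1340 codons plus a stop codon account for 4023 bases, fewer than the 4419 bp gene length, which suggests the listed gene sequence includes untranslated bases flanking the coding region.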