Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28842 |
Symbol | |
ID | 4851589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2228700 |
End bp | 2229896 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393297 |
Product | predicted protein |
Protein accession | XP_001386776 |
Protein GI | 126274954 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.991697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.128685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATTCA CTAAGTTGTT GTTCATACTT GCTTTCTACT TCAACTTCAT ACACGCCGAA AACTACTTGA TCTCGTTGAA GAACAATGAG AGTTTAGAGG CATTCTTCAA ATATGATATT CTCAGACCAG CTACTGAACA AGTGAGGGCG CTTCTTACCA ATTCATTCTC AATCGGCAAT TTCACTGGGT TTGTTGGGGA CTTCTCCAAA ACCAACCTTG AGAGACTCAA AAGATGTCCC TTAGTGAACG AGATCACACC AGATGTGATA TTTAAGGCTT ATGGAACTAC TACTCAAGAG CAAGCCCCAA GACACCTCGC TCGTCTCTCC AGCAAAAAGA AGCTCAAGTC AGGAAAAAGC TACCAATATG TTTACAATGA CGACTATACT GGATCTGGGG TGTATGCCTA TGTGTTAGAT TCTGGTGTTG CTATTGGTCA CCCTGAGTTC CAAGGTAGGG CTCGGTTTGG CAAAGACTTC ACCAGCCAAG GCTCTGGTGA TTCTAATGGG CATGGAACAC ACGTTGCTGG TATTATAGGT TCTTCTACTT ATGGGGTATC CAAGAATGTA GAGATTATAG AAGTCAAAGT ATTGGATAGT CTGGGCTCGG GTTCTCTCAG TACAATTATC TCAGCGCTAG AGTTCTCTGT GAACCATAGA AAAAGAAGTG GAAAGATGGG AGTAGCCAAT CTTTCATTGG GGTCGTTTAG AAATGGAGTC TTGAACAGTG CAATCAATGC TGCTGCAGAT ACCGGTCTAG TTGTGATAGT TGCAGCTGGA AATTCCAATA TCAATGCCTG CTTATCTAGT CCAGCTAGTG CTGAAGGTGC AATTACTGTT GGAGCTATAG ACGACTACAA CGATTCTTTG GCATCTTTCT CTAATTGGGG GGAGTGCGTT GATATTTTTG CCAGTGGAGC CTATGTTAAG AGTGTGAATG CTGCAGACTA TAATAATCCA GAGACTCTCT CAGGCACTTC CATGGCATCT CCTGCCGTCT GTGGACTCGC TGCAAATCTA CTTAGTGAAG GGGTCCCTCC CCACAAGATC AAGAGCAAGC TTCTTAGCCT ATCACTCAAG GACCAGATCA AAAGATCTTC CTTGTTCCTC AGAAGAGGCA CTCCAAACAG AATAGCTTAT AATGGAATTG ATGACGAATA CAGGGATGAC ACGGACTCCG ACTCCGACGA TGATTAG
|
Protein sequence | MLFTKLLFIL AFYFNFIHAE NYLISLKNNE SLEAFFKYDI LRPATEQVRA LLTNSFSIGN FTGFVGDFSK TNLERLKRCP LVNEITPDVI FKAYGTTTQE QAPRHLARLS SKKKLKSGKS YQYVYNDDYT GSGVYAYVLD SGVAIGHPEF QGRARFGKDF TSQGSGDSNG HGTHVAGIIG SSTYGVSKNV EIIEVKVLDS LGSGSLSTII SALEFSVNHR KRSGKMGVAN LSLGSFRNGV LNSAINAAAD TGLVVIVAAG NSNINACLSS PASAEGAITV GAIDDYNDSL ASFSNWGECV DIFASGAYVK SVNAADYNNP ETLSGTSMAS PAVCGLAANL LSEGVPPHKI KSKLLSLSLK DQIKRSSLFL RRGTPNRIAY NGIDDEYRDD TDSDSDDD
|
| |