Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47614 |
Symbol | |
ID | 4839823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1185240 |
End bp | 1186571 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391138 |
Product | predicted protein |
Protein accession | XP_001385579 |
Protein GI | 150866096 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.80881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTC CTCCTCCAAT AGACTTCGAC AAGATCAGAG AGAACTCAGA TGGGTCTCTT GGGTCCATGA TAAAGCTGGG CTTCACAGAA GTGATTCTGA ATAACCCCTA CTTCGCAGCT GGTGGTGGTC TAATGGTATT AGGAACAGGT TTGGCGCTTG CCCGTCAGGG TATTGTCAAG AGTTCGGGCT TCATCTATCG TCAGTTGCTT GTAGACCTAG AGATTCCTTC TAAGGACAAG TCGTACCTCT GGTTTCTCGA ATGGATGTCT CAGTATAAAC ACAGAACGCT GCGTCACTTA TCTGTGGAAA CTAACTTCGT TCAGCACGAT AACGGTTCTG TTTCGACTCG GTTCTCTTTG GTTCCTGGTC CAGGTAAGCA TTTAATCAAG TACAAAGGTG CCTACATGTT GGTTAATCGT GAAAGGTCTG GAAAGTTGCT TGATATGACC AGTGGAACAC CGTTTGAAAC AGTGACCTTA ACCACATTGT ACAGCGACAG AAAGTTGTTC AGCGATTTGT TAGGTGAGGC CAAGCAGCTA GCTTTGAAAG CTAGAGAGGG CAAGACTGTT TTATACACTT CGTGGGGTCC AGAATGGCGG CCCTTCGGTC AGCCTAGAAA GAAAAGAATG ATCGGATCGG TTATTCTCGA CAAAAGCATT GCCGAAGGCA TCATTTCAGA CGTCAAAGAT TTCTTGGACA GTGGAGAATG GTACCATAAA CGAGGCATAC CCTACAGAAG AGGTTATTTG TTGTACGGAC CACCTGGAAG TGGTAAGACT TCTTTTATTC AGGCTTTGGC TGGAGAGTTA GACTACAATA TCTGCATTTT GAATTTGTCG GAAAGCAACT TGACCGACGA CCGGTTGAAC CACTTGATGA ACCACATTCC AGAAAGATCT ATATTGTTAC TTGAAGATAT CGATGCTGCC TTCAACAAAA GAGCTCAGAC GGAAGACAAG GGCTACACTT CAGGGGTTAC CTTTTCAGGT TTGTTAAATG CGCTAGATGG AGTTGCTAGT GCGGAAGAAT GCATTACCTT CATGACAACT AATCATCCCG AAAAGCTTGA CCCAGCCCTC ATGCGTCCTG GCAGAGTCGA TTATAAGGTT CTAGTGGACA ATGCTACTGA ATACCAGGTC AGACAGATGT TCTTACGATT CTACGAAAAC GAGAACGAGC TCTGTGAAGT GTTCATGAAC AAATACAGAC ACCTCCAATT GACAAAGGTC AGCACAGCTC AACTACAGGG ATTGTTTGTC TACAATAAAA GCAACCCACA GCTGGCCATT GACATGATCG AGACATTGCA GAACCCAAAT ACCGTGTTCT AG
|
Protein sequence | MASPPPIDFD KIRENSDGSL GSMIKSGFTE VISNNPYFAA GGGLMVLGTG LALARQGIVK SSGFIYRQLL VDLEIPSKDK SYLWFLEWMS QYKHRTSRHL SVETNFVQHD NGSVSTRFSL VPGPGKHLIK YKGAYMLVNR ERSGKLLDMT SGTPFETVTL TTLYSDRKLF SDLLGEAKQL ALKAREGKTV LYTSWGPEWR PFGQPRKKRM IGSVILDKSI AEGIISDVKD FLDSGEWYHK RGIPYRRGYL LYGPPGSGKT SFIQALAGEL DYNICILNLS ESNLTDDRLN HLMNHIPERS ILLLEDIDAA FNKRAQTEDK GYTSGVTFSG LLNALDGVAS AEECITFMTT NHPEKLDPAL MRPGRVDYKV LVDNATEYQV RQMFLRFYEN ENELCEVFMN KYRHLQLTKV STAQLQGLFV YNKSNPQSAI DMIETLQNPN TVF
|
| |