Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_39213 |
Symbol | SAP7 |
ID | 4851218 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1224409 |
End bp | 1226169 |
Gene Length | 1761 bp |
Protein Length | 335 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392926 |
Product | secreted aspartyl proteinase |
Protein accession | XP_001387880 |
Protein GI | 126274204 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.464374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00607516 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TATGCCGTCA ACATTACTGT TGGTACACCA CCAGTACAGC TTCAAGTTGA ATTGGATACT GGCTCTTCTG ATCTTTGGGT TATTAGCACA AAAGCGCCTG TCTGTAAAAG ATACAATTGC AACATTTACG GAGTCTTCGA CAAAGATAAG TCATCGAGTT GGAAAAGTAA TGGCACAGAG TTTGGAATTA GGTATGTTGA TGGCAGTGGT AACGATGGAG GAATTTATGG ACAAGATACT ATTAATTTCG GCAACGGGTT GGTGTTGAAC AACGCAACTT TTGCTGTGGC CAATGAAACG GTTGATCTTG TTGGTATAAT GGGGATTTCC TTTGAATTCA GAGAAGCTGC CACACAGAAA TATCCAAATA TACCTTCTCT AATGAAACTA CAGGGATACA CCAAGAAAAC AGCTTATTCT TTATATGTCA CTGATGAAGT CAAACAAGCT GGTTCTATTT TGTTTGGCGG AATTGACCAC GCAAAATACG AGGGTGAATT GGTTTCCTTG GATATGGTTG CTCTGAATGG AACTTATCGA GCTTTACAAA CTGAATTAAC TTCCATCAAA ATCTGTATCA ATGGTTCTTG TGCTGATGCC TCCGAATCAG GTTCAAGTTC TTCAGTGAAT TCATCCGGTA TGCCTTCGTC CATACAATCA AGCACAAAAA AATCTACGAA AGGTTCGTCC ATATCCAACC CTACGACTAG CCATTCTTTG ACTAGTCTTG GAGTCTCTAG TTCAACTCCT TTCACTACTT CAATTACTCC CGTATCGTCT GTTTCAACAG GTTCGGTAGG TTCAACTGCC TCTATAAGCT TAAGCAAGTC TCCTCCCATG TTGTCTGAAT CTAGTATGTC TATCCGATCT TCATCTTCAT CTTCTTCAAT TGAAGCAGCA TCCTCTTCAA AGCATACGGT AGATCTGGGA ACTTCTTTCA GTAATGAGAC ACCTTCGACA TCTAGTTTCT CAAGTAGTCT TGCGCAGCAA ACTGCCAACC CAAGTGCTAC TTATGTCCTT TCTACTCTTT CGACTGCTTA TGTTTCCTAC TTTACTGAAA TTACCGAAAC TTCTTATTAT ACCACCACAT CAAGCAGAAT CACTGAATAC TATACTCAGA CTATAAAGAA TTCGATCAGT GCAAGTAACA GTACCAGTGC TCCAACATGG ATTGAGTACT ACAATCAAAG TTTACAAATG TTCGAATCTC TTAAAGCGTT GTTGCAAAAA ATGACTGATT TGTACAATCA GCAGAGTCAA GCTCAAAGTA GCTCCAGTAA CAATAAGAAA AGGCAAGTAA TTGAGCAAAT CCAGAAGTCC AAGAGAGATG AAAATGATGT TGCTATTGGT ATTCCATTTA CATTCGACTC TGGAGCAGAT ATAACTATTA TCAGAGAAGA TTTCCTTGAG CAGATATATA AGGTGTTGGC TCCTGGAGAA GATTTTCAAG TCTATGGTAC ATTAGGAACA TACAAGGTAC CATGTTATTT GAATGACAGT GGCAATTTCT TGAGTTTCAG CTTCAGCAAT AAAAAGGAAA TCCAAGTGCC AATGAGTGAG TTTCTTTTGT CGCAAGAAAC CGCTAACGGA ATTCAGTGTG GGTTGAAGAT ACTGACTCCT TCTCCAGGAT TGGAAGATCA TGGAATCTTT GGAGTTAACT TCTTGAGATC GGTCTACACA GTTTTCAACT TGGATGATAA GACTATCTCT CTCGCCAAGG TAAAGTATTC TGAAGATGAA CGGATTACTG CCATAGAGTA A
|
Protein sequence | YAVNITVGTP PVQLQVELDT GSSDLWVIST KAPVCKRYNC NIYGVFDKDK SSSWKSNGTE FGIRYVDGSG NDGGIYGQDT INFGNGLVLN NATFAVANET VDLVGIMGIS FEFREAATQK YPNIPSLMKL QGYTKKTAYS LYVTDEVKQA GSILFGGIDH AKYEGELVSL DMVALNGTYR ALQTELTSIK IYENDVAIGI PFTFDSGADI TIIREDFLEQ IYKVLAPGED FQVYGTLGTY KVPCYLNDSG NFLSFSFSNK KEIQVPMSEF LLSQETANGI QCGLKILTPS PGLEDHGIFG VNFLRSVYTV FNLDDKTISL AKVKYSEDER ITAIE
|
| |