Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31676 |
Symbol | |
ID | 4838863 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1410166 |
End bp | 1413203 |
Gene Length | 3038 bp |
Protein Length | 449 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390178 |
Product | predicted protein |
Protein accession | XP_001384568 |
Protein GI | 126136088 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.499026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAG TAGGCTATGG AAGCCCACCA GATCCTCGGT ATCCCTATCC TAGCCCTTGG TGTATTATTG TTACCGACAA TGTATTTTCT GCGTATTTTC TCATCGTGGT ATTTTCAATC GGATAACAGA ATGCTACGAG GGTTGGATCT AGTGGTGTTC AACCACAGCT TGTAGTCGTA GGCAGCTAGT GGCAACTAGG TGGGTCAAGA TCCTCGTCGA CAATTACAGG TTCTAGAAGG ACTAAAATGG GATTTCTGCT ACGGTAATTC ATGGCAACCA GAGATCTGTC TCAGGTCTTG CATTGTCTTA CATACGAAAA CTTCCTTAAT TAGTGGAAAT TTTGCACTAT TTTCTGGAAC ATACAGTATA TGCCTGGCAT TGTCGGCGGG GCTCTTTGTG CAGTTTGGAT CCTTAGGCGG CTAGCGTAGT GACTTTGACG TGGTAGTCTA CGGGTCTTTG TCAAATCTCA AAATGTGATA ATTTGGTAGC AGCCGATATC ACAGTCTTCT GGAACTGAAA AATACAGCCG AGTGGGGAAG ATACCATTGG CCTTGTTGAA GGTATAGTCG GTGTGGGCTG TTCTGGACGC GCAGTTACGC AGCCTTTTTC GTCAACTCCC CAATCTTAGC TCCCGCTTGC ATGGACATAC TTCCACTCCT AAGGGATAGT TGACCTTTTT CTAAATGTAG CGCGGATGTC GATCATTCCT GAATGTAATT TTCAACTCAC CCCTTGAACA ATTGCAATTT TAATTTTTTA AAGATGGGTT TAAAGGCTAT TGTCGAGATA TCGGACCCAG CTATCACTTT GGCCATTGAC GAATTATAAT TAATTGCCGT CAAATTTATT TGCTTGCAAT GGATACGTTT TCACGGTGTT CTCGCATTGG GTATCTAAAC AGCTTCTTGT TGCTGGCCCA GGGCTACAGA TATTTACGCC TCGTTTAGAC ACCAGCTATT GTTCTCGAGC TTCTACGATA AACCCCCTTT CCACGTCATC TAAAAGCCAA AAATCCACCG TAATATCCTG GATCTATGGT TATCAAACCT TGGCTCACTT TCTTGACGTG CAGATAGCAG CACTGGACCA TCGCCAGAAA CCAAAGTCAG AATAAGCCTA CTTTGTATAA CATCCTCTGA CTTGAACGAA ACCGAGGAAA CTCCTGTTCG GAAAATGGAC ATGAAGAAGT TCGGACCATA GTGGCACCTG TCGATTTACT TCATTGGATC CGTGAACTTT ATTTTATCTG AATAGTGATT TCATACGCAT TCTTGTCTTG CGTGCCTATG AAACAGGGAC CCCTCGTGTT TGGAGAGCCA ACGCAACCCA ATCGGCATTA ATTGGGACGT TTCTATCCCA TTTTTGGCTT CACATTGTAA GTTGAGTTCT GTGTCTAAGT ACTCTCCACA TAAATAAAAT CAGCTCCATA TCGCTTCATG AATTCAATTA CCTTTGAATA TCTTCACGTG ATCGTACGGC TCAAGTATGG CTCTCCCAGT TTAACCAGCT AGTTCTTATC TAGTCTATCT CTTTAGTTTT CTCTGCCTCA GGTCAATTGA GTAGTGCGAT GAAACTTCTG ATCGGCACTA TTCTACTTTC GGGCTTGTGT GGCTCTGTTG CCACAGCGTA TTCTTTATCT TCCTTGCAAC AGTTTTTGGG CTTCAAGGAA GTTTCGGATG CACACGATAT TGCAAATCTT GGTAATTCTG GGCCTTTGGA CAATTTATTT GATGTAGTTT CAAACGAGAA CTATGCCAGT CACAGGTTGC GAGTTAAGCA CATAGATCCG CTTGTTCTCG GTCTAGATAA AGTAAAGCAA GTCACGGGAT ATTTGGATAT CGAAGATGAC AAGCACTTGT TCTATTGGTT CTTTGAATCG AGAAACGATC CCCAGAACGA CCCAGTAGTA TTATGGTTGA ATGGAGGGCC AGGTTGCTCT AGCTCAACGG GACTCTTCTT TGAGTTGGGG CCCTCTTTTA TCAATTCAAC CCTTCAGCCA GAATATAACC CCTATTCGTG GAACTCGAAT GCGTCTGTTA TCTTCTTGGA TCAACCTGTA GACGTAGGAC TCTCGTACTC GGATGACAAC GAAGTTTCAA CTACGGCTGC TGCTGCAAAA GATGTATACA TATTCTTGGA ATTGTTCTTC CAAAAGTTTC CACAATTCCA AAGCAGAGAC TTTCATATGG CTGGAGAATC GTATGCTGGC CATTACATTC CTAAGTTCGC GTCGGAGATC CTCAGTCATC CGGAAAGGTC GTTCAACGTG ACTTCAGTTC TCATTGGAAA TGGGTTCACT GATGCTATTC CACAATATAA AGCTCTTATT GGAATGGGAT GTGGACAAGG AGGTTATGAT TCAATCTTGT CAGAACAAGA TTGCAAGGAA TTGGAAGAGA ATTACTATCC CAAATGCAAG CAATTCCTTG AACTATGCAA CAGGGAACAG GATGCATTGA CATGTGTACC AGCTTATCAT TACTGTGAAA CAAGAATGTT TATTCCTTTC TCCAAGACGA ACTTGAACCC ATATGACATA CGTGAAGAAT GTGAAAGGGG TGGAACTTGC TACGAGGAAC TAGACGATGT GGACGCTTAT CTCAACCTTG ACTTTGTCAG GAGTGCCATT GGGGTTTCTC CTGAAGTCAA GAAGTATGAA GGTTGTTCTG ATGTTGTATC AAAGAACTTT GCCTTGGAAG GCGATAAAGC ATTGCCCCAT CAGCAGTATG TTGCCGAACT TCTTGAAAAG GAGGTAGCAG TATTGATATT TGCTGGAGAT AAAGACTATA GATGTAATTG GTTAGGTAAC TACGAGTGGA CAGACCAATT AGACTATGAT GGTCATGATG AATTTTCAAG TAAACCTTTG GTGCCATGGC AAACTTCTGA CGGCAGTATT GGTGGAGAGT ACAGGAACTA CGAAAAGTTC ACTTATTTGA GATTCTACGA TGCTGGCCAT TTGGTCCCTC ACGATCAACC CCAGAGGGCA TTGGAAATGG TTAACAGTTG GTTACAAGGA CAGTATTCAT TGAACTAA
|
Protein sequence | MSEVGYGSPP DPRYPYPSPW FTQPFSLRVK HIDPLVLGLD KVKQVTGYLD IEDDKHLFYW FFESRNDPQN DPVVLWLNGG PGCSSSTGLF FELGPSFINS TLQPEYNPYS WNSNASVIFL DQPVDVGLSY SDDNEVSTTA AAAKDVYIFL ELFFQKFPQF QSRDFHMAGE SYAGHYIPKF ASEILSHPER SFNVTSVLIG NGFTDAIPQY KALIGMGCGQ GGYDSILSEQ DCKELEENYY PKCKQFLELC NREQDALTCV PAYHYCETRM FIPFSKTNLN PYDIREECER GGTCYEELDD VDAYLNLDFV RSAIGVSPEV KKYEGCSDVV SKNFALEGDK ALPHQQYVAE LLEKEVAVLI FAGDKDYRCN WLGNYEWTDQ LDYDGHDEFS SKPLVPWQTS DGSIGGEYRN YEKFTYLRFY DAGHLVPHDQ PQRALEMVNS WLQGQYSLN
|
| |