Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_38351 |
Symbol | CPY2 |
ID | 4851141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1031886 |
End bp | 1033394 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | |
GC content | 44% |
IMG OID | 640392849 |
Product | carboxypeptidase C |
Protein accession | XP_001387427 |
Protein GI | 126274127 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.109115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTCT CGCTGTATCC AGCAGAAATG GCTGAAGCAT GGAAATCCAT GAGAGAAGTC TTTTCGCCAG CAGAGCTTCA GGCCAAAATT GACCAGTACA ATTCCAAATT GTCGGTCAGT TCCCAGAAGC TCAACAACGC TGTAAAGGAT TTCTCCACTT TTACCAAAGA AACTGTTGCA GGCGTCGAAG CTCAATTTGA GCAATTGTCT CACCCCAAGT TCAGTGACTA CTCCATGAGA ATTAAGAAAA CCAAACCAGA GCTGTTGGGT TTGGATACTG TGAATCAGTA CACAGGTTAT TTGGATGTGA ATGTCTTGGA TAAGCATTTC TTTTATTGGT TCTTTGAATC GAGAAATGAC CCCAAGAACG ATCCTATCAT CTTGTGGTTG AACGGTGGCC CTGGTTGTTC TTCTGCTACT GGTTTGTTCT TTGAATTGGG CCCTTCTTCT ATCAATGCGA CGTTGCAGCC CGTCTTCAAC CCATACTCAT GGAACAATAA CGCTTCTGTC ATCTTTTTGG ACCAACCTGT TGGTGTAGGA TATTCGTATA CTGGTGGAGA CCAGGTCACA AACACTGCTA GTGCTGCTAA GGATGTGTTT GTGTTTTTGG AATTGTTCTT CCAAAAGTTT CCTCAGTTCA TCCAGAATAA GTTCCACATT GCTGGAGAAT CGTATGCCGG CCATTATATC CCCAGCTTTG CTCTGGAGAT TATCAACAAC GCTGACAGAT CTTTTGAGTT GTCTTCGGTC TTGATTGGCA ATGGTATCAC TGATTCGCTT ATCCAGAACG GCTATTACGG TCCAATGGCA TGTGGAGAAG GTGGCTACAA GCCCGTTATC ACCCAAGAAC AATGTGATCA GATAGAAAAA GACTACCCTA AATGTGCTGC TTTGACAAAT ATCTGTTACC ATTTCCAGAA CGCATTGACT TGTGTTCCTG CTCAATACTA CTGTGACATG AAGTTGTTTA AGCCTTACGG AGACACTGGA TTGAATCCCT ACGATATCAG AAAGCCATGT GCAGACCAAG GTGCTAACTG CTATGTAGAA ATGGACTACT TGGATGACTA CTTGAATTTG GACTATGTTA AACAAGCAGT AGGTGCATCT AACATTGACA TCTTCACCAG TTGTGACTCG ACAGTTTTCC AAAACTTTAT CTTGAACGGT GATGAAGCAA GGCCATTCCA ACAGTATGTC GCTGAATTGC TTGAAAAGGA TATTCCTGTC TTATTGTACG CTGGAGACAA GGATTACATC TGTAACTGGT TGGGTAACCA CGCCTGGTCC GATGCCTTAG AGTACGAGCA CCACGAGCAA TTCGAAGCTG CACCTTTCAA GCCCTGGTAC ACCTTTGAAG GCAAGTTGGC TGGTGAAGTC AAGAACTACA AGAAGTTCAC CTTCTTGAGA GTCTACGACG CTGGTCACAT GGTTCCATAC GATCAGCCAG AAAACGCATT GGATATGGTT AACAGATGGG TCCAGGGAGA CTTTTCTTTC GGCAGCTAG
|
Protein sequence | MDFSLYPAEM AEAWKSMREV FSPAELQAKI DQYNSKLSVS SQKLNNAVKD FSTFTKETVA GVEAQFEQLS HPKFSDYSMR IKKTKPELLG LDTVNQYTGY LDVNVLDKHF FYWFFESRND PKNDPIILWL NGGPGCSSAT GLFFELGPSS INATLQPVFN PYSWNNNASV IFLDQPVGVG YSYTGGDQVT NTASAAKDVF VFLELFFQKF PQFIQNKFHI AGESYAGHYI PSFALEIINN ADRSFELSSV LIGNGITDSL IQNGYYGPMA CGEGGYKPVI TQEQCDQIEK DYPKCAALTN ICYHFQNALT CVPAQYYCDM KLFKPYGDTG LNPYDIRKPC ADQGANCYVE MDYLDDYLNL DYVKQAVGAS NIDIFTSCDS TVFQNFILNG DEARPFQQYV AELLEKDIPV LLYAGDKDYI CNWLGNHAWS DALEYEHHEQ FEAAPFKPWY TFEGKLAGEV KNYKKFTFLR VYDAGHMVPY DQPENALDMV NRWVQGDFSF GS
|
| |