Gene PICST_38351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_38351 
SymbolCPY2 
ID4851141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1031886 
End bp1033394 
Gene Length1509 bp 
Protein Length502 aa 
Translation table 
GC content44% 
IMG OID640392849 
Productcarboxypeptidase C 
Protein accessionXP_001387427 
Protein GI126274127 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.109115 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTCT CGCTGTATCC AGCAGAAATG GCTGAAGCAT GGAAATCCAT GAGAGAAGTC 
TTTTCGCCAG CAGAGCTTCA GGCCAAAATT GACCAGTACA ATTCCAAATT GTCGGTCAGT
TCCCAGAAGC TCAACAACGC TGTAAAGGAT TTCTCCACTT TTACCAAAGA AACTGTTGCA
GGCGTCGAAG CTCAATTTGA GCAATTGTCT CACCCCAAGT TCAGTGACTA CTCCATGAGA
ATTAAGAAAA CCAAACCAGA GCTGTTGGGT TTGGATACTG TGAATCAGTA CACAGGTTAT
TTGGATGTGA ATGTCTTGGA TAAGCATTTC TTTTATTGGT TCTTTGAATC GAGAAATGAC
CCCAAGAACG ATCCTATCAT CTTGTGGTTG AACGGTGGCC CTGGTTGTTC TTCTGCTACT
GGTTTGTTCT TTGAATTGGG CCCTTCTTCT ATCAATGCGA CGTTGCAGCC CGTCTTCAAC
CCATACTCAT GGAACAATAA CGCTTCTGTC ATCTTTTTGG ACCAACCTGT TGGTGTAGGA
TATTCGTATA CTGGTGGAGA CCAGGTCACA AACACTGCTA GTGCTGCTAA GGATGTGTTT
GTGTTTTTGG AATTGTTCTT CCAAAAGTTT CCTCAGTTCA TCCAGAATAA GTTCCACATT
GCTGGAGAAT CGTATGCCGG CCATTATATC CCCAGCTTTG CTCTGGAGAT TATCAACAAC
GCTGACAGAT CTTTTGAGTT GTCTTCGGTC TTGATTGGCA ATGGTATCAC TGATTCGCTT
ATCCAGAACG GCTATTACGG TCCAATGGCA TGTGGAGAAG GTGGCTACAA GCCCGTTATC
ACCCAAGAAC AATGTGATCA GATAGAAAAA GACTACCCTA AATGTGCTGC TTTGACAAAT
ATCTGTTACC ATTTCCAGAA CGCATTGACT TGTGTTCCTG CTCAATACTA CTGTGACATG
AAGTTGTTTA AGCCTTACGG AGACACTGGA TTGAATCCCT ACGATATCAG AAAGCCATGT
GCAGACCAAG GTGCTAACTG CTATGTAGAA ATGGACTACT TGGATGACTA CTTGAATTTG
GACTATGTTA AACAAGCAGT AGGTGCATCT AACATTGACA TCTTCACCAG TTGTGACTCG
ACAGTTTTCC AAAACTTTAT CTTGAACGGT GATGAAGCAA GGCCATTCCA ACAGTATGTC
GCTGAATTGC TTGAAAAGGA TATTCCTGTC TTATTGTACG CTGGAGACAA GGATTACATC
TGTAACTGGT TGGGTAACCA CGCCTGGTCC GATGCCTTAG AGTACGAGCA CCACGAGCAA
TTCGAAGCTG CACCTTTCAA GCCCTGGTAC ACCTTTGAAG GCAAGTTGGC TGGTGAAGTC
AAGAACTACA AGAAGTTCAC CTTCTTGAGA GTCTACGACG CTGGTCACAT GGTTCCATAC
GATCAGCCAG AAAACGCATT GGATATGGTT AACAGATGGG TCCAGGGAGA CTTTTCTTTC
GGCAGCTAG
 
Protein sequence
MDFSLYPAEM AEAWKSMREV FSPAELQAKI DQYNSKLSVS SQKLNNAVKD FSTFTKETVA 
GVEAQFEQLS HPKFSDYSMR IKKTKPELLG LDTVNQYTGY LDVNVLDKHF FYWFFESRND
PKNDPIILWL NGGPGCSSAT GLFFELGPSS INATLQPVFN PYSWNNNASV IFLDQPVGVG
YSYTGGDQVT NTASAAKDVF VFLELFFQKF PQFIQNKFHI AGESYAGHYI PSFALEIINN
ADRSFELSSV LIGNGITDSL IQNGYYGPMA CGEGGYKPVI TQEQCDQIEK DYPKCAALTN
ICYHFQNALT CVPAQYYCDM KLFKPYGDTG LNPYDIRKPC ADQGANCYVE MDYLDDYLNL
DYVKQAVGAS NIDIFTSCDS TVFQNFILNG DEARPFQQYV AELLEKDIPV LLYAGDKDYI
CNWLGNHAWS DALEYEHHEQ FEAAPFKPWY TFEGKLAGEV KNYKKFTFLR VYDAGHMVPY
DQPENALDMV NRWVQGDFSF GS