Gene PICST_78088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_78088 
SymbolKEX1 
ID4839353 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1555856 
End bp1558326 
Gene Length2471 bp 
Protein Length693 aa 
Translation table12 
GC content44% 
IMG OID640390668 
Productcarboxypeptidase B-like processing protease 
Protein accessionXP_001384992 
Protein GI126136937 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCGCTTTTGG CTCGCTGTTG TAGTTTCTGG TAGTATCCAT CTCGTGGCGT CTTCAAGTCT 
ACGTCTGAGC AATAAGCAGT GAAGAGTATA TCGTATTTCC GAACATAAGA AAGCAAAGTA
TCTGAAACAT TCATAGTGAA TGATATCACA ACGTGAAGAC ATAAATATAT ACCCAGCAAC
ACTTTCATAA CAGCACTACC ATACTATCAC ATCAACGCCA AGGCTTTCAT TGTCGTCAGG
ACTAACTTAA AGATCATAAA GTCACATCTA TTACAATTCT ATTATAACCT TACTATCCAC
TATACCCGTA AATCACTATC GAAGTCCACG TCAACATGTA CTGCCTCCGC TTATTCCTTT
TCTTTGCGGT TAGTTCGCTT GTCTCAGCAC TTCCTCCAAA ATTCGCTGGG TCGGACGTCC
AAAAGCAGTA TTTGGTTCTT GATCTTCCCG GACTCCATAC CAACGTCCAG GAAGAAGATA
TACCCCTCAT GTTCTCGGGC CAGTTGCAGT TGTATCCAGA GAACAACACC AACTACTTCT
TCTGGTCGTA TAAAGATCAA CATCCTTTGC CGGAAAACAC GAATAGAACA ATGTTTTGGC
TTAATGGAGG TCCTGGGTGT TCCTCTCTAG ATGGAGCTCT CTTGGAAGCT GGCCCTTTCA
GAGTCAACGA GGACCGCAAA ATAGTCTACA ATAAGGGTTC GTGGCACAAG GCCGCCAACA
TGGTATTTGT GGACCAGCCG GGTGGAACCG GTTTTAGTTA CACCGATGTC TACGACTCCG
AGCTCTATCA GGTGACGCAG GACTTTTTGG TATTCATGAG TAAATACTAT GAGATCTTCC
CGGAGGAAAG GGACAATGAG ATCTACTTTG CTGGAGAAAG CTATGCTGGA CAGTACATCC
CGTATATTGC CGATGGAATC TTGAGACATA ACAGGAATCT CACAGAAGGC GAAAAGCCGT
ATAACTTGAA AGGCTTGTTG ATTGGCAATG GCTGGATCTC ACCCAACGAA CAGAGCTTGT
CGTATTTGCC CTACGCGGTC CAGGCAGGAA TCGTCAGCAC CGAAAATGAA AGATGGGGTC
AAATACTTAG CGATCACGAG CAATGCCAGA AGATAGTGAA CAGGATCGAT GCAAACTTCG
ATGGAGAGCT CCATGATTAT GAGGTTTCTT CATCAACTTG TGAGAGAGTG TTGCAGACTT
TATTGACCAT TACGAGAGAC AAAAGTTTAC CCAAAGACGA GCAGTGCTTT AATATGTATG
ACTACACTAA GAAGGACAGC TTTCCATCTT GTGGAATGAA CTGGCCCCAT GAGTTAGTAT
TTGTAATGCC CTTCTTGCGT GAAGACGAAG TCAAAGGCGA CTTGAATATC AAGAACAACC
AGGTGTGGCG TGAATGTTCA GGAGCTGTAG GCTCTCATCT TCATGCTCGC AATTCAATAC
CCTCTGTGCA TCTTCTTCCG TCCATTTTGG AAACCGTTCC TATAGTCTTA TTCAACGGCA
ATCTAGACAT CATCTGCAAT TATATGGGCA CTGAAAGTTT TATCAAGAAA ATGACCTGGG
GTGGCAGCAA AGGCTTTTCT TCTCAAGATA CAACTGACTG GATCTACGAC AGCAAGACAG
CAGGCTATAT CAAGTCCGAG AGAAACTTAA CATTTGTCAA TGTCTTTGGA GCTTCCCATA
TGGTACCCTA TGACGTTCCG GAAATCTCAC GTGCATTGAT CGACCTTATC ACTGGAAACT
ATGATGTTCA GGAGACTCAA ACAAAGAGCG ATAAAACTAA AAAGTCATAT GTAACATATC
CTATAGGTGT GAGGGCTGCT AAGCTTGAAG CTGATGCAAA GGCCAAGGCA GAAGCTGATG
CAAAGGCCAA GGCAGAAGCT GATGCAAAGG CCAAGGCAGA AGCTGATGCA AAGGCCAAGG
CAGATGCTGC TAAGCAAGGA GGGCAAGCCA GTCCTACTGA AGAAGAGAAA GTCGACGGTG
ACAAATCGAA CTCAGATGCT TCAACGTCAG AATCAGCCAT ACCAGAAGAT TACGATAAGT
CTGCCACGGT AAGCAAGATA ACGAGAGTGA TCCAGTTGTT AGTGATAATA GTATTGATCT
GGGGAATGTA TGTTCTCTAC ACCTCGTGCA AATCTAGACC ATCTTCTATC ATCAAAACTG
GACCTTCTAC TGGCAAAAAG AAGAATGTTC AATGGGCGGA CCAGTTGAGA AGATTTCAAG
AAGACGACGA AGAAGCTCAG AGACAAAACC AAGGCTTCTT CTCCAAGACG TTTGGAAAAT
TTACAACAGG CGACAACCGT GGAAACTACA CACCAGCTCC GGATAAGTAC TACGAAGATA
TAGAGTTGGG CGATGGCATC ACTGAGCACG ATGAACAGAG CGGAGCTTCT TTGGGAAGTG
CTAGTGTGGA CAATTTTGTC ATTGATAGTG AGGAAGAAGA CGAACTTGAG GAGCAAGAAC
AGGTACACAC T
 
Protein sequence
MYCLRLFLFF AVSSLVSALP PKFAGSDVQK QYLVLDLPGL HTNVQEEDIP LMFSGQLQLY 
PENNTNYFFW SYKDQHPLPE NTNRTMFWLN GGPGCSSLDG ALLEAGPFRV NEDRKIVYNK
GSWHKAANMV FVDQPGGTGF SYTDVYDSEL YQVTQDFLVF MSKYYEIFPE ERDNEIYFAG
ESYAGQYIPY IADGILRHNR NLTEGEKPYN LKGLLIGNGW ISPNEQSLSY LPYAVQAGIV
STENERWGQI LSDHEQCQKI VNRIDANFDG ELHDYEVSSS TCERVLQTLL TITRDKSLPK
DEQCFNMYDY TKKDSFPSCG MNWPHELVFV MPFLREDEVK GDLNIKNNQV WRECSGAVGS
HLHARNSIPS VHLLPSILET VPIVLFNGNL DIICNYMGTE SFIKKMTWGG SKGFSSQDTT
DWIYDSKTAG YIKSERNLTF VNVFGASHMV PYDVPEISRA LIDLITGNYD VQETQTKSDK
TKKSYVTYPI GAKAEADAKA KAEADAKAKA DAAKQGGQAS PTEEEKVDGD KSNSDASTSE
SAIPEDYDKS ATVSKITRVI QLLVIIVLIW GMYVLYTSCK SRPSSIIKTG PSTGKKKNVQ
WADQLRRFQE DDEEAQRQNQ GFFSKTFGKF TTGDNRGNYT PAPDKYYEDI ELGDGITEHD
EQSGASLGSA SVDNFVIDSE EEDELEEQEQ VHT