Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_78088 |
Symbol | KEX1 |
ID | 4839353 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1555856 |
End bp | 1558326 |
Gene Length | 2471 bp |
Protein Length | 693 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390668 |
Product | carboxypeptidase B-like processing protease |
Protein accession | XP_001384992 |
Protein GI | 126136937 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCGCTTTTGG CTCGCTGTTG TAGTTTCTGG TAGTATCCAT CTCGTGGCGT CTTCAAGTCT ACGTCTGAGC AATAAGCAGT GAAGAGTATA TCGTATTTCC GAACATAAGA AAGCAAAGTA TCTGAAACAT TCATAGTGAA TGATATCACA ACGTGAAGAC ATAAATATAT ACCCAGCAAC ACTTTCATAA CAGCACTACC ATACTATCAC ATCAACGCCA AGGCTTTCAT TGTCGTCAGG ACTAACTTAA AGATCATAAA GTCACATCTA TTACAATTCT ATTATAACCT TACTATCCAC TATACCCGTA AATCACTATC GAAGTCCACG TCAACATGTA CTGCCTCCGC TTATTCCTTT TCTTTGCGGT TAGTTCGCTT GTCTCAGCAC TTCCTCCAAA ATTCGCTGGG TCGGACGTCC AAAAGCAGTA TTTGGTTCTT GATCTTCCCG GACTCCATAC CAACGTCCAG GAAGAAGATA TACCCCTCAT GTTCTCGGGC CAGTTGCAGT TGTATCCAGA GAACAACACC AACTACTTCT TCTGGTCGTA TAAAGATCAA CATCCTTTGC CGGAAAACAC GAATAGAACA ATGTTTTGGC TTAATGGAGG TCCTGGGTGT TCCTCTCTAG ATGGAGCTCT CTTGGAAGCT GGCCCTTTCA GAGTCAACGA GGACCGCAAA ATAGTCTACA ATAAGGGTTC GTGGCACAAG GCCGCCAACA TGGTATTTGT GGACCAGCCG GGTGGAACCG GTTTTAGTTA CACCGATGTC TACGACTCCG AGCTCTATCA GGTGACGCAG GACTTTTTGG TATTCATGAG TAAATACTAT GAGATCTTCC CGGAGGAAAG GGACAATGAG ATCTACTTTG CTGGAGAAAG CTATGCTGGA CAGTACATCC CGTATATTGC CGATGGAATC TTGAGACATA ACAGGAATCT CACAGAAGGC GAAAAGCCGT ATAACTTGAA AGGCTTGTTG ATTGGCAATG GCTGGATCTC ACCCAACGAA CAGAGCTTGT CGTATTTGCC CTACGCGGTC CAGGCAGGAA TCGTCAGCAC CGAAAATGAA AGATGGGGTC AAATACTTAG CGATCACGAG CAATGCCAGA AGATAGTGAA CAGGATCGAT GCAAACTTCG ATGGAGAGCT CCATGATTAT GAGGTTTCTT CATCAACTTG TGAGAGAGTG TTGCAGACTT TATTGACCAT TACGAGAGAC AAAAGTTTAC CCAAAGACGA GCAGTGCTTT AATATGTATG ACTACACTAA GAAGGACAGC TTTCCATCTT GTGGAATGAA CTGGCCCCAT GAGTTAGTAT TTGTAATGCC CTTCTTGCGT GAAGACGAAG TCAAAGGCGA CTTGAATATC AAGAACAACC AGGTGTGGCG TGAATGTTCA GGAGCTGTAG GCTCTCATCT TCATGCTCGC AATTCAATAC CCTCTGTGCA TCTTCTTCCG TCCATTTTGG AAACCGTTCC TATAGTCTTA TTCAACGGCA ATCTAGACAT CATCTGCAAT TATATGGGCA CTGAAAGTTT TATCAAGAAA ATGACCTGGG GTGGCAGCAA AGGCTTTTCT TCTCAAGATA CAACTGACTG GATCTACGAC AGCAAGACAG CAGGCTATAT CAAGTCCGAG AGAAACTTAA CATTTGTCAA TGTCTTTGGA GCTTCCCATA TGGTACCCTA TGACGTTCCG GAAATCTCAC GTGCATTGAT CGACCTTATC ACTGGAAACT ATGATGTTCA GGAGACTCAA ACAAAGAGCG ATAAAACTAA AAAGTCATAT GTAACATATC CTATAGGTGT GAGGGCTGCT AAGCTTGAAG CTGATGCAAA GGCCAAGGCA GAAGCTGATG CAAAGGCCAA GGCAGAAGCT GATGCAAAGG CCAAGGCAGA AGCTGATGCA AAGGCCAAGG CAGATGCTGC TAAGCAAGGA GGGCAAGCCA GTCCTACTGA AGAAGAGAAA GTCGACGGTG ACAAATCGAA CTCAGATGCT TCAACGTCAG AATCAGCCAT ACCAGAAGAT TACGATAAGT CTGCCACGGT AAGCAAGATA ACGAGAGTGA TCCAGTTGTT AGTGATAATA GTATTGATCT GGGGAATGTA TGTTCTCTAC ACCTCGTGCA AATCTAGACC ATCTTCTATC ATCAAAACTG GACCTTCTAC TGGCAAAAAG AAGAATGTTC AATGGGCGGA CCAGTTGAGA AGATTTCAAG AAGACGACGA AGAAGCTCAG AGACAAAACC AAGGCTTCTT CTCCAAGACG TTTGGAAAAT TTACAACAGG CGACAACCGT GGAAACTACA CACCAGCTCC GGATAAGTAC TACGAAGATA TAGAGTTGGG CGATGGCATC ACTGAGCACG ATGAACAGAG CGGAGCTTCT TTGGGAAGTG CTAGTGTGGA CAATTTTGTC ATTGATAGTG AGGAAGAAGA CGAACTTGAG GAGCAAGAAC AGGTACACAC T
|
Protein sequence | MYCLRLFLFF AVSSLVSALP PKFAGSDVQK QYLVLDLPGL HTNVQEEDIP LMFSGQLQLY PENNTNYFFW SYKDQHPLPE NTNRTMFWLN GGPGCSSLDG ALLEAGPFRV NEDRKIVYNK GSWHKAANMV FVDQPGGTGF SYTDVYDSEL YQVTQDFLVF MSKYYEIFPE ERDNEIYFAG ESYAGQYIPY IADGILRHNR NLTEGEKPYN LKGLLIGNGW ISPNEQSLSY LPYAVQAGIV STENERWGQI LSDHEQCQKI VNRIDANFDG ELHDYEVSSS TCERVLQTLL TITRDKSLPK DEQCFNMYDY TKKDSFPSCG MNWPHELVFV MPFLREDEVK GDLNIKNNQV WRECSGAVGS HLHARNSIPS VHLLPSILET VPIVLFNGNL DIICNYMGTE SFIKKMTWGG SKGFSSQDTT DWIYDSKTAG YIKSERNLTF VNVFGASHMV PYDVPEISRA LIDLITGNYD VQETQTKSDK TKKSYVTYPI GAKAEADAKA KAEADAKAKA DAAKQGGQAS PTEEEKVDGD KSNSDASTSE SAIPEDYDKS ATVSKITRVI QLLVIIVLIW GMYVLYTSCK SRPSSIIKTG PSTGKKKNVQ WADQLRRFQE DDEEAQRQNQ GFFSKTFGKF TTGDNRGNYT PAPDKYYEDI ELGDGITEHD EQSGASLGSA SVDNFVIDSE EEDELEEQEQ VHT
|
| |