Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31214 |
Symbol | |
ID | 4838640 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 220450 |
End bp | 222009 |
Gene Length | 1560 bp |
Protein Length | 461 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389955 |
Product | predicted protein |
Protein accession | XP_001384345 |
Protein GI | 150865218 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGATG AGAAGCAATG TTTGAAGGCC AGTTATTTAT CTCCTTCTTA TATTTTGCTA GGAGATGAAT ATGATCTTGT TCAGGCATCT CAAAATATAT CCGGGTTCAA ATTTATAGAA ATTCACGTGA TCAACCGGAA CTAGAAAAGA ATATCTGAAA AACTTTTAAA CCCCAGAAGT TTCTTTTCCT CCAGGATTTG ATTGTAATTG CATTTACCTC CAAAATATGA TATGATAGGC AACAGAATAT TGAAGAACAG TACACGGATT ATACGGCTGC CGATTTCGAC TACCTTTCGC CGAACGTACA AGGTTCTAGC TATAGAAACG TCATGTGACG ATTCTTGTGT AGCTCTCTTA GACCGATATC TGCCTCTAGA GCCCCCCAAA GTGATTGACC AGATCAAAAA GACATTAGAT TCTGCTGATA TAGGTGGAAT TATGCCTACG GCAGCGTATG ATTTCCATCT CTCTACCATA GGAGGTTTGG TAGATGAGCT CTGCAAGAAA CATGGAATGA ATGCTCGTAA TCCACCAGAT TTGATATGTG TAACCCGAGG TCCTGGAATG ACAGGATCTT TATGTTCGAG CACACAATTT GCCAAAGGGT TATCTGTTGC ATGGGATGTA CCAATTGTAG GTGTTCACCA TATGTTAGGG CATTTGCTTA TAGCCCAGCT TCCTAAGACC GAGCAGCCAT GGTTGGGTGC TCCTAAGTAT CCTTTTCTTA GTTTACTTTG TAGCGGAGGT CACACGATGT TGATATTGCT GAAGTCGATT CAGGAGCACG AGATCATTGT CGAAGTGAAT GACATCGCTG TGGGAGATTC TCTTGACAAA TGCGCTCGCG AACTTGGGCT TTATGGGAAT ATGCTCGGAC AAGAACTAGA AAAGTATATC AATAATTTCC CTGAGGAACT CAAACAAGAG TTCGACAATG TTGATATAGA AACCAGGGAC AACGAGTACA AGTTCAAACT CAAGATGCCA TTCAAAGGAC CAGGAACTGG ACGAGTTCCT AAGAATATCC AGTTTTCGTT TGCTCAGTTT TTGAGTGCTA TTCAATCGTA CCGGATTCAT TATTTAAACA ACGAGCAGTT TGACAATAAA ACGAAGCAGA TGATCGCTTA CAAGACACAA GAGACAGTAT TTGATCATAT AGTGGACCGT ATCAACGTAG CATTCCAGAA ACACGGCTTG GACAGAAGCG TGTATAGAAA CGCCGATGGA AAGTTCGTAG GTATCCAAGA CTTCATCTGT TCCGGAGGTG TAGCAGCAAA CAGGCGTTTG CGTCAAAAGT TGAGTTCGAA TCTTGAGTAT AAGGAAGCGT TACGAACCGA CCAAGATTTA GCGTTCCATT TCCCGGACTT ATCGCTTTGT ACGGATAATG CCATCATGAT CGGAGTTGCC GGAATCGAAA TCTTTGAGAA ATTGAGAGTC AAGTCGGACT TGAACATCAC TCCTATAAGA AGATGGCCCA TGAACCAGTT GCTTGATGTG GATGGCTGGG TAAAGGTGGA TGACGCCGAG TTCAACAAAG TGTGCAAGTT TGAAAACTAA
|
Protein sequence | MSDEKQCLKA SYLSPSYILL GDEYDLVQAS QNISGQQNIE EQYTDYTAAD FDYLSPNVQA LLDRYSPLEP PKVIDQIKKT LDSADIGGIM PTAAYDFHLS TIGGLVDELC KKHGMNARNP PDLICVTRGP GMTGSLCSST QFAKGLSVAW DVPIVGVHHM LGHLLIAQLP KTEQPWLGAP KYPFLSLLCS GGHTMLILSK SIQEHEIIVE VNDIAVGDSL DKCARELGLY GNMLGQELEK YINNFPEELK QEFDNVDIET RDNEYKFKLK MPFKGPGTGR VPKNIQFSFA QFLSAIQSYR IHYLNNEQFD NKTKQMIAYK TQETVFDHIV DRINVAFQKH GLDRSVYRNA DGKFVGIQDF ICSGGVAANR RLRQKLSSNL EYKEALRTDQ DLAFHFPDLS LCTDNAIMIG VAGIEIFEKL RVKSDLNITP IRRWPMNQLL DVDGWVKVDD AEFNKVCKFE N
|
| |