Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_56651 |
Symbol | |
ID | 4838152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 702929 |
End bp | 706153 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389467 |
Product | predicted protein |
Protein accession | XP_001383768 |
Protein GI | 150864794 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.798232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TACTCTACCT CATCCCGTCT CACCCATCTC ATGCTGCATT ATACGGTCTT GGCCGAAAAC GACGTTATTG AGAAGCCTCT TCTCGACAAC AGAACCTATC GTTACTTAAA GCTCGACTCG AACGATCTCC AGGTCCTCGT GATTCATGAC TCAACTGCTG ACAAGTCTGC GGCCTCTTTG GATGTCAATG TAGGCTCGTT CGCTGACAAA AAGTATGGAA TTCCTGGTTT GGCTCATTTC TGTGAACATT TGCTATTCAT GGGTACCGAA AAGTATCCGG CTGAAAATGA ATATTCCTCG TATTTATCGA AACATTCTGG GTACTCCAAT GCTTATACTG CAGCAGAACA TACGAATTAC TACTTTCAGG TAAGTGCCGA CTACTTAGAA GGTGCTCTAG ACCGGTTTGC TCAATTCTTC GTGGCACCCC TTTTTAGCCA GAGCTGCAAG GATCGAGAAA TAAACGCTGT AGACTCAGAA AACAAAAAGA ATCTCCAAAA CGATCTCTGG AGACTCTACC AGCTCGACAA GCTGAACAGT AACCCTGACC ATCCTTACAA TGGGTTTTCT ACAGGTAACT ACCAGACTTT GCATGTAGAG CCTTCTGAAA GAGGACTCAA TGTTCGTGAT GTCTTGCTCG ATTTCTACAG CAACAGCTAC TCGTCGAACT TGATGAGTTT GGTTGTGTTG GGAAAGGAAG ATTTGGACAC TTTGTCAGCT TGGGCTATCG AGAAGTTCTC GGCCGTGCCT AACAAGAGCT TAACAAGACC AAACTTCCAT GGCGAAGTCA TCTTGACAGA TAAGTACCTC GGTAAGTTGA CCAGGGCCAA GCCCATTATG GACAAGCATC AACTCGAGTT AACATTCATG GTTCCTGATG ATTTGGAAAC CAAATGGAAA TCTAAACCCA ACGGCTACTT CTCCCATTTA TTGGGACATG AAAGCGAAGG CTCTGTGCTT TTCTTCTTGA AACACAAGGG TTGGGTAACA GAATTGTCGT CTGGTAACAT GCGAGTCTGT CAGGGTAACT CTTTCTTTAT CCTCGAATTT GAGTTGACTC CGGAAGGGTT GCAGAATTGG AAGGAGATAG TAGTTTCTGT GTTTCAGTAC TTGAAGTTGA TTCTTCCAGA AGAGCCCAAA AAATGGATCT ACGACGAGAT CTCTATGATG TCTGCCATTA ACTTCAAGTT CAGGCAAAAG GCAGATGCCG CTAACACTGT TTCTCTGATG AGCAACACTT TGTACAAGTT TGCTGTAGAT GGTTATATTC CTCCAGAATA CATTTTGAGT TCTTCTGTTT ACAGAGAATT CAACAAACAG GAGATCATAG ACTTTGGCAA ATTCTTGAAT CCAAACAATT TCAAGATTTC TTTGGTTTCG CAATCGTTAG ATGGCTTGAA CAAGTCCGAA AAATGGTACG GAACTGAGTA TGCTTATGAA GATATACCAG TAGATTTATT ACAAAACGTT GAATCGGCGC AATTGAATCC ACACTTTCAC TATCCCAAAC CCAACGATTT CATTCCCAAG GACTTTGAAG TGTTAAGAAA GAAGTCAGAA ACTCCTTTGC AACATCCTTA CCTCATTGAA GAGAGCAACA AGCTCCAGGT CTGGTACAAG CAAGACGATC TTTTTGAAGT TCCCAAAGGT AATATCGATA TAGTGTTCCA TTTGCCCAAC TCCAACTTGG ACAAGAAGAC TTCTACCTAC TCGTCTTTGT TGGCTGAATT GATAACCGAT GAATTAAACC AAGTCACTTA TTATGCCTCA TTGGTTGGCT TAAAGGTGCT GATATCATGC TGGCGTGATG GATTCAACGT GAGAGTATCA GGTTACAGTG ACAAACTTCC AGTGCTTTTA GATCAGGTTT TGTCTAAATT CTTTAATTTC AAACCTAACA AGGAAAGGTT TGAAGCTATC AGATTCAAAT TGTATCAACA ATTCAAGAAT TTTGGATATG ATGTTCCGTA CCGTCAAATT GGAACTCATA TTTTACTGTT GCTCAACGAG AAAACCTACA CTTATGATGA AAAAGTTCAA GTTATGGACG AAGATCTTTC ATTTGACGAA TTGAACGAGT TTGCAACCAA GAATCTTTGG AAATCAGGAA TCTTTACTGA AGTTTTGATC CATGGTAACT TTGATATCGC CAAAGGAGAT GAAATCAGAA AATTGATAGC TAGTCACACC AAGAGTCTTG CTCCTATTGC TGATACTTTA GATGATGTCA ACAAAGCCAT CAAGCTTCAG AACTTTGTGC TTCCATCTAA GGAGTTTATT AGATACGAAT TGCCATTGCA AGATGAGAAA AATATTAACT CTTGTATTGA GTACTACATC CAGATCAGTC CCACGAACGA TGATCCCAAA TTGAGAGTGT TGACTGATTT ATTTGGTACC ATTATCCGTG AACCTTGCTT CAATCAATTG AGGACAAAGG AACAACTAGG TTACGTAGTT TTTTCTGGTA CTAGATTGGG CAGAACGTCG ATAGGTTTCA GAATCTTGGT TCAGTCGGAA AGAACTGCAG ACTATTTGGA GTACAGAATT GATGAGTTTT TGGGCAAGTT TGGCAAGCAC ATCAACTCAG AGTTGACAGA AGTCGATTTC GTCAAATTCA AGCAAGCACT CAAGGACCTT AAATTATCAA AATTGAAGCA CTTGAACGAA GAAACTTCTA GACTCTGGAA CTCCATTACT GATGGCTACT TTGATTTTGA AGCCAGACAG AAGCATGTTA AGATATTGGA GACCATCAGT AAGGAAGAGT TTGTCGATTT CTTTAACAAC TACATTGCTG ATGGATCTGA CAAGTCCGGC AAGCTTGTCG TCTACTTGAA TTCGCAATCT CCTCCTGAAC AGACAACGCT CAAGTTGGCA CATAGTTCTA TTATAAACTA TATCTACAGA AATGGCTACG AGGCTTCCAC CGAAAAGTTG GAATCCATAG TGAAGGAGAA TCTTGAAAAT CACCAACAGT TGGTTAAACA AGTTGCTGAA GAAATTCTGG AATATGTATC CAACAAACCA CCAGCTAACT TGAAACAAGA TTTGCTTATT GCTTTTGAAA ACGATATCAA GACTCCAGTT CCCACCAAAT ACCGCCAGGG AACGGTGTAC AAGGATATTT CAGAATTCAG AAAACACTAC TCTTTGGGAG GAGTTCCTTC AGCAGTAGAG CCTTTGACCA AATATTACTA TCCTGGACGT AACCCACACT TATAA
|
Protein sequence | YSTSSRLTHL MSHYTVLAEN DVIEKPLLDN RTYRYLKLDS NDLQVLVIHD STADKSAASL DVNVGSFADK KYGIPGLAHF CEHLLFMGTE KYPAENEYSS YLSKHSGYSN AYTAAEHTNY YFQVSADYLE GALDRFAQFF VAPLFSQSCK DREINAVDSE NKKNLQNDLW RLYQLDKSNS NPDHPYNGFS TGNYQTLHVE PSERGLNVRD VLLDFYSNSY SSNLMSLVVL GKEDLDTLSA WAIEKFSAVP NKSLTRPNFH GEVILTDKYL GKLTRAKPIM DKHQLELTFM VPDDLETKWK SKPNGYFSHL LGHESEGSVL FFLKHKGWVT ELSSGNMRVC QGNSFFILEF ELTPEGLQNW KEIVVSVFQY LKLILPEEPK KWIYDEISMM SAINFKFRQK ADAANTVSSM SNTLYKFAVD GYIPPEYILS SSVYREFNKQ EIIDFGKFLN PNNFKISLVS QSLDGLNKSE KWYGTEYAYE DIPVDLLQNV ESAQLNPHFH YPKPNDFIPK DFEVLRKKSE TPLQHPYLIE ESNKLQVWYK QDDLFEVPKG NIDIVFHLPN SNLDKKTSTY SSLLAELITD ELNQVTYYAS LVGLKVSISC WRDGFNVRVS GYSDKLPVLL DQVLSKFFNF KPNKERFEAI RFKLYQQFKN FGYDVPYRQI GTHILSLLNE KTYTYDEKVQ VMDEDLSFDE LNEFATKNLW KSGIFTEVLI HGNFDIAKGD EIRKLIASHT KSLAPIADTL DDVNKAIKLQ NFVLPSKEFI RYELPLQDEK NINSCIEYYI QISPTNDDPK LRVLTDLFGT IIREPCFNQL RTKEQLGYVV FSGTRLGRTS IGFRILVQSE RTADYLEYRI DEFLGKFGKH INSELTEVDF VKFKQALKDL KLSKLKHLNE ETSRLWNSIT DGYFDFEARQ KHVKILETIS KEEFVDFFNN YIADGSDKSG KLVVYLNSQS PPEQTTLKLA HSSIINYIYR NGYEASTEKL ESIVKENLEN HQQLVKQVAE EISEYVSNKP PANLKQDLLI AFENDIKTPV PTKYRQGTVY KDISEFRKHY SLGGVPSAVE PLTKYYYPGR NPHL
|
| |