Gene Information |
Locus tag | PICST_73563 |
Symbol | |
ID | 4840787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 727596 |
End bp | 730055 |
Gene Length | 2460 bp |
Protein Length | 782 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640392102 |
Product | predicted protein |
Protein accession | XP_001386151 |
Protein GI | 150866517 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
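
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates, which is consistent with the listed values; the function names are illustrative and are not part of the record. The final figure is only the difference between the listed gene length and the length implied by the protein. Reading it as flanking non-coding sequence is an inference from the numbers, not something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein of this length plus one stop codon."""
    return (protein_aa + 1) * 3


gene_bp = span_length(727596, 730055)   # Start bp / End bp from the record
cds_bp = coding_length(782)             # Protein Length from the record
extra_bp = gene_bp - cds_bp             # bases not accounted for by the protein

print(gene_bp, cds_bp, extra_bp)
```

The first value matches the listed Gene Length of 2460 bp.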
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.560069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.314839 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | AATCGTAAAA GTTCCGGGAA ACCATGGTGG CCAAGAAGGG AAACCCCATT TTCGCCAATA AAGAAGACGG CAACTTCCGC GAGGCATTGA AGCTCTACGA CGCCAAACAG TATAAGAAAG CTCTCAAGCT TGTAGAGACA AACCTCAAGA AAAACTCCAA CCATGCCGAA TCTCTTGCAC TCAAAGGCTG TATCAACCAC AACATCGGCA ACAAGTCCGA GGCTGAGTCA TATGTGCTCA AGGCTATCCA AAAGGCACCA GCTAACTACT TGGTCAATCA TTTGGCTGGG ATCTATTACC GTTCTGTAGA GAACTATGTA GAAGCTAGCA AGTGGCTCAA GGCTGCAAAT GACAATGGAT CTCCCAATAA GCCCATTTTG CGTGACTTGT CATTGATGCA AACCCAGATC AGAGACTATA AAAACTTGAA AGACTCCAGA CAGCAGTATT TGGAGTTCCA ACCAGGTTAC AGAGCCAACT GGACTGGTGT AGCTATAGCC CACCACTTGA ACAAGGACTA CGCCGCGGCC GTTAGCACTC TCACAAAGAT AGAAGGCATA ATCAAGGAGC ATTTGACTGA CGCTGACCGT TACGAACAAT CGGAATGTGT ATTGTACAAG GTTGATATCA TAGCTAAGTC AGGCGACATC GCCAAAGCTC TTGCTACTTT GGAGGAGGAC TCCTCTGAAA TCAAAGACAG ATTGTCCTTC TTGGAATACA AAGCTCGATT CTTGATGCTT CTTGACAAAA AGCACGAAGC ATCTCTCATC TACCGTCAAT TGCTCCAGAG GAACCCAGAC AACATGCAGT ACTACAACTT GTTAGAACTC TCTTTGGGAA CTGTAGGCAA TTCAGTAGAT CTCAGATTGA AGCTTTACGA GAAATTGGAT CTGTTCTATC CACGTTCAGA TCCTCCCAAG TTTCTTCCAT TGACTTTCAC ACCTTCTTCC CATGCTAGAT TTGAAGAGAA GGTCCGTAGC TACCTTGTTC CACAGTTGAA AAGAGGGATT CCAGCTACTT TTGTCAATGT CAAGCCTTTG TACAAAAACC ATAAGAAGTT GAAAGTCATC GAAAAAGTTG TCTTGGATTT CTATGCAGCA GAATTGCCCA AGATCGATAA TCCGACCGTC TTCGTCTGGA CTAACTACTT CTTGGCTCAG CATTACTTGT ATCTTAACGA GTTGGACACT GCCAACAAAT ACATCGATGA TGCTTTGAAA CATTCTCCAA CATTGGTAGA GTTGTATACA ATCAAGGCTA GAATCGTTAA ACATCAAGGT AAGTTTGAGG AAGCAGCTGA CATTATGAAT GCTGGACGTG AGTTAGATTT GCAAGACAGA TTCATTAACT CTAAGGCTAC CAAATATTAC TTCAGAGCTA ATAAAGTTGA CGAGGCTATT GATTGCATTT CATTATTCAC CAAACTCGAA GATGGAGTCA TTAACGGTTG CAAAGACTTG CACGTGATGC AGGTAAACTG GGTCTTAGTA GAAAGTGCCG AAGCGTACAA GAGATTGTAC CATGAGTATG AAGCCAAGTT AAAGGAATTC AAAGAAAGCT CGCCCTTAGA AAGCGAGGAA TCCAAGGAGT TGGAAAATGA ACTTGTAGAA AATATCGAAA CCTACAAGGG TTTGGCTTTG AAGAGATACA ATGCCGTTTT AAAGATCTTC AAAATCTACT ACAATGATCA ATATGACTTC CACTCCTACT GTATGAGACG TGGGACACCC AGAGATTATA TTGACACCTT AGAATGGGAA GACAGAATCC ATTCCACCCC CATCTATACT CGAGTATTGA 
AGGGAATGGC TGAAATTTAC TTTGAAATTT ATAACGAGCA ACAGGTGAAA CAAAGTTCTG AATCTCCCCT GATTGAAGAA GGCAAGACCA CCAAGAAGCA AAACAATAAA AAGAATAAGA AGAAGTCACA AATCAACAAG AAGAAGGCAG ATTTCATTGC CCGTGTTGAA AGTGAAAAGG ACGACGAAGA TCCTCTTGGA GCCAAATTGT TGAGCGACTT GACCAATGAC GACCAGATCA TTGACAAGTT GTTCCAGTTG TACAAACCTT TGATTGAAGA AGGTAAGGAC TTGAGACTCA CTCACGAAGT CTTGTACAAG ACATACCTTA TTGAAGGAAA GTACGTGTTA GCTTTGCAAT CTGTGAAGAG TTTGAACAAG GTGTTGGGAG GAAGTTCTGA CGTCAAGTTA AGAAGCATTG GCGAAAGAGT AGTGGAACTT TCTGAAACTG CTCAGAATGA CAAGAATGCG AACGCTGCCA TCGTCAAAGT AGTAGGAAAG GGCCTCGTCA GTGCCTTTCC CGAATTTGGT GAGTTGAGCA AGGAAGATTT CTTGAATGTC TACAGCAAAT AGATTATAGA GGCACCACTT TATTCATAGA ATTTTATTTA ATAATAGTAT TTATTATCTA ATAGTTAATA TAATTGAATA TTATTAATTT
Protein sequence | MVAKKGNPIF ANKEDGNFRE ALKLYDAKQY KKALKLVETN LKKNSNHAES LALKGCINHN IGNKSEAESY VLKAIQKAPA NYLVNHLAGI YYRSVENYVE ASKWLKAAND NGSPNKPILR DLSLMQTQIR DYKNLKDSRQ QYLEFQPGYR ANWTGVAIAH HLNKDYAAAV STLTKIEGII KEHLTDADRY EQSECVLYKV DIIAKSGDIA KALATLEEDS SEIKDRLSFL EYKARFLMLL DKKHEASLIY RQLLQRNPDN MQYYNLLELS LGTVGNSVDL RLKLYEKLDS FYPRSDPPKF LPLTFTPSSH ARFEEKVRSY LVPQLKRGIP ATFVNVKPLY KNHKKLKVIE KVVLDFYAAE LPKIDNPTVF VWTNYFLAQH YLYLNELDTA NKYIDDALKH SPTLVELYTI KARIVKHQGK FEEAADIMNA GRELDLQDRF INSKATKYYF RANKVDEAID CISLFTKLED GVINGCKDLH VMQVNWVLVE SAEAYKRLYH EYEAKLKEFK ESSPLESEES KELENELVEN IETYKGLALK RYNAVLKIFK IYYNDQYDFH SYCMRRGTPR DYIDTLEWED RIHSTPIYTR VLKGMAEIYF EIYNEQQVKQ SSESPSIEEG KTTKKQNNKK NKKKSQINKK KADFIARVES EKDDEDPLGA KLLSDLTNDD QIIDKLFQLY KPLIEEGKDL RLTHEVLYKT YLIEGKYVLA LQSVKSLNKV LGGSSDVKLR SIGERVVELS ETAQNDKNAN AAIVKVVGKG LVSAFPEFGE LSKEDFLNVY SK
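
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of that difference, written without external libraries. The helper names `codon_table` and `translate` are hypothetical, and the 21-base fragment is copied from the gene sequence above, where it corresponds to the residues DSFYPRS in the listed protein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base each cycling T, C, A, G).
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Build a codon table; optionally apply the table 12 change (CTG -> Ser)."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment taken from the gene sequence in this record.
fragment = "GATCTGTTCTATCCACGTTCA"

table12 = translate(fragment, codon_table(ctg_as_serine=True))
standard = translate(fragment, codon_table(ctg_as_serine=False))
print(table12)   # DSFYPRS
print(standard)  # DLFYPRS
```

Only the table 12 reading reproduces the listed protein at this position, which is why the translation table field matters when re-deriving the protein from the nucleotide sequence.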