Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33244 |
Symbol | |
ID | 4840570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 107211 |
End bp | 108296 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391885 |
Product | predicted protein |
Protein accession | XP_001386036 |
Protein GI | 150866435 |
COG category | [R] General function prediction only |
COG ID | [COG4784] Putative Zn-dependent protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.500627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCCAA TATCGAGAGC CTTTGGACGG CAATTGGGTC GACCATCAAC GTTTTCTTCC CGTTTTCAAA GTGTTTCTCC AGCTTCTTTT GCTCTTCGTA GTAGAGTTTT TCAGTCGGCC CCAGCTAGAC AATATGCGAC CTACAACCGT TTCAATGGTT CTTCGTCGTC TTCTTCATGG AATACGACTA CTTTCATAAA TCTATTAACC AGCAGAAGAA CCATCTACTT TGGTGTAGGT TTCTTGGGCT TTTATGTCTA CAATCTCAAT GAAGCACCTT TTACTCACAG GCGTAGACTC ATCTGGATTC CCTACTGGCT CGAAACTAAA ATCGGAGATT TTTCCTATAG ACAAATAATG TACCAATACG GTGATAAGTT GGTCTCCAGC CAAGATCCCT TGTATGGGCG AATCTCCAAG ATCATGAATA GATTGCTCTC AGTAGCCCTT GAGAATAACG AGAATCAGGC ACAAAGACAC CATCTCGAAA GCTTGAAATG GACCATCCAT ATCATCAAAG TAGATCCCAG AGAGTATCCG CCTAATGCTT TCATTTTGCC CAACGGTAAG ATCTTCATTT TCAGCTCGAT TTTGCCCATC TGCAAAAACG ACGATGGCTT GGCAACCGTG TTATCACATG AGTTATCGCA TCAGTTAGCT CATCATTCGT CAGAGCAGTT GTCCAAACAG CCCTTCTACA TCATGTTGTC AACGCTTTTG TATACAGTAA CAGGAATCAG CTGGTTCAAC GACTTGATGA TTAAGGGTTT ACTTGAAATG CCTGCTTCAC GTGAAATGGA ATCGGAAGCA GATCACATAG GCTGTGAACT TCTAGCCAGA TCTTGTTTCA ACATCGGTGA AGCAGTCCAA TTCTGGAAAA GAATGGCTCA AGCAGAAGAA GGCTTTCAAG CTAGAACTGG ATCTCTGAGA CTACAAGAAT TCTTCTCGAC CCATCCAGCC ACAGACAGAA GAATAAATGA TATACAACAT TGGACTCCAG GTTTGGAAAT TATAAAGGAA TCGTCCGGAT GCTACGAACA CCAATTCGGT CTCTTTCAAG AAGTTTCCCG CAACTTCTTT AGATAA
|
Protein sequence | MFPISRAFGR QLGRPSTFSS RFQSVSPASF ALRSRVFQSA PARQYATYNR FNGSSSSSSW NTTTFINLLT SRRTIYFGVG FLGFYVYNLN EAPFTHRRRL IWIPYWLETK IGDFSYRQIM YQYGDKLVSS QDPLYGRISK IMNRLLSVAL ENNENQAQRH HLESLKWTIH IIKVDPREYP PNAFILPNGK IFIFSSILPI CKNDDGLATV LSHELSHQLA HHSSEQLSKQ PFYIMLSTLL YTVTGISWFN DLMIKGLLEM PASREMESEA DHIGCELLAR SCFNIGEAVQ FWKRMAQAEE GFQARTGSSR LQEFFSTHPA TDRRINDIQH WTPGLEIIKE SSGCYEHQFG LFQEVSRNFF R
|
| |