Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_82620 |
Symbol | |
ID | 4838111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 530998 |
End bp | 534058 |
Gene Length | 3061 bp |
Protein Length | 892 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389426 |
Product | predicted protein |
Protein accession | XP_001383376 |
Protein GI | 150864528 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.338646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.971769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGTTCAGAA TAGTTGGTTG AAATATCCAG AACTTCGTGA AACAGACACA ATTTTTGGAG TTTACCCACC TTCGTTTGAA CCCTCGAAAA AGTAGAAACT CACGACGTTT CCCCAGTCTT TTTTGGTTGC TCTATCTCGA TTTTGATAGA GTAAATTTTT CATAAAAACT CCATCCCATC ATCAACTTTC AACAACTCAT CTTCATTCAC AGTATTTAAC TCAATTTCGT GACTTTCGTT CAGTATCACA TCTCAATAAG AATACCAGTA TGCGGAAGTA TTTTTCCATG GTGAATGAAC GAGTGGTCAA AACGGCCTTT TTGCTGGTGT TGCTCATCAA CGTCTTGTAC TTTGTTAGGG TGTATTCTTC TGAAGAAATC CAGCTCACAC GTGCTGCTCT TGCTCATGCT GCTTCTACAG TCTTTTCCAC TGACGACCAG ATCGAGTTCC CCAAGGACGT TCCGTTGGAA CAGATTACAG ATTCCAAGCA GAGAGTCTCG TATCTTTTGC ACGAAGTAGA CGAGAACAAA GATGGCAATT ACTGGTTGGC CCATACGAAC GTGAAACACT CGAGTTTGAA AATCAAGCCC AGCGACTTTT TGCCATCTGA TTCTCCTAAT TGGCTGAATC GCCCCGAGCT TTTCTTTGAT CCCAGGTTCA CCCTTTCAAT CTACTTAGAT GAGTTAAAGC ACCAGTTGTT GAGTAGAAAC CCCAAAAATG AGAAGTCTCT CGACTCTGTG ATCCTGATGC CATTTGCATG GTCGGACTGG GTCGATTTGA CAATGCTTAA CGAAGAGTTG TCGAAACCTG TTGATGAACG CAGAGACTGT GAATGGCTTC AATCTCAAGT AAATAAGCCT ACAAAGAGAC CATTTTTCTG TGTAAACCTT AAGGATGCCA CGGATGAAGA AATCGAAGAA ACAGGCATTT CCAAAGAGAG CTTACCTGGT TTCTTGGTAA AGACTTCGCC CATGAACAAA GCTCCTCATA AACAGGTTAT GATGCAAGGA AAGGCACACT TGTATGCCTA TCAACAGAAC CCATTCACCA TTATTTTCTT AACCAAAAGT GGTACATATG AGGCCCAAAT TAGGGACAAT CGTAAGAGAC TTGTACACAC GGACATGTTC GAAAACTATT TGAGCAGAAG AGGAATCAAT CCAAATCATT TGGAAGACTA CGGCAATGAA ATTGTATTGA ATCCCCATGA TGAATTCTCA AGTTTGTTGA CCACTGTTGC TCCTAGACCT TTGAACTGGG ATGACGATAT TCACAAGATG AATGCCATCA CACAGCAAAC GGAAGTCAAT GCTTCTCGTG AATTGCATTT GAATCCCGAA TCGTTCAACT ACAAGCACGA GGCTGTGCTC AAGCAGTTGC ACGATTACCA GTATCGTTTA GACAAGCTTG AAGAAGCATT TTCCAACGAG TTACGTTACA GCCCTGAAGT TTTCAACGAG TTTTCGCTTG ATCGTCATGA GTTCAACCAC TACTCTGGTC TTAAAACTGC ATCGGAAACT CCAATTCAAG AGGAGCCAAC ATACTATAAG TTAGCTACTT TGCTCAAGAA GAACGGTAAT GTAGATGCCG GCTGGCACTA TGAATGGAGA TTCTTCAACG GTGCCTTAAG ATTTATCAAA GACGATACTT GGACCATGAA CCAGTTGGAA ATCAGAGAAC AAATCATCTT AGATAGATTG TTGCGTAACT GGTTCAGATT TGCAGAACAA AAGGGCATCA TCTCGTGGAT AGCCCACGGA CCTCTCTTAT CATGGTATTG GGACGGTCTC ATGTTCCCAT TCGACATCGA CATCGATATT CAGATGCCTT CAGCTGAATT GAACAGATTA TCCAAGCTCT ATAACATGAC GCTTGTAGTC GAAGACATCG ATGAGGGCTA TGGTAAATAC TTGATAGACT GTTCTACCTT CATTCACCAC CGTGACATGG CATACAAGGA TAACCACATC GATGCTCGTT TTATAGATGT TGACACGGGT ACTTATATCG ACATCACTGG TGTGGGTAAG AACAACGAGA ACCCTCCTCC GGAGTACGAC AGCTATATCA GAAGCAAGAA TGCTAAAGGA GAATCTGTAG AGTTGTACAT GGACAGAAGA AAGCACTGGT TGAACTTTGA GAAGATCAGC CCCCTCAGAT ACACACTGAT CAGTGGGGTT CCAGTGTATA TTCCAAATGA TGTGATGTCC ATGCTCAACA CTGAGTATTC CCATGGTACT TCGGCTTTCC ACTTTGATGG CTACTACTAT GTTCCATGTC TTAGATTATG GATTCAACAA GACAGAGTAG CCAAGATCTT CAACGAAAAT GACTTCAAGG TCGGCGACAA AATCGACAGA GACAAGCTTC TTAACTTGGT GGTAAATATG AACGATAATG ACAAAGCCAG ACTACTTGAA AGCGACGAAG AGTTACTCAT GGAGTATTAC TTGACTCATA AGCACACTGG CTTGCACCAA TTAGAGAAGA AGTTTCTCTT GGATGCTGGT TTGCAGCATT CCATTATAGA TTTACACGAT AACTATGATT ACCATATGTT GACGTCCAAC TTTAAGATGG GCAAGCCGTT GAGAAAGTCT TTGTTTGATT TCGAGTACTT TGAGAGATTT GAACACGATG AGTACGAACC AGCAAAGGGA GATGAGCCTC CTAAAAAAAT CATTAAGCCA AAGGTCAAAT CACAGAGTTT GGCTGCTGGT AAGTTGGAGC CTATAAAGGT TGTTCCTAAG CCAGATCCTA TAGCTGAACT CTTTAAGGAT CAAAAGCCGG CAACTCCAAA GGAACCCGAA CAACCAAAGG TAGCTGAACA ACCAAAGGTA GCTGAACAAC CAAAGGAACA ACCAAAGGAG CAATCAAAGG AACAGCCAAA GGGTAATGAA GAACAAAAGG TACCGCCTAA AGAAGAAAAT AAAGAACAAC AGGTATAATG AGTCCATACA CCTGCTAATA ATCAAGGTCA ATTAGAACAT TTCGCATTTA CGTCATTTTG CTATTCGTAG CATCCATGTT TCACTGTATA TTAGAATAAA CGTTAGAATA T
|
Protein sequence | MRKYFSMVNE RVVKTAFLSV LLINVLYFVR VYSSEEIQLT RAALAHAAST VFSTDDQIEF PKDVPLEQIT DSKQRVSYLL HEVDENKDGN YWLAHTNVKH SSLKIKPSDF LPSDSPNWSN RPELFFDPRF TLSIYLDELK HQLLSRNPKN EKSLDSVISM PFAWSDWVDL TMLNEELSKP VDERRDCEWL QSQVNKPTKR PFFCVNLKDA TDEEIEETGI SKESLPGFLV KTSPMNKAPH KQVMMQGKAH LYAYQQNPFT IIFLTKSGTY EAQIRDNRKR LVHTDMFENY LSRRGINPNH LEDYGNEIVL NPHDEFSSLL TTVAPRPLNW DDDIHKMNAI TQQTEVNASR ELHLNPESFN YKHEAVLKQL HDYQYRLDKL EEAFSNELRY SPEVFNEFSL DRHEFNHYSG LKTASETPIQ EEPTYYKLAT LLKKNGNVDA GWHYEWRFFN GALRFIKDDT WTMNQLEIRE QIILDRLLRN WFRFAEQKGI ISWIAHGPLL SWYWDGLMFP FDIDIDIQMP SAELNRLSKL YNMTLVVEDI DEGYGKYLID CSTFIHHRDM AYKDNHIDAR FIDVDTGTYI DITGVGKNNE NPPPEYDSYI RSKNAKGESV ELYMDRRKHW LNFEKISPLR YTSISGVPVY IPNDVMSMLN TEYSHGTSAF HFDGYYYVPC LRLWIQQDRV AKIFNENDFK VGDKIDRDKL LNLVVNMNDN DKARLLESDE ELLMEYYLTH KHTGLHQLEK KFLLDAGLQH SIIDLHDNYD YHMLTSNFKM GKPLRKSLFD FEYFERFEHD EYEPAKGDEP PKKIIKPKVK SQSLAAGKLE PIKVVPKPDP IAELFKDQKP ATPKEPEQPK VAEQPKVAEQ PKEQPKEQSK EQPKGNEEQK VPPKEENKEQ QV
|
| |