Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31188 |
Symbol | |
ID | 4838630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 155750 |
End bp | 159415 |
Gene Length | 3666 bp |
Protein Length | 1221 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640389945 |
Product | predicted protein |
Protein accession | XP_001383992 |
Protein GI | 150864962 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0032345 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0791309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCAT TCTGGGACAA CAACAAAGAC ACGTTTAAGT CTGCTGGCAA AGCTACGATG CGTGGAGTGG CCAGCGGAAC GAAGGCTGTG AGCAAAGCAG GCTATAGAAC CTATAAAAAC CATTCAGGAG GTGGTGGAGG CAGTAGCAAC AATGATACTA CTGGTGAAGT AGACCAGCCT GAGTATATTG GACCACCGAG ACCGCTTCCC TCCAAAGACC AATTGCTGGC ACTTCCTCCT CCACCAAAGA GAAATATTCC CACCTATGAA GTTCCAGAAA AAGGAGCTCC ATCTCAGTAC AGTGTTCCTC AGCCTCAGCA GTATCTTCAA CAACAACCGC CACAAAATCA ACCGCAACAA ATTCAACCGC AACAAATTCA ACCACCACCA CAAAATCAAC AACTAGTACA ACAACCCCAA TTGTATCAAC AACCAAATCA ATATCAAAAT CAGTATCAAA ATCAAAATCA GCCAAATCAA ATTCCGCAAC CTCAGACGAA TTTGTACACT CCCGCAAATT CGCAGCAAAC ATTTTCGCAG CCAAATGCAC CAGCTCAACA ACTTCCACAG CAACTCCCTC CACCAGTTAA TGGATACTCA GATGCTAATG GTCAACAAGG GTATTATCAA CAGGGCTACC AAACTACACC TCCGGCTGGT AATGGCTATG GGGTTCAACA TCCACAGCAG CCTCAACAGA CACAACAACT TCCTCAAGGG CAACAGTCGC AGAATGAGAT GTATGCTCAG GCAGCCAAAC TGGCACTTCC TGTTCTTCTG AATTTGTATC AACAACATCA ACAGGGCCAG ACCCAGAACT CTGATGCTTC GCAACAGTTT CAACAGACTC AGCAGACTCC GCAACAGCAG CAACAAGCAA TGTACTCTCT GGCCGCAAAA CATGTTCTTC CTGTTCTCCT GAACATGTAC CAGAACCAAC AGCAGACTAG TGAGCTTCAG CAAGGTCAGC AAGGTCAACA GCCTCAAGGT CAGCAGCCTC AAGGTCAGCA GCCTCAAGGT CAGGTGCCTC AAGGTCAAGC TGATTTGTAT GCTCTGGCAG CGAAGCACGC ATTGCCTGTA CTTCTGAATA TTTACCAGCA ACAATCTCAA CAACAGCAAC AACCTCAACA ACCTCAACAA CAGCAACAAT ATCAGCAACC TCAGCAATTT CAGCAACCTC CGCAACCTCA ACAACCTCAA CAGCCTCAAC AACCTCAACA ACCTCAACAA CCTCAACAAC CTCAACAACA GCAACAATAT CAGCAAGTCC AATTGCCATC AAACCAATTT CAAAATGAAC TTGGTCAGCT TCCTCAACCT GGCCAACTTC CTCAACCTCC AGCTAGACAA TTGCCACCAC CTCCTCCAGA ACGCACTGTT CCTGCAACCC CAGGTATTCC AATTGCTTCA GGACAGTTCC AAGCTCAAGC TCCATTATCT TACGATTCAC CTTATTACCA ACCTCCATCT ACCCAACCCG CTGCAGCTGC TCAGCCAGCA CCGGCAAAGA CTTACGGGTT TGGTGTTGCA CAAGTTCAGG ATCCAGATGA AGCTCCTAAA CCCAAGAAAG AGTTGCCTGA TCCATCTCTG TTCGCTCCTC CTCCTATTCG TGCAGATAGA GTAGGACCAC CATCTACAAA ATCGGCTACT TCTGCAACTT CTCCTTCTCC TACGCAGCTT CCAGCCACTA CTTCTGGCAG CAAACTGTCT GTTGGAACTG CTCCTCCTTC AGCTCCCCCT AGAGCATCTT CTGCTGAACA AACCACTGCT CCTTCTGAAG TCAAAGAGGA ACCGAAGCTT CCTTCCAAGA CAAACCTCAT GGACTTTGAT ATATCCAAAT TCGGTGCTCC TCCTCCTAAA ATATATAGAG GACCACAAGA CGCTCTTCCC CTAAAGAAAT CTTCTGCTAA TGCTTCGGCT TCAACGGTTT CTTCTTCTTT TACACCTCCA CCTCCACCGC CTGCTCGTGT TTCCCCTTCT CCTCTGCCCC AAGCGTACTC TGAACAACCT GTAAAACCAC CAAAACCACC AAAACCAACC AAACCAACTA AGCCAAAGAC TTTGGAAATG GAGGATATAC CTCCTCCAAG GCCCGCTAGA GCAGACTCTA CAGAGGCTAC TCCACCACCG ATGCCAGCTA GAAAGCATGA TATTGTTGAA ATGGAAACCC CACCTCCTAA ACCTTCTAGA CCAGTAATAA CAGTAGATCT GAAGAAGGCT CCTCCTCCAA CTCCTTCAAG AAAGGCAGGG GTAGCAACAG ATCGCCCAAC TCCACCTCCT CCATATTTGG AAGTTTCTCC CCATCCCGAA ACATCGAATT CGCCTGTTCC TCCTCGTACA CCTAATTTTG CTGCAGAAAT CGCAAAAAGA AATGGTCACA GTGCTTCTCC GCAGCCAGAA CCAATTCAGA AGAAGGCAGC ACCTCCACCT GTATCCAAGA AGCCACTGAG TCTTCTGACC CACGATGCTA AAGAAGAGGA ACATGTATCT CTGAGGAATC CTTCTGCAAA AGGAGCTTTT ATTGAACAGC TTCAATCGCA ACTTCAAGCT ACCCATGTTG GAGAAATTCC TCATAAGGTT CCTCCTCCAG TACACTCAAA ACTAGGACCT AAACCCTTCG AAAAAAATGC ACCAGTAAAA GAACTGGAAC CCATAGAAAA GGCAAAACCA GCAGTGAAAC CAAAGCCAGC CGTTCAACCC AAATCTGTTG TGCTGCCTGT TGTTCCTTCA GTACACTCTG TTGTTCCTGT TATTCCTGCT CCAGCCGTTA TACCTTCTGT TCCTGCTCCT AGATCAACAG CTCCGGAAGT TTCTGCTGTA CCACCACCTC CTCCCACAAG AAACTACGTC AGATCCAAAG CTCCAATACC AGTAGCCCAT CCATCAAATG AACCACCGCA ACTTGATTTG GAGATGTATT CGGGTTGGTA CGCTGATGTC AATGGACCAA TTAATTTTCC TGAGGCATTA GCAGGCTTGA ACAACCAAAG TTCAATGCTG TACTCTACAT CGGGTGGCAT CACTAACTAC GAGAGACGTA TAAGTTTAAG ATTGAAAGAT TTGTCGTCTA TCAGATATGT AATAAAGTGG TCTAGTAATA ATGTAGCAGG AGCTACGGTG AAAATCGACA AGTTTATTCC TTCTCCAATC TCAAGCAATA TTCCATCCAA GGAGGAATTG GTAGGGTATC TGCAACAATA TGGAGAGCAT GTTGCTTCGT GGTGTGAACA TAGATATGGC CAGCAAGTTG GCCGAGGTGA GTGCTGGGAT CTTGCCAAAG AAGCTTTGGA GAAAGGGTGT GGAAAGCACG CTTTTGTCAG TGAATACTAT CATCACGGTT ATCCTATACT TCTGGTTAGA GGTGTTAATG GTATTATGCA ATTGATAGAT GACAAACAAC CGTTGGACGA AGTGAGGCGT GGAGATATCC TCCAGTTCAA GAGCTGTACA TTCTACAATG CTGCTAGTGG AAGAACCCAA ACCGTCGGAG CTCCAGACCA TACTTCCGTA GTTTTGGGTA ATGTAGGCGG CAAGATTCTT GTGGCAGAGC AGAACGTTAA CAACGTTAGA ACCGTTCAGA ATGGAGAGTA TATTTTGAGA GATTTGACTC TGGGAGACGT TTGTGCATAT AGACCAGTTC CTGCCAGTTG GGCAGGGTCA TTGTAG
|
Protein sequence | MSSFWDNNKD TFKSAGKATM RGVASGTKAV SKAGYRTYKN HSGGGGGSSN NDTTGEVDQP EYIGPPRPLP SKDQLSALPP PPKRNIPTYE VPEKGAPSQY SVPQPQQYLQ QQPPQNQPQQ IQPQQIQPPP QNQQLVQQPQ LYQQPNQYQN QYQNQNQPNQ IPQPQTNLYT PANSQQTFSQ PNAPAQQLPQ QLPPPVNGYS DANGQQGYYQ QGYQTTPPAG NGYGVQHPQQ PQQTQQLPQG QQSQNEMYAQ AAKSALPVLS NLYQQHQQGQ TQNSDASQQF QQTQQTPQQQ QQAMYSSAAK HVLPVLSNMY QNQQQTSELQ QGQQGQQPQG QQPQGQQPQG QVPQGQADLY ASAAKHALPV LSNIYQQQSQ QQQQPQQPQQ QQQYQQPQQF QQPPQPQQPQ QPQQPQQPQQ PQQPQQQQQY QQVQLPSNQF QNELGQLPQP GQLPQPPARQ LPPPPPERTV PATPGIPIAS GQFQAQAPLS YDSPYYQPPS TQPAAAAQPA PAKTYGFGVA QVQDPDEAPK PKKELPDPSS FAPPPIRADR VGPPSTKSAT SATSPSPTQL PATTSGSKSS VGTAPPSAPP RASSAEQTTA PSEVKEEPKL PSKTNLMDFD ISKFGAPPPK IYRGPQDALP LKKSSANASA STVSSSFTPP PPPPARVSPS PSPQAYSEQP VKPPKPPKPT KPTKPKTLEM EDIPPPRPAR ADSTEATPPP MPARKHDIVE METPPPKPSR PVITVDSKKA PPPTPSRKAG VATDRPTPPP PYLEVSPHPE TSNSPVPPRT PNFAAEIAKR NGHSASPQPE PIQKKAAPPP VSKKPSSLST HDAKEEEHVS SRNPSAKGAF IEQLQSQLQA THVGEIPHKV PPPVHSKLGP KPFEKNAPVK ESEPIEKAKP AVKPKPAVQP KSVVSPVVPS VHSVVPVIPA PAVIPSVPAP RSTAPEVSAV PPPPPTRNYV RSKAPIPVAH PSNEPPQLDL EMYSGWYADV NGPINFPEAL AGLNNQSSMS YSTSGGITNY ERRISLRLKD LSSIRYVIKW SSNNVAGATV KIDKFIPSPI SSNIPSKEEL VGYSQQYGEH VASWCEHRYG QQVGRGECWD LAKEALEKGC GKHAFVSEYY HHGYPILSVR GVNGIMQLID DKQPLDEVRR GDILQFKSCT FYNAASGRTQ TVGAPDHTSV VLGNVGGKIL VAEQNVNNVR TVQNGEYILR DLTSGDVCAY RPVPASWAGS L
|
| |