Gene Information |
Locus tag | PICST_62257 |
Symbol | |
ID | 4839905 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 301481 |
End bp | 304360 |
Gene Length | 2880 bp |
Protein Length | 854 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640391220 |
Product | predicted protein |
Protein accession | XP_001385401 |
Protein GI | 150865970 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
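The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a sanity check, not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, and that any bases beyond the coding sequence (protein length plus one stop codon) are intronic, which fits "Is gene spliced | Yes". The variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 301481
end_bp = 304360
reported_gene_length = 2880    # bp
reported_protein_length = 854  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Coding bases needed for the protein plus one stop codon.
coding_bp = (reported_protein_length + 1) * 3

# Assumption: whatever is left over in the span is non-coding (intronic).
non_coding_bp = span - coding_bp

print("span:", span)
print("coding bp:", coding_bp)
print("non-coding bp:", non_coding_bp)
```

Under these assumptions the span agrees with the reported Gene Length, and the leftover bases give a rough size for the spliced-out sequence.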
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.374999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCAGT CATATCTACT ACACAAGTCG CAACTTTGGC CGTTGCTAAC AGATCAGCTC GATGCTGTTC ATCGTAAGAA ATCCATCAAA TTGGCCAATC TACCTGGTTC CAAACTTGTA GTATCAGATA TCATAGAGGT AATCAAAGAT TTTGTCAATC TGTACAACCC TCCATTAGAA TTAGAACGAC TCCTGCTTCA AAGTTGTCAG CTATCCGAAA TCTCTGGCAA CTTCTCATCA TTTTCATCTT CCATCAAATA CTTAGATCTT CACCAGAATA ATGTCCATTC CATCTCAGAA TCAATCATAT CCTGTTTTCC CAGCTTGGAG ATTCTTGATC TTTCTTCAAA TAACTTGTCA ACATTGCCAG GAAATTTAAG CCAACTTAAG AACTTACGAA TCATCCTGAT CCGTAATAAC AAGTTCAAAT ACTTGCCTCC CGTACTAGCG GAATTACCCA GTATCAATTT GATGGAGATC GCTGAAAATC CTTTAGTTGT TCCCTCGCTT GAACTCATCA GATCGTTACA AAAACTGAAT AGCGACTACG ATTGGATTCT TGAACTAAAA TCGTACTTGA TGGCTCACAA ATCACTTTTA GACTTCAAAA TTCAAGAGCA GCAACAAAAT CTCCAGAAAC AAAGCCAGCC TCTTCTGGCA CCTCCTGCTT TACCCAATGG TCAACAGCCA ATGCTTACAA GACCCAAAAG CATCTCAGAC AAGTCAAAGG CATCTAGAGC AGCAAGAAGA ATGGGATTGT TGATTAAACG AGAAGATGCC TCGAGTCCCC CAAGTTCAGC TGATGCCAGT AGTGTCAGTA TCCATGATAG CACTAATGAT ACGTTTGCGT CTTCAGGTGT TCTGAATGCC AATTCAACTC TAGATGATAC TACTGCCAGC TTCCCATTTG CTTCGCAGGC TCAAACCAAT GGAAATCACA GTATTATAAA TACTACTAGC ATACTTACAG GCGGTAGCGA TTATCTTCAT ACCGACTTCA CAAACACTTT GAGCAACCCC TCGTCTGCAT CTGCTATAGA AACTTCTTTC TCTATAGTTA CACCACCACC ACCAAACATG ACCATGTCCA CAACTACAGC CACCACAGCA TCAAATTCTC CAACATCTAT TGCCGCACCA ATCTCTAAGA CTGCTAATGC TTCTGTGGCT TCATCAAATA CAACGTCAAA TGTATCTACA TTAAGTCGTC CTACAAGCAG AAACAGATCT CGGTCTAATA CGCTTAAGGA AATAGATAGG ATTTTAGAGA AAAACGATAA AGTTGATACA GAACACAAAT CGGGCGCTTA TTTCAGAAGG TTATCTACTC TACAAGAAAT TCCAGGAGAT AAGAAGGATG TTCAATCCGA GAATGTGAGA CTGATCCAAA ACTCTCAGAC CACTAAAACT AATGAAGATA CAATTACAAA AACACATTCG GCAGCAAATA GAACAGCTCC CGCAATAGTT TCCAATGACA TTTCTCCTTC AAGAACCAAT CCAGTAGCTG TTGCCAATAA GAAACACCTG ATACTGTCAA TTGTGAAAGT CTCCCGAAAA GTATTGTTTT CTTTCAGCGA GTTGCATTCC TCGGTAAGAA GATTCACTGG ATTCTGCGCT GACAAGAAGG TCACCATGAA AATGGTTTCT CTCTTGTATA CAACCAAATC CAACATAGAC TCGTTGGTAG AGAATTTGGA GATTATGGAA GAAACAGGTA CCAACCTAGA CCAAATTGTC GCCTCCTTAC ATGTTTGCAT CAGTTCATTC AAGGCAATCA TGACTTTGCT TAGTGAAAAC 
TTTGCATCAT TCGTAGACAA GATAGATGTC TGTTTCATTC GGATGCTCTA CTTGACGCTT TATGGGTCGT TCAACGAGTT ACTAAATGCC TACAGAACAC TTGTACCTGC AGTTCAGAAG CCTCCTCCTA AGTTTAATGT ACCAGTTACC GCTGCTGCAA CTGAGTCGAC AAAACAAAAA TTGTCTATCA ATACTACTCA TTTCGACCAC GATGATGTAG ATGAAAAATT ATATGTTACA ATTGATTCTG CTACTACCAC AGCTCAAAAT ATCTTCAGCG AGTTGAACAA AGCCATCAAT AAGAGTGCAA TTGCCAGTGC CTCATCTTCA GATCCAGCGA CAAATGCTAG TGTTGCCAAC AAGGTCAACG AATTAACCAA TGTCTGTGTG TCATTTTTTG ATATTACAAA GAGATTGAAG ACGAAGTTGA TAACTATTCG AAATAACCCT TCTCAAACAA CTAAAAAGTC GTTTGGAGAA GATATCAATT TGTTTATCAA ATCCATAGTT CAAACATTAG CTTGTGTCAA GGGTATTGTC AAGGACTTGC CTATCTTAGA TGACATTAGG GCTTCTATGT CTACTTTGAC AAAGACTGCA AAGGAAGTGA CGTATATGTT AGAAGTTTCA TCCTACAAGA CTTTGGCTAC GGATAGTACA GGTACGCAAT ATCCGCCAGC ATTGGTATCG ATACCCAGTG TATCGAATCT CTTTACTCCT GTATCTGCTC ATCCACCATT GCAATCGCCG TCGCTGGTTA ACTTGGCTCA GCTTGCAAGT AATGGAAACA TGGGAGTAAC TAGAACACCT TTGACTTCAT CGTTAACGGC TTCTTCCATC AAGACGTCAT CGTTGTCAAT ATCAGGACCT TCTTCTAGCA CGTTAGGAAA TATTACCAAT ACAGCCGCAG GGGTTACCCC AATGGGTAGT CCGGGACTTG TTAATGGCCA CAACATTGGT CCATTGACAG CCCCGACGCA GTCTTCTGGC CAGTACTATG CTAAGAACGG CATGAATCCG TTTGATGGTT TAATTATGGC TAGTCGTAAT CATGACAGAG AGATTGTAGA TGCAGAATAG
|
Protein sequence | MSQSYLLHKS QLWPLLTDQL DAVHRKKSIK LANLPGSKLV VSDIIEVIKD FVNSYNPPLE LERLSLQSCQ LSEISGNFSS FSSSIKYLDL HQNNVHSISE SIISCFPSLE ILDLSSNNLS TLPGNLSQLK NLRIISIRNN KFKYLPPVLA ELPSINLMEI AENPLVVPSL ELIRSLQKSN SDYDWILELK SYLMAHKSLL DFKIQEQQQN LQKQSQPLSA PPALPNGQQP MLTRPKSISD KSKASRAARR MGLLIKREDA SSPPSSADAS SVSIHDSTND TFASSGVSNA NSTLDDTTAS FPFASQAQTN GNHTSNSPTS IAAPISKTAN ASVASSNTTR NRSRSNTLKE IDRILEKNDK VDTEHKSGAY FRRLSTLQEI PGDKKDVQSE NVRSIQNSQT TKTNEDTITK THSAANRTAP AIVSNDISPS RTNPVAVANK KHSISSIVKV SRKVLFSFSE LHSSVRRFTG FCADKKVTMK MVSLLYTTKS NIDSLVENLE IMEETGTNLD QIVASLHVCI SSFKAIMTLL SENFASFVDK IDVCFIRMLY LTLYGSFNEL LNAYRTLFNV PVTAAATEST KQKLSINTTH FDHDDVDEKL YVTIDSATTT AQNIFSELNK AINKSAIASA SSSDPATNAS VANKVNELTN VCVSFFDITK RLKTKLITIR NNPSQTTKKS FGEDINLFIK SIVQTLACVK GIVKDLPILD DIRASMSTLT KTAKEVTYML EVSSYKTLAT DSTGTQYPPA LVSIPSVSNL FTPVSAHPPL QSPSSVNLAQ LASNGNMGVT RTPLTSSLTA SSIKTPGLVN GHNIGPLTAP TQSSGQYYAK NGMNPFDGLI MASRNHDREI VDAE
|
| |
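Translation table 12 (the alternative yeast nuclear code) assigns the codon CTG to serine rather than leucine. The sketch below illustrates the effect on nucleotides 151-180 of the gene sequence above, which encode residues 51-60 of the protein sequence. It uses only the Python standard library. The `translate` helper and the table names are illustrative, not part of any database API. Because the gene is spliced, translating the full genomic sequence this way is not expected to reproduce the whole protein; the example covers only a stretch near the 5' end that matches the listed protein.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) with the standard-code amino acids.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

TABLE_1 = dict(zip(CODONS, STANDARD_AAS))
# Table 12 differs from the standard code in one amino acid assignment:
# CTG is read as Ser (S) instead of Leu (L).
TABLE_12 = {**TABLE_1, "CTG": "S"}


def translate(dna, table):
    """Translate a DNA string in frame 0, ignoring spaces and trailing bases."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 151-180 of the gene sequence (copied from the record above).
segment = "TTTGTCAATC TGTACAACCC TCCATTAGAA"

with_table_12 = translate(segment, TABLE_12)
with_table_1 = translate(segment, TABLE_1)

print("table 12:", with_table_12)
print("table 1: ", with_table_1)
```

The table 12 translation matches residues 51-60 of the protein sequence (FVNSYNPPLE), while the standard code would place a leucine at the CTG codon.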