Gene Information |
Locus tag | PICST_10382 |
Symbol | |
ID | 4838376 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1066590 |
End bp | 1069412 |
Gene Length | 2823 bp |
Protein Length | 810 aa |
Translation table | 12 |
GC content | 37% |
IMG OID | 640389691 |
Product | predicted protein |
Protein accession | XP_001384507 |
Protein GI | 150865336 |
COG category | |
COG ID | |
TIGRFAM ID | |
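
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state this, but the reported Gene Length matches under that assumption). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 1066590
END_BP = 1069412
PROTEIN_LENGTH_AA = 810


def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def bases_outside_codons(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not accounted for by the protein's codons."""
    return gene_len - 3 * protein_len


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                                           # 2823
    print(bases_outside_codons(length, PROTEIN_LENGTH_AA))  # 393
```

The 393 bp not covered by the 810 codons is consistent with the record marking the gene as spliced, although the record itself does not list intron positions.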
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.012102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0536067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTTCCCCGAA TGTCGGAAAG AAAAAGCAGG AGAACTCATA AGAATTCTAG AGACGGGTGT CCAAACTGTA AAGCCAAAAG GATAAAATGT TCCGAAGAAC TACCGTCATG TCATAATTGT ATCAAGAAAA ACTATCGTTG TGGGTACTTG GATTTTCCCA AGGAGAAATT GGACCACATC AGAAAGAAGA ATGATAAGAA ATCTAAAGAA CAAGACGAGA ATACTAAAGA AGAACTGTCA TCAAATTTAC AAACAGATCC AACCACCTTC GCATTTCATA AACAAAATCA CGGTTTATCA TCCGGTCATG GCCAATCACA GACGCTGCTG TTGCATCAAA TGCTGCAAGG TCAACAACTG TTTAATCAGT TCAACTATTC AAACCAGCCA TCTTCATCTT CTTCATTCAT GCCATCATCA TTTTCAATAA ATCAATCTCG AATGGGAGTT CCAGTATCTA CTGCACTTGT GAGGCATTCT TCATCTAACC AATCATTAGA GAGCAGTCCT CCTCAGAACG CTACTCCACC AATTATTCCA TTTACGCCTC TTTCTACGAC CCCAATTCAG CAACCCAGTA ATGAGAATAG CAACAATAAT AGCACAAACT TTCTTCTGGC ACCACCATCA GTTAACAATA TTCACTTCTC AGACAACAAC AGCACTTCCA GTAATGGTAC ATCAAATTCC AATGCAAATA ATGCAACCAA TCCTCCACAT GATATTTTTG CAGATCTATC TCCATTGAAT CAGGACTCCT TCGAAAATAA CGAATTTTGT CTTACGAGGG ATGACCTACG TGTTGCTTTG TATAGGGATT CCTTTCATAA GTTGACAAGT TACGATGTAA GTTCTGCAGG AGCCTTTGTC AACGAACACC CGCACTATGG ACACCATGAT GAGATCAATT CCAATGGTCA GCTGAAAACT TCGGATAATG ACGACAATTC AAGTTCCAAC AGCTTCGAAA ATTCCCAACT CAATCTCAGC CGTCACATCA AGACCGCAAA TGAAAATGTG GAGATTGAAG AATTAGAAGA TTTAACTCCT TTGGATCATC ATAGTATTTT TCATCCTTAT CCACCACCCT CGCATCCTGC TTCGCCACCA GGAACTCCAC TTGCGAACGC TATATCGCAA ATTTTAACTG TAAACAAGTT GAAGAAGACT AAAAGACTTA GGAATGGATA CTTGAAAAGG TATTTAAAAC AACCTGATAG AAGTTTTAAA TCCTTGAGCA ACACCGAATT TCGTCTACCT CATTTGCCGG TTTGGCGAAG TGACTATGTT GACCAATTTT GGATATCAGT TTTTAATCAG AGTATCATTA TTAAAGTATA CTATTCATTC TTTATGGACA AATCCATAAA TATACTTCTT AAAGTTTGCA ATAAGGCAAT CAACACTGGA AATAGCTCTA CTTCATTTAC AAAGAAGGAT TTGGATATTT TGACCAAAAA ATCATATACC TATTACGGTT TGTTAATACG AGATCTTCGT GAATCAATTA CAGAGATTCA TATAGAGTAT CCTATTAAGA TTTCCTTATA CGCAGCATGG TCAACCTTTT TGCATTTGCA TACTAATATG GAGACTTTGT GTTTAATGTT CACTGGAACT GCTTCTCTAT TTGGGAAGAT TGTTAATGAT GCAAAATCTG TGAACGACAT CACTCACACA TTGCAAATTT CAATACAAGC ATTCAATGAT AACACGAATA CTTGTTTAGT TCCCGATTAC AAATTTGATG TCATTAAGGA TTTGTATCAA GATCTTCTCC ATTTCAAGTC CTTCATTGTT 
AATAATCAAT CACTAACAAG CCCGAATAAT GTCCACATAT TAAGGAATTT CTTGGAGCTT GAATCGTTCA TGAAGAATCT CATCGAATGC GTTTACCCTA GGATCATGTA CATCAATAGT TACTACAAGC TTGTCAACAA CGTCGACGAT GATTCGGACA ACATTATTTT CACTTCTCCA AGTTTGTTTT TTGAAATGTT GGCTGATTGG TTTGATATAT TTCCTAGCGA AGCTGTCTCA ATTGGTTCTA AAATGAAGCC ATTGAGAAAA ACTTTTTATT TGTTTTATGT TGCTATTGGA AGAGCATTAC TTAATGTTTT CTCACCCATT AGAAGCATGC TATTGATTGA TACTGTACAT GTTATACACC CAAAAGTCGA CTTCAATTGC AACCTCTACC GTATTGGCAT TGATGAGGTG GAGTCAAAGG ATCAATTTTT TTTCTTAAGG AATCTTTCCC ATAAGCTTAT GAGAACCGTT ATTTTCTTCA ACAATAGGAC TGAATTGCTC TGCTATTATT TGTCCACAAA GACTGTTTTG AACCGTACAG ACTCGCAATC CTATTTGATG AGCATCGACG CAAAGCACAA CCTCTCTGAA GTTCATTATC AAGATATCGT GAAGTTTTTG CCTCAAAAGT TGGAAATTGA TGAAATAATG TTGAGCAGCC TAGGTCCAGA AACTACAATT AATTATTATA ATTATCCACT TTTGAAGAGT TTGCTATTGG GAATAGACGA TAATCCAATA AATCGTACAA AGGTTTTAAG GGAATTGATA GAGAAGCAGG AGAATTTGCG TCAATCTAGA ACCCAAACAA ACAATACGTT TCGAAAGTCT GGAGATGGTT CGTTTAATTA CAGAAGTGGA ATATTTGAGC TGGATTTCGA CATTTCGGAA CCTATCGATT ATTATAGACA ATCACAGCAA CCAAATTGGT CCATTTCGAA TTGGTCAATT GAAGAAACAA GGTTGAGAAT TTCGAATTTC GATTGTTCAA GAAGGCAAAT TGCTAAAAGC GTC
|
Protein sequence | FPRMSERKSR RTHKNSRDGC PNCKAKRIKC SEELPSCHNC IKKNYRCGYL DFPKEKLDHI RKKNDKKSKE QDENTKEESS SNLQTDPTTF AFHKQNHGLS SDNNSTSSNG TSNSNANNAT NPPHDIFADL SPLNQDSFEN NEFCLTRDDL RVALYRDSFH KLTSYDVSSA GAFVNEHPHY GHHDEINSNG QSKTSDNDDN SSSNSFENSQ LNLSRHIKTA NENVEIEELE DLTPLDHHSI FHPYPPPSHP ASPPGTPLAN AISQILTVNK LKKTKRLRNG YLKRYLKQPD RSFKSLSNTE FRLPHLPVWR SDYVDQFWIS VFNQSIIIKV YYSFFMDKSI NILLKVCNKA INTGNSSTSF TKKDLDILTK KSYTYYGLLI RDLRESITEI HIEYPIKISL YAAWSTFLHL HTNMETLCLM FTGTASLFGK IVNDAKSVND ITHTLQISIQ AFNDNTNTCL VPDYKFDVIK DLYQDLLHFK SFIVNNQSLT SPNNVHILRN FLELESFMKN LIECVYPRIM YINSYYKLVN NVDDDSDNII FTSPSLFFEM LADWFDIFPS EAVSIGSKMK PLRKTFYLFY VAIGRALLNV FSPIRSMLLI DTVHVIHPKV DFNCNLYRIG IDEVESKDQF FFLRNLSHKL MRTVIFFNNR TELLCYYLST KTVLNRTDSQ SYLMSIDAKH NLSEVHYQDI VKFLPQKLEI DEIMLSSLGP ETTINYYNYP LLKSLLLGID DNPINRTKVL RELIEKQENL HGSFNYRSGI FESDFDISEP IDYYRQSQQP NWSISNWSIE ETRLRISNFD CSRRQIAKSV
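
The record lists translation table 12, the alternative yeast nuclear code in NCBI's numbering, in which CTG is read as serine rather than leucine. The following is a minimal sketch of how the start of the gene sequence maps onto the start of the protein sequence under that table. The helper names (`build_codon_table`, `translate`) are hypothetical, and the amino-acid strings are written in NCBI's TCAG codon order.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code at one codon: CTG -> S.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(aas: str) -> dict:
    """Map each codon to its amino acid, given a 64-letter string in TCAG order."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


STANDARD = build_codon_table(STANDARD_AAS)
TABLE_12 = build_codon_table(TABLE_12_AAS)

if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    first_30 = "TTTCCCCGAA TGTCGGAAAG AAAAAGCAGG"
    print(translate(first_30, TABLE_12))     # FPRMSERKSR
    print(STANDARD["CTG"], TABLE_12["CTG"])  # L S
```

Because the gene is marked as spliced, translating the whole gene sequence in a single frame is not expected to reproduce the full 810 aa protein; the sketch only illustrates the codon table on the leading bases.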