Gene Information |
Locus tag | PICST_43698 |
Symbol | |
ID | 4838194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1006758 |
End bp | 1008614 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389509 |
Product | predicted protein |
Protein accession | XP_001383821 |
Protein GI | 150864838 |
COG category | [S] Function unknown |
COG ID | [COG0397] Uncharacterized conserved protein |
TIGRFAM ID | |
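The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1006758
end_bp = 1008614

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends with one
# stop codon, which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the 1857 bp and 618 aa reported in the record.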
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.237593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.455803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAT TATCGGAATT GCCAAAGACT TCGTCTTTTT CAAGCTATAT AGAACCAGAT GGGAAGATCG CCTCTACGGA AGTAGCTGCG AAGAATGAAG ATGGCATTAT CAATAAACCA AGAATACTCT CGTCAGGAGG ATTTTCCTAT TCGTTGCCCG AGTTGAGAAA GGAGTATCGA TTCTTGACTG CTAACGAAGC GGCCTTGAAC GATCTTGGAC TTGATCCGGA ACAAGTCAAC GATAAAGAGT TCCAAGAATT AGTCAGCGGA GAATTCTACC TAATGTACAA AGATACGTTT CAAGATAAAG GTTATCCATT TCCATACTCC CAGGCATATG CTGGCTGGCA ATTTGGTCAA TTCGCTGGAC AATTGGGCGA CGGAAGAGTG GTGAACCTCT TTGAAGTGCC GAAAGCAAAA GTGCAGTCAA ATAATAGGCA CAAGTATGAA GTGCAATTGA AGGGCTCTGG TAAGACTCCG TACTCGAGAT TTGCTGACGG AAAGGCTGTT CTCAGGTCTT CTATTCGTGA ATACATAATC TCTGAGCACT TGAATGCCAT TGGAATCCCA ACCACCAGAG CTTTGTCGCT CACGTATCTT CCTGCTACTT ACGCCCAGAG ACATGCTGCC GAGAAATGTG CCATTGTGTC TAGATTCGCT GAGCTGTGGA TCAGGTTGGG CACGTTCGAT CTTTACAGAT GGAGGGGCGA TAGAAGTGGT ATAAGGAAGT TGAGTGATTA TGTCATTGAT GAGCTCTTCA CTGTAGAGGG TACTAAATTC TGCAACTTTG AAAACCTTCT CAGGGAAAAG TCTGACTTCT TTGATAACAC TACCGAATCA CTCGGTGAAC TAACTGACTA CGATAAAATG TATTACGAAA CTATAGTGAG AAATGCCACT ACCACGGCTC TCACACAATC CTATGGGTTC TTGAATGGAG TCTTGAATAC TGATAATACT TCTATTCTTG GCTTGACAAT GGACTTTGGT CCTTTCTCTA TCATGGACAA GTACAGTCCA ACGTACACTC CCAATTCAGA AGACCACGAA CAGAGATACG GGTACCGTAA TACTCCTACG GCAATCTGGT GGAACTTAAC CAGATTAGGT GAAGACTTGG CTGAATTGAT AGGTGCCGGT TCCAAATTGT TATCTGATCC TAAATTTGAA AGAGGCGAAA TAGATAAGGA TTGGGAAGAT GCAATTATCA AGAGAGCTAC TAAAATAATA GAAATAGGTG GAGATGTATA CCAATACGCA TTTACCAAGA AGTATGTGGA AACTTTCTTT GCCCGTTTGG GTATATCGCC AAAGATAATA GACTACACAA ATATTGATAA GCACAACGTC GAGTTGATTG CACCCTTGCT TGAAGTGCTA TACAAAGTCA AATGTGACTA CAATAAGTTT TTCTTAATAT TGCAGGACCA GAAATTTGAT GCTGAGAACT ATAACCCCGA CGCAATTGCC GATAATATTT TGGCTCCTTC TTATGACGAG AATGATAACA GATACTCTAA AAAGGAATTG ACCGATGAAA TTAAAAGCTG GTTAGGGGTA TACCGTGCAC ATTTGGAAGA GTCTCGGGCA ATAGATCCTA CCTTCTCCCG CTTAGAAAGC AAGAAGTATA ATCCTGTGTT CTTGCCCCGC AACTGGATTC TCGACCAGGT TATTGCCCAT GTTCAAGATT CGGGTGCTTA CGACTTGTCC TACTTGAAAA AGTTAGAGAG GATGAGTTTC TATCCATTTG ATTCCACTAA ATGGGGTGAT GACTTGAAAG AGTTGGAACA ATCATGGTTG 
CTTCAGGGAG ACAAAGGAGA AGATTATTCC ATGCTACAAT GCAGTTGTGC CAGTTAG
|
Protein sequence | MSKLSELPKT SSFSSYIEPD GKIASTEVAA KNEDGIINKP RILSSGGFSY SLPELRKEYR FLTANEAALN DLGLDPEQVN DKEFQELVSG EFYLMYKDTF QDKGYPFPYS QAYAGWQFGQ FAGQLGDGRV VNLFEVPKAK VQSNNRHKYE VQLKGSGKTP YSRFADGKAV LRSSIREYII SEHLNAIGIP TTRALSLTYL PATYAQRHAA EKCAIVSRFA ESWIRLGTFD LYRWRGDRSG IRKLSDYVID ELFTVEGTKF CNFENLLREK SDFFDNTTES LGELTDYDKM YYETIVRNAT TTALTQSYGF LNGVLNTDNT SILGLTMDFG PFSIMDKYSP TYTPNSEDHE QRYGYRNTPT AIWWNLTRLG EDLAELIGAG SKLLSDPKFE RGEIDKDWED AIIKRATKII EIGGDVYQYA FTKKYVETFF ARLGISPKII DYTNIDKHNV ELIAPLLEVL YKVKCDYNKF FLILQDQKFD AENYNPDAIA DNILAPSYDE NDNRYSKKEL TDEIKSWLGV YRAHLEESRA IDPTFSRLES KKYNPVFLPR NWILDQVIAH VQDSGAYDLS YLKKLERMSF YPFDSTKWGD DLKELEQSWL LQGDKGEDYS MLQCSCAS
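The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short fragments copied from the gene sequence above. It is a hand-rolled illustration, not the pipeline used to produce this record; `translate`, `TABLE_1`, and `TABLE_12` are hypothetical names, and the only difference modeled between the two tables is the CTG codon.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order
# (first base varies slowest), paired with the standard code (table 1).
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

TABLE_1 = dict(zip(CODONS, STANDARD))
# Table 12 as modeled here: identical to table 1 except CTG encodes Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence above.
first_block = "ATGAGTAAAT TATCGGAATT GCCAAAGACT"
# A fragment of the gene sequence that contains a CTG codon.
ctg_fragment = "GAGCTGTGGATCAGG"

print(translate(first_block, TABLE_12))
print(translate(ctg_fragment, TABLE_12))  # CTG read as Ser
print(translate(ctg_fragment, TABLE_1))   # CTG read as Leu
```

With table 12 the second fragment translates to ESWIR, matching the listed protein sequence; the standard code would give ELWIR instead.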