Gene Information |
Locus tag | PICST_67022 |
Symbol | |
ID | 4837668 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1775371 |
End bp | 1778256 |
Gene Length | 2886 bp |
Protein Length | 749 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388983 |
Product | predicted protein |
Protein accession | XP_001382567 |
Protein GI | 150863921 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5059] Kinesin-like protein |
TIGRFAM ID | |
|
|
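The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (consistent with the reported 2886 bp) and assumes the coding region is the protein length plus one stop codon; the variable names are illustrative and not part of the record. Reading the leftover bases as sequence upstream of the start codon is an interpretation, not something the record states.

```python
# Values copied from the Gene Information table above.
start_bp = 1775371
end_bp = 1778256
protein_length_aa = 749

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding region is one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by the coding region.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)
```

Under these assumptions the gene span matches the listed Gene Length, and the coding region is shorter than the full span even though the gene is marked as not spliced.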
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.626682 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCTCGAGCCG CCTCGTAGCC AAGCAATCCT TGGAATAACC TTCGCTTACA TTCTTGGGAG AACTGGAGGA GGAGTTTGAT TTGATTTGAT TTTCCGAATT GGTGTTGATT TGTTCTTTGT GAATATTTTT GTCTCATATT TTGATTTCAA ATATTTTTCT TGTTTCCATA TCTCGGTTTG TCCTTTTGGC TGTGAATATT ACTTTTGCCA GATACTTCTT TTCCTGCCAG CAGAGCATGG TATTACAGAA GAGCCAGCCG TATTGAAGTA TATCTCATAG AATTATCTTG CATTCAAAGC GTTCCGATCT TTTGGTCTCG TGTTCACTTC CGCCGTATCG CTTCGTGTTC TGATACAAAT TGAATCCGTC ACAAATACCA TACATTCCAT ATTCCATTAC ACGTTACATA GACACCCCAG CCACCGATAG AACCGCTTAT CACACGTATC CCGTATCGCA TTATCACATA GTCACTAGTG TCAATCCACT TCTCATCTAA TATCACAGTT ATACCCATCT TCACATTCGT TAGCCCATCA TTCATTCATA TAAAAGTTGT ATTATCTGTA CACCACACAC TTTTCCATTG ATACACATAA CAATCGCTCC GACACTCAAT TTTCATCATT TTCATCATGT CGTGCAAATC CTATGACGAG TCGCTTTCCA GCAACGATCT GTTTGAAAGA GAGGAATCGA AACAGATGTC CTCACCCTTC AACTACTTGT CGCGCTCTTC GTCAAACCTA GCATCTAACA TTTCGCTCAC TAGATCGGTC TTGCCCACCA ACAACGTTAA GGTCGTTTGT AGATTCAGAC CAACGACAGC CGAGGAAAAA GCCGCATATG ACGATAACTC TGTAGTGTAT CCCAGCATTC ATACGGTTTC GGTAAAGTCC AAGGACATCA CCAATAGTTT CACATTCGAC CGTGTGTTTG ATCCCAACTC GAGCCAAAGA GACATATACG ATTATTGCAT CCCTGAGACC GTAGACGATC TCATGAACGG CTACAATGGC ACAATACTTG CCTACGGCCA AACAGGGAGT GGAAAATCGT ACACCATGAT GGGTAAACCT GAAGGAGACG AACGTGGTTT GGCATCAAGA ATAGCAGAAG ACATCTTTGA TAGAATTGCC CACGGATCCC TGGAAATTGA GTACACCTTG GCAGTATCCT TCTTTGAAAT CTATATGGAA CACATAAAGG ACTTGATTGA CTTGTCCAAT AACGACAATG CTGACCACAA GTTCTCAATC CACGAAGACA AGCTCAATGG GATACATGTC AAAGGAGTAG CACAAGCATT TGTCACCACC AGCGAGGAAA TGGTGCTGAT TTTGAATGGA GGATTGAAGA AAAGAAGTAC ATCGTCGACG TTTCTGAATC TTGAATCATC TCGTTCACAC GCCATTTTCC AGATAGACCT ATCACAAAAA CACTCACAAA CTGAAGTTAT AAAGAAATCA CGATTGTTCC TCGTAGACTT GGCTGGTTCA GAAAAAGTAG ACAAGACAGG AGCTCAGGGT CAGACTTTGG AAGAGGCAAA GAAAATCAAT AGCTCTTTGG CTGCTCTTGG AAATGTCATC AATTCCTTGA CGGACGGAAA GTCTACCCAT ATTCCATATA GAGACTCCAA GTTGACGAGA ATTCTCCAAG AGTCTCTCGG TGGTAACTCA AGAACGTTGC TCATAGTGAA TTGTTCTCCA TCCACTACAA ACGAATTAGA GACTTTGTCA ACACTCAGAT TTGGAACAAG AGCCAAAAAT ATCAAAAACG TAGCACATGT GAATACCGAA 
CTCTCCAGTG CCAGTCTCAA ACAAAAAGTG TCACAATTAG AAAAGATCAA CGAGAACAAC ATGTCGTATA TCAAACAATT GGAAGCTGAA CTAGCAGACT GGAGATCAGG AGAAAAATCT CCCCGTGGTC TTCAACCACC AAATTCGTTT TTAAGTAAGT CCATGGGTTC ACCTGACACG CCTACCAAGG ACAAGTACCA TTCTAGAATA CCTTTGCCAA TTGCGACATC TCCTTCTAAG GCTAAGTTGC TCTTCAACGA AGAAATAGCA AGAAAGGACA AGAAGCTTCA AGAATTGGAA AATACAATAT TGAGTATGAA AATGCTGAAC TTGAAAACTT CGCATACCGA AGAATCTAAG TTGTTTTCAT TGGAAAATTC CTTACACAAG ATAAGCAACA AACTAAACGA TGTCGAGTTG ATCAATATTA ATTTACGGAA GCACTTATTG ATCAGCGAAA AGATTATAGA ATCTCGTGAT CTGAAAATCA ATAAGTTAAA GAATGCATTG AAGGAACAAC AACTATTGAT CTCAAGAGAG ACCTTGGGTT TCCGCAACAA GCTTGCTGAC ATTCAAACTA AGCTTGAGGA ACTCAATAAC AAGAAACACG AGGAGTTGAA ACTCCGTCGT GAAACATTAG TCTGGGAAAT GGAAAAGGAC AAATTGAAAC ACAAATCGAA TGGTTCAAAA GATACCGAGA AAGACGCCAG AATACTGCTA GTCTCCGGAT CAGAAACATT GGCTGAAAAA GAAAATCTTC ATGAAGGTTT CTCTATCAAC CTGTCTAAGA AGTTGCACTC CCCCAATACC TTCACTGACA GATTACCAAG TTTAAAAACA ATACAGACTG CGGACCTGGA CATCATGAAC TTCCATCAAG AGTATCTTTC TGTAAGCGAG TTCTTGTCTG AAAGTAGAAG ACGGTCAAAC ATGATTGAAG CTGTCATAAA CGACGCAGAA ACATCTGAAA CATCAGGAAT GTTCGATAGT TCGCCTTCAA AATCACCCAC CAAGGGAATT AATCTCCGTA TAATTAAACC CATTCGAGGA GGAGCATTAC CCAATCCAAT CAATTTATTT CATTGA
|
Protein sequence | MSCKSYDESL SSNDSFEREE SKQMSSPFNY LSRSSSNLAS NISLTRSVLP TNNVKVVCRF RPTTAEEKAA YDDNSVVYPS IHTVSVKSKD ITNSFTFDRV FDPNSSQRDI YDYCIPETVD DLMNGYNGTI LAYGQTGSGK SYTMMGKPEG DERGLASRIA EDIFDRIAHG SSEIEYTLAV SFFEIYMEHI KDLIDLSNND NADHKFSIHE DKLNGIHVKG VAQAFVTTSE EMVSILNGGL KKRSTSSTFS NLESSRSHAI FQIDLSQKHS QTEVIKKSRL FLVDLAGSEK VDKTGAQGQT LEEAKKINSS LAALGNVINS LTDGKSTHIP YRDSKLTRIL QESLGGNSRT LLIVNCSPST TNELETLSTL RFGTRAKNIK NVAHVNTELS SASLKQKVSQ LEKINENNMS YIKQLEAELA DWRSGEKSPR GLQPPNSFLS KSMGSPDTPT KDKYHSRIPL PIATSPSKAK LLFNEEIARK DKKLQELENT ILSMKMSNLK TSHTEESKLF SLENSLHKIS NKLNDVELIN INLRKHLLIS EKIIESRDSK INKLKNALKE QQLLISRETL GFRNKLADIQ TKLEELNNKK HEELKLRRET LVWEMEKDKL KHKSNGSKDT EKDARISLVS GSETLAEKEN LHEGFSINSS KKLHSPNTFT DRLPSLKTIQ TADSDIMNFH QEYLSVSEFL SESRRRSNMI EAVINDAETS ETSGMFDSSP SKSPTKGINL RIIKPIRGGA LPNPINLFH
|
| |
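The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. As a minimal sketch of what that means for this gene, the code below translates the first 60 bases of the coding region, starting at the ATG that matches the listed protein's N-terminus. The helper name `translate_table12` and the hand-built codon dictionary are illustrative assumptions, not part of the record; the input is assumed to contain only A, C, G and T.

```python
# NCBI translation table 12, written in the conventional TCAG codon order.
# It differs from the standard code at a single codon: CTG -> S instead of L.
BASES = "TCAG"
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODE_12 = dict(zip(CODONS, TABLE_12))


def translate_table12(dna: str) -> str:
    """Translate DNA codon by codon with table 12, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODE_12[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the coding region, copied from the gene sequence above.
cds_start = "ATGTCGTGCAAATCCTATGACGAGTCGCTTTCCAGCAACGATCTGTTTGAAAGAGAGGAA"
peptide = translate_table12(cds_start)
print(peptide)  # MSCKSYDESLSSNDSFEREE
```

The fifteenth codon in this stretch is CTG, so the standard code would give leucine at that position, whereas table 12 gives the serine seen in the listed protein sequence (MSCKSYDESL SSNDSFEREE).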