Gene Information |
Locus tag | PICST_66248 |
Symbol | SMY2 |
ID | 4850850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 223638 |
End bp | 226611 |
Gene Length | 2974 bp |
Protein Length | 926 aa |
Translation table | |
GC content | 47% |
IMG OID | 640392558 |
Product | kinesin-like protein |
Protein accession | XP_001387283 |
Protein GI | 126273706 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCTCCA GGACACAACG CAAATCGCAG CTCGAATTCA CAAACGGAAA CTCCGTTTCC GGCTCCACGG TAGAACTGAC CGATTTCTCG CTCTCTTCGC TTCCTACTTC GACCTCAATT CCACCCACGC CAAACTCTGG CCAAGTCCCT TACACCATTC CACCCAACAG AGGCAGCAAA CGTTATTCTT TGAACGAGGT TTTTGACGTG TGGTACACCA ACAAAGAGAA CATCGTTAAC CCTGCTATCG AACTTCCTGT ACTTTCGCAT GAATCATACA AGCTCCATAA ACCTCGCCAA ATCTACCATT TGGACCTTCA GCCTACGCTG GCTCCGTCGT TTGTGTCGTC TGATTCGGCC AACTCAGATT CAGTAGCCTC CAGCGCTTCC AACTCAGACT ACCCTTTAAC TTCCACTGTA GATTCCGCTT CAGTAGCAGC TGGAAATCCA TCTGGAAATT CTGGTGCAAA TTTGTCTGGA AATTTATCTA GTAGTGGGAT TTCTAGCAAT GGTATTTCCG CTGTTTCGGC TGAGAAGAGC ATCTTGCCGT CTTTGACCTC AGGTGAAATT GACAATTCTA CCTTCAACTC GACTTTGGCA TCACTTTCTC TCAACGAGTC CCCTGGAACG TTTCAAGCTC AAGCTCAGAC TCAAGCTCAG ACTCAAACTC TGACACCTTC AGCTTCGTCT CATCCTCCTC CCGGAATGAC TTCCACTCTT CAGAACATCA ACGCTGTTCC TCTTCTTACG TCTGACAAGA TCAACTGGTT CTATGTAGAC CCACTTGGAA CAGAGCAGGG TCCCTTCAAT GGAGATACAA TGCAAGAATG GTTGACCGGA GGCTACTTGC ACTTGGACTT GAACATTCGT AGATTGGAAG AATCGTCCTA CAGACCTCTC AAAGAACTCT GTGAAAAAGT CCAGAACTAC ATCCAACCAT TCAAGGTACC TCTTCCAGAC TTGACCGTTC GTTCTGCTCC AGTGGCCGAG ACCAGACCGC TGTTCCCTCC ATTTGGATCC CAGGGCCAGG TTCCTCAGTT TAGCTCTCTT GGGCAACAGC AGCAACAACA GTCGCAGTTC CATCCCTTGT TGTCAGGAGG CCCCAACCTT GGCAGAATCA ACTCTGGACT TGGCCATAGT CAACAGCAGT CGGGATTGTT TGGCAACGAT TTCATGTCCA ACGATCCTTT TTCTGCCCAG GGGTTCCAAA ACTCTCCAGG AGCTAACCAG TTCGGAATCG ACTCTATCAA CCGGAACCTC GGCTTCAACA GTGGCTTGCA GATCAACACC ATGCCAGCCT TGTTGCAACT GCAGATTCAA CAGCAGCAAT CACAACCTGG TTTGTCGAGA AACAACAGTG GCTGGGGCAG TTTGGACCAT TCCACCGGTT TGATGAGTAA CAGTAACCCT GCAACGCCAC TATCTGTTAA CCCGGTGTTG CCTGGCCAGA TCACCCAGCC AACCCCAATA TCTCCATGGA TTTCTGGTGG GATCCAGTCC CTGTCCAGAG TCAGCTCACC CTTTGTTGCA TCTAGCACTA TTAGCAACAA TAACAGCAAT AGCAACAATA ACGTAAGCTC TGCAGAAGTG ATCGACACTG TTGCTGCCGA CGATGTTGTG GTTAACGATG ACCATGTTTT GAGCAACATG CATTCTTCTG TAGTGAACGA CTTCTTGGAC GACGATCATT TCAAGCAAGA AGTCCCAGCC CCAGCTTTAG CTACTGAAAA GTCGGAAAAG ACTGAAAATT CAGAGCCAGC TGTCAATGAA CAGCAAGCAA AGGCAGGACC GGAACAAGAA 
GCTGAGCCGG AACAAGAACT TGAAGACGAA GAAATTGTTT CCGCTCCAGT GGTTGAAACC TTGAAGCCTT CGGTGCCTAA CCAACAAGAG TTAGCTCCAT GGGCCAGCAG CAAAAATCCA GAAGCTGAAC AAAAGCCAGC CATGACTTTA AAGGAAATCC AGCAATTGGA AGCGGAAAGA TTGGAAAAGC AAAAACAATT GCTTGCTCAA CAAAGAGCCG AAGTCAACCG TGCTTGGGCA CAGGCAGAGG AGAAGGCTGC TGCTGAACAA AAGGTGCCAG CACTTCCTCC AACTACCAGT TGGGGTGCTG TTCCTGTTCA AACTGTAGCC AAAAAGACAT TAGCTGAAAT CCAGAAAGAA GAAGCCGAAG CTGCAGCTGC CAGAGCCAAG GCTGCCAAGG CATCTGCTGC TGCGTCGGGT CTTCCAGTTC AAAAAGCTTC ATTCGCCAGT GCTTTGACGG CATCATCTAC TCCAAAGGAC GATGGAGTGT GGCAGACTGT TGCTGTTAAG AAGCCAACAG TTAAGAAGCC AGTTGCTCCA TCTATTACCA ATGCTGCTTC TTCTAGTAGA GCCACCCCTC AAGTATTGAG ATCGGTTTCA GCTACGAGGC CAGCTGTCTC TGGCAGCAAC TCTCTTGCCT TGAGAGAGGA TTTCTTGATT TGGGCAAGAT CCAACATGAC CAATTTGTAC CCTACCGTTT CAAAAGATGA TTTGTTAGAT ATGTTTATTA CCTTGCCAGC TTCCAGCAGC GACTCTGGTG CCTTGATTGC TGAGACCATC TACGCTTCAT CTGCTACTAT GGATGGTAGA AGATTTGCCC AGGAATTCTT GAAGAGAAGA CAAAAGGTAG ACCAACAGAT TGGCAAGGAT GACGATGTTT CATGGTCGTC TGCAATCATC TCTTCTGCTG ACAAGTTGAC TACTGTTGAT GAAGACGGCT GGAGCACCAA CGTTAAGTCC AAGAAGAAGA AGAGAACTTG AACTGTGGAG TCATATTGGT TCTATACATA CATACTATCA ATTAGAAATT GAAACATTCT GCATTGGACG CACCGTAGTA AAATGATAAT ATATTTCACT GATAATACAA TAAGTCAGTA TTATAGTTCA TAGAATTATT GTAAGAAAGT AATATGGCAT GTATTAGGTG CGAAATAAAA TGGCTGCGAG ACGC
Protein sequence | MISRTQRKSQ LEFTNGNSVS GSTVELTDFS LSSLPTSTSI PPTPNSGQVP YTIPPNRGSK RYSLNEVFDV WYTNKENIVN PAIELPVLSH ESYKLHKPRQ IYHLDLQPTL APSFVSSDSA NSDSVASSAS NSDYPLTSTV DSASVAAGNP SGNSGANLSG NLSSSGISSN GISAVSAEKS ILPSLTSGEI DNSTFNSTLA SLSLNESPGT FQAQAQTQAQ TQTLTPSASS HPPPGMTSTL QNINAVPLLT SDKINWFYVD PLGTEQGPFN GDTMQEWLTG GYLHLDLNIR RLEESSYRPL KELCEKVQNY IQPFKVPLPD LTVRSAPVAE TRPLFPPFGS QGQVPQFSSL GQQQQQQSQF HPLLSGGPNL GRINSGLGHS QQQSGLFGND FMSNDPFSAQ GFQNSPGANQ FGIDSINRNL GFNSGLQINT MPALLQLQIQ QQQSQPGLSR NNSGWGSLDH STGLMSNSNP ATPLSVNPVL PGQITQPTPI SPWISGGIQS LSRVSSPFVA SSTISNNNSN SNNNVSSAEV IDTVAADDVV VNDDHVLSNM HSSVVNDFLD DDHFKQEVPA PALATEKSEK TENSEPAVNE QQAKAGPEQE AEPEQELEDE EIVSAPVVET LKPSVPNQQE LAPWASSKNP EAEQKPAMTL KEIQQLEAER LEKQKQLLAQ QRAEVNRAWA QAEEKAAAEQ KVPALPPTTS WGAVPVQTVA KKTLAEIQKE EAEAAAARAK AAKASAAASG LPVQKASFAS ALTASSTPKD DGVWQTVAVK KPTVKKPVAP SITNAASSSR ATPQVLRSVS ATRPAVSGSN SLALREDFLI WARSNMTNLY PTVSKDDLLD MFITLPASSS DSGALIAETI YASSATMDGR RFAQEFLKRR QKVDQQIGKD DDVSWSSAII SSADKLTTVD EDGWSTNVKS KKKKRT