Gene PICST_66248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66248 
SymbolSMY2 
ID4850850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp223638 
End bp226611 
Gene Length2974 bp 
Protein Length926 aa 
Translation table 
GC content47% 
IMG OID640392558 
Productkinesin-like protein 
Protein accessionXP_001387283 
Protein GI126273706 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCCA GGACACAACG CAAATCGCAG CTCGAATTCA CAAACGGAAA CTCCGTTTCC 
GGCTCCACGG TAGAACTGAC CGATTTCTCG CTCTCTTCGC TTCCTACTTC GACCTCAATT
CCACCCACGC CAAACTCTGG CCAAGTCCCT TACACCATTC CACCCAACAG AGGCAGCAAA
CGTTATTCTT TGAACGAGGT TTTTGACGTG TGGTACACCA ACAAAGAGAA CATCGTTAAC
CCTGCTATCG AACTTCCTGT ACTTTCGCAT GAATCATACA AGCTCCATAA ACCTCGCCAA
ATCTACCATT TGGACCTTCA GCCTACGCTG GCTCCGTCGT TTGTGTCGTC TGATTCGGCC
AACTCAGATT CAGTAGCCTC CAGCGCTTCC AACTCAGACT ACCCTTTAAC TTCCACTGTA
GATTCCGCTT CAGTAGCAGC TGGAAATCCA TCTGGAAATT CTGGTGCAAA TTTGTCTGGA
AATTTATCTA GTAGTGGGAT TTCTAGCAAT GGTATTTCCG CTGTTTCGGC TGAGAAGAGC
ATCTTGCCGT CTTTGACCTC AGGTGAAATT GACAATTCTA CCTTCAACTC GACTTTGGCA
TCACTTTCTC TCAACGAGTC CCCTGGAACG TTTCAAGCTC AAGCTCAGAC TCAAGCTCAG
ACTCAAACTC TGACACCTTC AGCTTCGTCT CATCCTCCTC CCGGAATGAC TTCCACTCTT
CAGAACATCA ACGCTGTTCC TCTTCTTACG TCTGACAAGA TCAACTGGTT CTATGTAGAC
CCACTTGGAA CAGAGCAGGG TCCCTTCAAT GGAGATACAA TGCAAGAATG GTTGACCGGA
GGCTACTTGC ACTTGGACTT GAACATTCGT AGATTGGAAG AATCGTCCTA CAGACCTCTC
AAAGAACTCT GTGAAAAAGT CCAGAACTAC ATCCAACCAT TCAAGGTACC TCTTCCAGAC
TTGACCGTTC GTTCTGCTCC AGTGGCCGAG ACCAGACCGC TGTTCCCTCC ATTTGGATCC
CAGGGCCAGG TTCCTCAGTT TAGCTCTCTT GGGCAACAGC AGCAACAACA GTCGCAGTTC
CATCCCTTGT TGTCAGGAGG CCCCAACCTT GGCAGAATCA ACTCTGGACT TGGCCATAGT
CAACAGCAGT CGGGATTGTT TGGCAACGAT TTCATGTCCA ACGATCCTTT TTCTGCCCAG
GGGTTCCAAA ACTCTCCAGG AGCTAACCAG TTCGGAATCG ACTCTATCAA CCGGAACCTC
GGCTTCAACA GTGGCTTGCA GATCAACACC ATGCCAGCCT TGTTGCAACT GCAGATTCAA
CAGCAGCAAT CACAACCTGG TTTGTCGAGA AACAACAGTG GCTGGGGCAG TTTGGACCAT
TCCACCGGTT TGATGAGTAA CAGTAACCCT GCAACGCCAC TATCTGTTAA CCCGGTGTTG
CCTGGCCAGA TCACCCAGCC AACCCCAATA TCTCCATGGA TTTCTGGTGG GATCCAGTCC
CTGTCCAGAG TCAGCTCACC CTTTGTTGCA TCTAGCACTA TTAGCAACAA TAACAGCAAT
AGCAACAATA ACGTAAGCTC TGCAGAAGTG ATCGACACTG TTGCTGCCGA CGATGTTGTG
GTTAACGATG ACCATGTTTT GAGCAACATG CATTCTTCTG TAGTGAACGA CTTCTTGGAC
GACGATCATT TCAAGCAAGA AGTCCCAGCC CCAGCTTTAG CTACTGAAAA GTCGGAAAAG
ACTGAAAATT CAGAGCCAGC TGTCAATGAA CAGCAAGCAA AGGCAGGACC GGAACAAGAA
GCTGAGCCGG AACAAGAACT TGAAGACGAA GAAATTGTTT CCGCTCCAGT GGTTGAAACC
TTGAAGCCTT CGGTGCCTAA CCAACAAGAG TTAGCTCCAT GGGCCAGCAG CAAAAATCCA
GAAGCTGAAC AAAAGCCAGC CATGACTTTA AAGGAAATCC AGCAATTGGA AGCGGAAAGA
TTGGAAAAGC AAAAACAATT GCTTGCTCAA CAAAGAGCCG AAGTCAACCG TGCTTGGGCA
CAGGCAGAGG AGAAGGCTGC TGCTGAACAA AAGGTGCCAG CACTTCCTCC AACTACCAGT
TGGGGTGCTG TTCCTGTTCA AACTGTAGCC AAAAAGACAT TAGCTGAAAT CCAGAAAGAA
GAAGCCGAAG CTGCAGCTGC CAGAGCCAAG GCTGCCAAGG CATCTGCTGC TGCGTCGGGT
CTTCCAGTTC AAAAAGCTTC ATTCGCCAGT GCTTTGACGG CATCATCTAC TCCAAAGGAC
GATGGAGTGT GGCAGACTGT TGCTGTTAAG AAGCCAACAG TTAAGAAGCC AGTTGCTCCA
TCTATTACCA ATGCTGCTTC TTCTAGTAGA GCCACCCCTC AAGTATTGAG ATCGGTTTCA
GCTACGAGGC CAGCTGTCTC TGGCAGCAAC TCTCTTGCCT TGAGAGAGGA TTTCTTGATT
TGGGCAAGAT CCAACATGAC CAATTTGTAC CCTACCGTTT CAAAAGATGA TTTGTTAGAT
ATGTTTATTA CCTTGCCAGC TTCCAGCAGC GACTCTGGTG CCTTGATTGC TGAGACCATC
TACGCTTCAT CTGCTACTAT GGATGGTAGA AGATTTGCCC AGGAATTCTT GAAGAGAAGA
CAAAAGGTAG ACCAACAGAT TGGCAAGGAT GACGATGTTT CATGGTCGTC TGCAATCATC
TCTTCTGCTG ACAAGTTGAC TACTGTTGAT GAAGACGGCT GGAGCACCAA CGTTAAGTCC
AAGAAGAAGA AGAGAACTTG AACTGTGGAG TCATATTGGT TCTATACATA CATACTATCA
ATTAGAAATT GAAACATTCT GCATTGGACG CACCGTAGTA AAATGATAAT ATATTTCACT
GATAATACAA TAAGTCAGTA TTATAGTTCA TAGAATTATT GTAAGAAAGT AATATGGCAT
GTATTAGGTG CGAAATAAAA TGGCTGCGAG ACGC
 
Protein sequence
MISRTQRKSQ LEFTNGNSVS GSTVELTDFS LSSLPTSTSI PPTPNSGQVP YTIPPNRGSK 
RYSLNEVFDV WYTNKENIVN PAIELPVLSH ESYKLHKPRQ IYHLDLQPTL APSFVSSDSA
NSDSVASSAS NSDYPLTSTV DSASVAAGNP SGNSGANLSG NLSSSGISSN GISAVSAEKS
ILPSLTSGEI DNSTFNSTLA SLSLNESPGT FQAQAQTQAQ TQTLTPSASS HPPPGMTSTL
QNINAVPLLT SDKINWFYVD PLGTEQGPFN GDTMQEWLTG GYLHLDLNIR RLEESSYRPL
KELCEKVQNY IQPFKVPLPD LTVRSAPVAE TRPLFPPFGS QGQVPQFSSL GQQQQQQSQF
HPLLSGGPNL GRINSGLGHS QQQSGLFGND FMSNDPFSAQ GFQNSPGANQ FGIDSINRNL
GFNSGLQINT MPALLQLQIQ QQQSQPGLSR NNSGWGSLDH STGLMSNSNP ATPLSVNPVL
PGQITQPTPI SPWISGGIQS LSRVSSPFVA SSTISNNNSN SNNNVSSAEV IDTVAADDVV
VNDDHVLSNM HSSVVNDFLD DDHFKQEVPA PALATEKSEK TENSEPAVNE QQAKAGPEQE
AEPEQELEDE EIVSAPVVET LKPSVPNQQE LAPWASSKNP EAEQKPAMTL KEIQQLEAER
LEKQKQLLAQ QRAEVNRAWA QAEEKAAAEQ KVPALPPTTS WGAVPVQTVA KKTLAEIQKE
EAEAAAARAK AAKASAAASG LPVQKASFAS ALTASSTPKD DGVWQTVAVK KPTVKKPVAP
SITNAASSSR ATPQVLRSVS ATRPAVSGSN SLALREDFLI WARSNMTNLY PTVSKDDLLD
MFITLPASSS DSGALIAETI YASSATMDGR RFAQEFLKRR QKVDQQIGKD DDVSWSSAII
SSADKLTTVD EDGWSTNVKS KKKKRT