Gene PICST_73373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73373 
Symbol 
ID4839920 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1016118 
End bp1018998 
Gene Length2881 bp 
Protein Length907 aa 
Translation table12 
GC content39% 
IMG OID640391235 
Productpredicted protein 
Protein accessionXP_001385551 
Protein GI150866075 
COG category[Z] Cytoskeleton 
COG ID[COG5059] Kinesin-like protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.7622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCACAATGTC CAACATCCAG GTGGCCGTGC GGTGTAGGGG CCGTAACGAC AAAGAGGTGG 
CTGCGAAGTC TCCCATCGTC ATCGACTTGC CCAACGACAC GTTTTCAGTA ACCGATCCAT
ATGTTTCCAT TAACCACAAC TACACAACGC AAACAGTTTC GTCATTGTCC AACGCCAACT
CTAGCGGCAG CCAGTCAAGA AGGTCGTCCA CCAACAACAG CGGGTCGCCA TTCAGCAATT
ACAAAACCTA CAAAGTTGAC CAAGTCTACG GTTCTCAGGC TGACCAGAAC CTCATCTTTG
AAAAAGTTGC ATTGCCTTTA TTCAACGATT TTTTGCAAGG TTCCAACGTT ACCATACTTG
CTTACGGTCA AACTGGTACA GGGAAGACGT TCACTATGTG CGGCGGGGAA CAGAAAAACA
GCAACGTAGA CTATAAACAT TCAGAGACGG CTGGTATCAT TCCGCGTGTG CTCATTGAAC
TCTTCAACAA ATTAGAGCCA GAAGGAGCCG CTTCTGACTA TGTGGTGAAA TGCTCATTTT
TGGAGCTCTA TAACGAAGAC TTGAAGGATT TGTTGAACGA CGACGAAAAA CCAGCGAAGT
TGAGAATATA TGAGTCCACA GTGGCTGCCA ATGGTAACAA GAAGGAAGCA GGTAAAGCAT
CAAAAACAAT CTCGATCCAA AACTTAAGGG AAGAAAGTAT TCTGTCGTGT CAAGACGGGT
TCCAAATTTT ACAGAAAGGT CTCTTGAAAA GAAAAACGGC AAGCACAAAA CTAAATGACG
TATCTTCAAG ATCCCACACT TTATTTACAA TAAACTTATA CAGAAACCAA CCCGGCTCCG
ATGGTACTGG TTCCCAATTA TTCAAAGTAT CAAAAATGAA CTTGGTAGAC TTAGCAGGTT
CTGAGAATAT CTATAGGTCT GGAGCGCAAA ATCAGAGAGC CAAAGAAGCA GGATCAATCA
ACCAGAGTCT ATTGACTTTA GGGAGAGTAA TAAATTCATT AAGTGAACTT GCAAATTCTT
CTAATGCAGA TAATACCTTC CATATACCTT ACAGAGAGTC TAAACTTACA AGACTACTTC
AAGATTCAAT TGGAGGTTGC ACCAAAACAT CATTGATAGC TACAATTTCT CCAGCAAAGA
TCAACATTGA CGAAACCATT TCCACTTTGG ACTATGCGTG CAAAGCTAAG AATATCAAGA
ATTTGCCACA ATCGGGCCAT GATTCTGATT TGATAATGAA GAGAGTGCTT GTCAAAAATT
TATCTCAGGA AATAGCAAAG TTAAACTTCG ACTTAATAGC TACCCGGAAC AAGAATGGGA
TCTGGTTAAA CGAAGATAAT TATAACGCCA TAATGGAAGA AAATGAATCC TTGAAAGCTA
GTCTAAAGGA ATCCAATCTC CAAAATGAAC TGTTGAATTC CAAAATCTCT CAATTTGAAG
TCTTCAAGGC AAACAATGAG AATAATATCA AAAAGCTCCG GGAGCAAATA AACAAGCAGG
TGGGAATCAA TGAAGAGTTA TCCAACGAGT CAACACTGTT AAAGTCTTAC ATAGTTTCAA
AGGACGAAGA AATTAAGCAA CTAAGCGAGC AATTGGTGAA AGCAAACGAG AAGTTCAGTT
CCACTACTAA CCAGTTGGTC AAGGTGATTT ACCGTAATCT AGATACATCC ATCAATTCAA
TTCAAGATAT ATTGAGTCAG TATAATAATT CAGCAAACGG GGAAACCTTG TTCACGTTCA
ATACTCAACT TACGGGCAAC ATTGAAAACT TTAGAAAATC ACTTGAAGAA AAAATAGCTG
AGATTAACGA TAATCTTGCC AACTCATTAC TACAGGATCT TCCATATTTT CTTGAGAAAT
ACAATGAGAA CTATGATAAG TTGAGCACAT TGATATCTAG TCTAAATTCG CAGTTGATGC
AGAATTTGTC GGATCTCAAG GTTGCAAATG ACAAGTTGTC AGGATATATT ATCGAAGATC
ACTTGAATTA CAACGCACAA GAATTAATTT CCCAACTAAT CGAAACTAAG GTCTCATCTC
AATTGACTAA ATTACATGAA AAGATGGACA AAAGCATTGC CACAATCTTG CGAGACTCGA
AACAGAATTA CAAGAAACTT TTCGAGACTT CAATTTCTCA AATATCGCAA GAGTTAATTG
AGTCTGAAAG AAATGAAATC TCTAAGAGAG AAAAGAACTG GTCCACCGAG ACATCCAGAG
TCTTGAATGT AATAGACCTG CAAATGTACG AAGCACGTCA AGAAGAAGTC GAACAAAGCA
AAGCTATCTT TGACTCTTTG AGCATGCTGA CTACTGAAAG GCTCACTGAC TTAAAGAATA
AAACTACCGA GAATTTGTCA AAGTTAACAG AACTCGTTTC CAACGAAGAG AACCCCAAGG
TAAGTCTCCT CCAAAGAAAT TTGCCTTGTT TAGAGGATAT ATCGAAGAAT ATTCAATTGA
ACGACATCAA AATACGAGAT TCTCTAACAA CGATAGACAA GAGTTTACAG GATATTAAAC
AATTTGATGC AAAACAAGCA TTCAAATTGT CACCAGTACG TGGCTCAAAG CAAATCGAAA
TTGATGGCTT GAAAAGATCT CCTTCGAGGA GTCCCTCTTA CTCCAACCCC CCCTCTAGAA
CTGCCAGCAG ACAAATATCT CCAATAAAAA CAGCTGGAAC TTTGGCAAGA ACTAAAATAC
CTCAGCTTAA TAGATCGCTT GATAATAAGG AGAATCAGGG CCCAAGCCAG AAGAGGAGAA
GAGTTTTGCA ACAGGTCGAT AATTTCCTCC ATGGATGAAA CCTTGTTTGA AGAAATCTCC
TTGTATTATA GAAGTATAGA CACCCACGAC GCATGTATAA TTAATGTACT TATAAGATTG
T
 
Protein sequence
MSNIQVAVRC RGRNDKEVAA KSPIVIDLPN DTFSVTDPYV SINHNYTTQT VSSFNYKTYK 
VDQVYGSQAD QNLIFEKVAL PLFNDFLQGS NVTILAYGQT GTGKTFTMCG GEQKNSNVDY
KHSETAGIIP RVLIELFNKL EPEGAASDYV VKCSFLELYN EDLKDLLNDD EKPAKLRIYE
STVAANGNKK EAGKASKTIS IQNLREESIS SCQDGFQILQ KGLLKRKTAS TKLNDVSSRS
HTLFTINLYR NQPGSDGTGS QLFKVSKMNL VDLAGSENIY RSGAQNQRAK EAGSINQSLL
TLGRVINSLS ELANSSNADN TFHIPYRESK LTRLLQDSIG GCTKTSLIAT ISPAKINIDE
TISTLDYACK AKNIKNLPQS GHDSDLIMKR VLVKNLSQEI AKLNFDLIAT RNKNGIWLNE
DNYNAIMEEN ESLKASLKES NLQNESLNSK ISQFEVFKAN NENNIKKLRE QINKQVGINE
ELSNESTSLK SYIVSKDEEI KQLSEQLVKA NEKFSSTTNQ LVKVIYRNLD TSINSIQDIL
SQYNNSANGE TLFTFNTQLT GNIENFRKSL EEKIAEINDN LANSLLQDLP YFLEKYNENY
DKLSTLISSL NSQLMQNLSD LKVANDKLSG YIIEDHLNYN AQELISQLIE TKVSSQLTKL
HEKMDKSIAT ILRDSKQNYK KLFETSISQI SQELIESERN EISKREKNWS TETSRVLNVI
DSQMYEARQE EVEQSKAIFD SLSMSTTERL TDLKNKTTEN LSKLTELVSN EENPKVSLLQ
RNLPCLEDIS KNIQLNDIKI RDSLTTIDKS LQDIKQFDAK QAFKLSPVRG SKQIEIDGLK
RSPSRSPSYS NPPSRTASRQ ISPIKTAGTL ARTKIPQLNR SLDNKENQGP SQKRRRVLQQ
VDNFLHG