Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_73373 |
Symbol | |
ID | 4839920 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1016118 |
End bp | 1018998 |
Gene Length | 2881 bp |
Protein Length | 907 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391235 |
Product | predicted protein |
Protein accession | XP_001385551 |
Protein GI | 150866075 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5059] Kinesin-like protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.7622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCACAATGTC CAACATCCAG GTGGCCGTGC GGTGTAGGGG CCGTAACGAC AAAGAGGTGG CTGCGAAGTC TCCCATCGTC ATCGACTTGC CCAACGACAC GTTTTCAGTA ACCGATCCAT ATGTTTCCAT TAACCACAAC TACACAACGC AAACAGTTTC GTCATTGTCC AACGCCAACT CTAGCGGCAG CCAGTCAAGA AGGTCGTCCA CCAACAACAG CGGGTCGCCA TTCAGCAATT ACAAAACCTA CAAAGTTGAC CAAGTCTACG GTTCTCAGGC TGACCAGAAC CTCATCTTTG AAAAAGTTGC ATTGCCTTTA TTCAACGATT TTTTGCAAGG TTCCAACGTT ACCATACTTG CTTACGGTCA AACTGGTACA GGGAAGACGT TCACTATGTG CGGCGGGGAA CAGAAAAACA GCAACGTAGA CTATAAACAT TCAGAGACGG CTGGTATCAT TCCGCGTGTG CTCATTGAAC TCTTCAACAA ATTAGAGCCA GAAGGAGCCG CTTCTGACTA TGTGGTGAAA TGCTCATTTT TGGAGCTCTA TAACGAAGAC TTGAAGGATT TGTTGAACGA CGACGAAAAA CCAGCGAAGT TGAGAATATA TGAGTCCACA GTGGCTGCCA ATGGTAACAA GAAGGAAGCA GGTAAAGCAT CAAAAACAAT CTCGATCCAA AACTTAAGGG AAGAAAGTAT TCTGTCGTGT CAAGACGGGT TCCAAATTTT ACAGAAAGGT CTCTTGAAAA GAAAAACGGC AAGCACAAAA CTAAATGACG TATCTTCAAG ATCCCACACT TTATTTACAA TAAACTTATA CAGAAACCAA CCCGGCTCCG ATGGTACTGG TTCCCAATTA TTCAAAGTAT CAAAAATGAA CTTGGTAGAC TTAGCAGGTT CTGAGAATAT CTATAGGTCT GGAGCGCAAA ATCAGAGAGC CAAAGAAGCA GGATCAATCA ACCAGAGTCT ATTGACTTTA GGGAGAGTAA TAAATTCATT AAGTGAACTT GCAAATTCTT CTAATGCAGA TAATACCTTC CATATACCTT ACAGAGAGTC TAAACTTACA AGACTACTTC AAGATTCAAT TGGAGGTTGC ACCAAAACAT CATTGATAGC TACAATTTCT CCAGCAAAGA TCAACATTGA CGAAACCATT TCCACTTTGG ACTATGCGTG CAAAGCTAAG AATATCAAGA ATTTGCCACA ATCGGGCCAT GATTCTGATT TGATAATGAA GAGAGTGCTT GTCAAAAATT TATCTCAGGA AATAGCAAAG TTAAACTTCG ACTTAATAGC TACCCGGAAC AAGAATGGGA TCTGGTTAAA CGAAGATAAT TATAACGCCA TAATGGAAGA AAATGAATCC TTGAAAGCTA GTCTAAAGGA ATCCAATCTC CAAAATGAAC TGTTGAATTC CAAAATCTCT CAATTTGAAG TCTTCAAGGC AAACAATGAG AATAATATCA AAAAGCTCCG GGAGCAAATA AACAAGCAGG TGGGAATCAA TGAAGAGTTA TCCAACGAGT CAACACTGTT AAAGTCTTAC ATAGTTTCAA AGGACGAAGA AATTAAGCAA CTAAGCGAGC AATTGGTGAA AGCAAACGAG AAGTTCAGTT CCACTACTAA CCAGTTGGTC AAGGTGATTT ACCGTAATCT AGATACATCC ATCAATTCAA TTCAAGATAT ATTGAGTCAG TATAATAATT CAGCAAACGG GGAAACCTTG TTCACGTTCA ATACTCAACT TACGGGCAAC ATTGAAAACT TTAGAAAATC ACTTGAAGAA AAAATAGCTG AGATTAACGA TAATCTTGCC AACTCATTAC TACAGGATCT TCCATATTTT CTTGAGAAAT ACAATGAGAA CTATGATAAG TTGAGCACAT TGATATCTAG TCTAAATTCG CAGTTGATGC AGAATTTGTC GGATCTCAAG GTTGCAAATG ACAAGTTGTC AGGATATATT ATCGAAGATC ACTTGAATTA CAACGCACAA GAATTAATTT CCCAACTAAT CGAAACTAAG GTCTCATCTC AATTGACTAA ATTACATGAA AAGATGGACA AAAGCATTGC CACAATCTTG CGAGACTCGA AACAGAATTA CAAGAAACTT TTCGAGACTT CAATTTCTCA AATATCGCAA GAGTTAATTG AGTCTGAAAG AAATGAAATC TCTAAGAGAG AAAAGAACTG GTCCACCGAG ACATCCAGAG TCTTGAATGT AATAGACCTG CAAATGTACG AAGCACGTCA AGAAGAAGTC GAACAAAGCA AAGCTATCTT TGACTCTTTG AGCATGCTGA CTACTGAAAG GCTCACTGAC TTAAAGAATA AAACTACCGA GAATTTGTCA AAGTTAACAG AACTCGTTTC CAACGAAGAG AACCCCAAGG TAAGTCTCCT CCAAAGAAAT TTGCCTTGTT TAGAGGATAT ATCGAAGAAT ATTCAATTGA ACGACATCAA AATACGAGAT TCTCTAACAA CGATAGACAA GAGTTTACAG GATATTAAAC AATTTGATGC AAAACAAGCA TTCAAATTGT CACCAGTACG TGGCTCAAAG CAAATCGAAA TTGATGGCTT GAAAAGATCT CCTTCGAGGA GTCCCTCTTA CTCCAACCCC CCCTCTAGAA CTGCCAGCAG ACAAATATCT CCAATAAAAA CAGCTGGAAC TTTGGCAAGA ACTAAAATAC CTCAGCTTAA TAGATCGCTT GATAATAAGG AGAATCAGGG CCCAAGCCAG AAGAGGAGAA GAGTTTTGCA ACAGGTCGAT AATTTCCTCC ATGGATGAAA CCTTGTTTGA AGAAATCTCC TTGTATTATA GAAGTATAGA CACCCACGAC GCATGTATAA TTAATGTACT TATAAGATTG T
|
Protein sequence | MSNIQVAVRC RGRNDKEVAA KSPIVIDLPN DTFSVTDPYV SINHNYTTQT VSSFNYKTYK VDQVYGSQAD QNLIFEKVAL PLFNDFLQGS NVTILAYGQT GTGKTFTMCG GEQKNSNVDY KHSETAGIIP RVLIELFNKL EPEGAASDYV VKCSFLELYN EDLKDLLNDD EKPAKLRIYE STVAANGNKK EAGKASKTIS IQNLREESIS SCQDGFQILQ KGLLKRKTAS TKLNDVSSRS HTLFTINLYR NQPGSDGTGS QLFKVSKMNL VDLAGSENIY RSGAQNQRAK EAGSINQSLL TLGRVINSLS ELANSSNADN TFHIPYRESK LTRLLQDSIG GCTKTSLIAT ISPAKINIDE TISTLDYACK AKNIKNLPQS GHDSDLIMKR VLVKNLSQEI AKLNFDLIAT RNKNGIWLNE DNYNAIMEEN ESLKASLKES NLQNESLNSK ISQFEVFKAN NENNIKKLRE QINKQVGINE ELSNESTSLK SYIVSKDEEI KQLSEQLVKA NEKFSSTTNQ LVKVIYRNLD TSINSIQDIL SQYNNSANGE TLFTFNTQLT GNIENFRKSL EEKIAEINDN LANSLLQDLP YFLEKYNENY DKLSTLISSL NSQLMQNLSD LKVANDKLSG YIIEDHLNYN AQELISQLIE TKVSSQLTKL HEKMDKSIAT ILRDSKQNYK KLFETSISQI SQELIESERN EISKREKNWS TETSRVLNVI DSQMYEARQE EVEQSKAIFD SLSMSTTERL TDLKNKTTEN LSKLTELVSN EENPKVSLLQ RNLPCLEDIS KNIQLNDIKI RDSLTTIDKS LQDIKQFDAK QAFKLSPVRG SKQIEIDGLK RSPSRSPSYS NPPSRTASRQ ISPIKTAGTL ARTKIPQLNR SLDNKENQGP SQKRRRVLQQ VDNFLHG
|
| |