Gene Information |
Locus tag | PICST_59543 |
Symbol | VSP33 |
ID | 4838814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 372621 |
End bp | 374744 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390129 |
Product | Vacuolar sorting protein VPS33/slp1 (Sec1 family) |
Protein accession | XP_001384027 |
Protein GI | 150864989 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5158] Proteins involved in synaptic transmission and general secretion, Sec1 family |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
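The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 372621
end_bp = 374744

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2124 bp) and Protein Length (707 aa).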
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0683102 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATACA CGCCTTCGGA TGATCTTGGG CTCGCATTGG ATGAATTCAA TAGCAAATCG CTCGAGAATC TCCTCCAGCT TCTAGCCAAG ATCTACACTT CCAACAACTT GCTAGTCCTA GACCAGAGTC TCTCACCATT TCTCAATAGT CTCACAACTT TCCTGAAGCT AAAGGAGCAA GGCAAGTTCA GCAACATCAT CTGGCTTAAC AACGAGCTTC TATCGCATGA TCTCGACGTT TTCAGAAAGT TTGCCGGGTT AATCATCTTA GTTCAAGAAA CAGAAGACAA TCGCAGTTTG ATAGAGCAAT TGCTCGTTTC TAACCTCGCT GTGTTAGACA AAATTCATGG TCTTAAAGTT AACATAATAG TTAAGGACTT ATCCCGTTCT TATCTATACG AGTTGAACAA ATCGTTTGGC GGAAACGTCG TCAACTTCGA TCTGATTCTT AGTCACGATC TCAAATCGAA ACCTGTAACA AGTATATCAC CCCGTATCAA GGTGCTAAAC TGGGAAACAT TACCTGTCTA TAATGACGAC ATCATTGTGG TCGATGTAGG TAAATACGGC GGTATAGATT CGTACTTCGA CGAGCCGTTG AAACAGGTGA ATCAACTTGT AGAAGCGCTA GTTCAAATTC TTTTCGCCAG TCATGGAAGT AAGAATAATG TACTCAAATT GAGGAACATC TACGGAAAAG GCAACCATGC TGATTTCCTA GTCCAGCTAC TTCAAGACGT TAAGATTCCT GAGTATCTCA ATACCAACTT GAACCAACTA GAGATCGAGT TCTACCGTAC AAAATTGCTC AGTAATACTG ATTTGGTAGT TCTTGAAAGA AACTTGGACT TCTATTCAGT TCTCTTTAAC CAGTTAAACT ACCAGGGTTT GATCGACGAC TTGTTTGAGT TGAAGTTCAA CACCATAACA AATCCAGTAG GTGATTCTCT GGCTTACAGT AATCTATCCA ATGATGTTCT TTATTCCGAC TCGTTGAGAC ATTTGAACTT TGCATCAATA GGACCAGAAT TGAATAAGCT AGCCAAAGAG ATTCAGCAAC AGTTCAAGTT GAAAGACACG GAGAACGATA CGTTAAACTC AAACTTACAG GATATACGTA AAATAGTCCA GAACTTGGGA ACTTTGACGC AACAGCAAGA TCTCATTAAA AAACACACCT CGATCAGCGA GTCTATTCTC GAGAGAATCA ACTCTGAGTA CGAAACTTTT CTTACGTTCC AGAATGATAT CTTTGAAATG GATTACAAAT TGCAATTGTC CAAACTAAAG TACTTCTTCA ATATCAACTT TAACCAGTAT ATAATTCTCA CCACTATCGT CTTGGTGTCG ATTACTAACA ATGGTATTAA AGAGCGAGAC TTTGACTGGA TCTCCCAGGA AGTGTTGGAT AGCTATGGAA TTTCTACTAG TTTGGCTCTT GAAAAATTGG TCGAATACAA AATGATAAGA ATCAACGTCG ATTATGGGAA CGACTTCTTG AGCACAATCA CAGGTGGTTT GGCCAGTAAC CAACAGACAA CACCTGTTCC GGACGAGAAC CAAAGTCTTG CCAACTTGGG AATTACTGGT GCTCAGGATA CATACAGAGC GAACTTTACC TTGATAGACA AGTTCTGGAA TCTACATCCT TTGGAAGAGG AAGAAGAAAA AGGAGAGGAG CCAGTACAGC ATCCAGACTC CAAAGACCTT TTGGTGGATT TGTATCCGAA TCCATCCTTT ACTTTGCCTT CTAACACGGT TCCGTTGTTA ACACGATTGG TAGAGGCTCT CTATCTTCGG 
GACTTCTTGA AGTATAAACC AGTGAACAAC ATCAAGAAGC GTCCCAACTG GGACAACCTA TCTACGAGCA CTATGTTCAA TGGCAAAACG GTGGATATCA ACATTGACGA TACTCTGGAT GTGAGAACCA CCAAGCCGTC ACCGTTGGCT ACACACCAAG AGTACATTGT TGTTGTAGTA ATAGGTGGAA TCACCAGAAG CGAAATCAGT TGTTTGAAGT ATCTACAGCA AAGGCTTGTC AAGAACAAAA AGAACAAGAA AATCGTTGTA GTTTCCAGTG GAATCGTCAA CAACAGGAAG CTTCTCGATT TCTTTTTAGA CTAG
|
Protein sequence | MSYTPSDDLG LALDEFNSKS LENLLQLLAK IYTSNNLLVL DQSLSPFLNS LTTFSKLKEQ GKFSNIIWLN NELLSHDLDV FRKFAGLIIL VQETEDNRSL IEQLLVSNLA VLDKIHGLKV NIIVKDLSRS YLYELNKSFG GNVVNFDSIL SHDLKSKPVT SISPRIKVLN WETLPVYNDD IIVVDVGKYG GIDSYFDEPL KQVNQLVEAL VQILFASHGS KNNVLKLRNI YGKGNHADFL VQLLQDVKIP EYLNTNLNQL EIEFYRTKLL SNTDLVVLER NLDFYSVLFN QLNYQGLIDD LFELKFNTIT NPVGDSSAYS NLSNDVLYSD SLRHLNFASI GPELNKLAKE IQQQFKLKDT ENDTLNSNLQ DIRKIVQNLG TLTQQQDLIK KHTSISESIL ERINSEYETF LTFQNDIFEM DYKLQLSKLK YFFNINFNQY IILTTIVLVS ITNNGIKERD FDWISQEVLD SYGISTSLAL EKLVEYKMIR INVDYGNDFL STITGGLASN QQTTPVPDEN QSLANLGITG AQDTYRANFT LIDKFWNLHP LEEEEEKGEE PVQHPDSKDL LVDLYPNPSF TLPSNTVPLL TRLVEALYLR DFLKYKPVNN IKKRPNWDNL STSTMFNGKT VDINIDDTSD VRTTKPSPLA THQEYIVVVV IGGITRSEIS CLKYLQQRLV KNKKNKKIVV VSSGIVNNRK LLDFFLD
|
| |
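The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 180 bases of the gene sequence above. The helper `translate` and the constant names are hypothetical, written for this example; the codon tables are built by hand rather than taken from any library.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with the
# standard genetic code (table 1) amino acid string.
CODONS = ["".join(c) for c in product("TCAG", repeat=3)]
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_1 = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from table 1 only in that CTG encodes Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate complete codons of `dna` using a codon -> amino acid dict."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 180 bases of the gene sequence in this record (spaces removed).
GENE_PREFIX = (
    "ATGTCATACACGCCTTCGGATGATCTTGGG"
    "CTCGCATTGGATGAATTCAATAGCAAATCG"
    "CTCGAGAATCTCCTCCAGCTTCTAGCCAAG"
    "ATCTACACTTCCAACAACTTGCTAGTCCTA"
    "GACCAGAGTCTCTCACCATTTCTCAATAGT"
    "CTCACAACTTTCCTGAAGCTAAAGGAGCAA"
)

# First 60 residues of the protein sequence in this record.
PROTEIN_PREFIX = (
    "MSYTPSDDLGLALDEFNSKSLENLLQLLAK"
    "IYTSNNLLVLDQSLSPFLNSLTTFSKLKEQ"
)

with_table_12 = translate(GENE_PREFIX, TABLE_12)
with_table_1 = translate(GENE_PREFIX, TABLE_1)

print(with_table_12 == PROTEIN_PREFIX)  # table 12 reproduces the record
print(with_table_1 == PROTEIN_PREFIX)   # table 1 differs at the CTG codon
```

Codon 55 of the gene is CTG, so the two translations differ at residue 55: table 12 gives S, as in the listed protein sequence, while the standard code would give L.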