Gene Information |
Locus tag | PICST_28179 |
Symbol | VPS53 |
ID | 4850958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 552342 |
End bp | 554675 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | |
GC content | 38% |
IMG OID | 640392666 |
Product | protein required for protein sorting at the late Golgi |
Protein accession | XP_001387747 |
Protein GI | 126273919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
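The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. A minimal sketch in Python, assuming Start bp and End bp are 1-based inclusive coordinates (the record does not state the convention); the variable names are illustrative only:

```python
# Values copied from the Gene Information table.
start_bp = 552342
end_bp = 554675

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# The gene is listed as unspliced, so the CDS is one contiguous run of codons.
# The final codon is the stop codon and does not add a residue.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)  # prints: 2334 0 777
```

Under that assumption the result matches the listed Gene Length (2334 bp) and Protein Length (777 aa).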
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.665656 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACAT ACGAGTACGA CCCGGTGCCC GATCTCCGGC GGATTTTCAT ATCGCCGAAT ACACTTGACG AGTTGCCGCA ATTGCTAGAC TACACGAATT CATACAAACA GAAACTTGAT GAAGAAATCC AACAGGATAT CTCTGAATAC AACAGTAGTC GTCCCAACGG TCTCAATGAC GACATTTGCA ATCTCGTAGA TCTCATTAAA GGAATCAAGC TGGACTCAGA TGCTACTCGT CTGTCAATTG TAGCTATGAC CGGATCGATC CAGAACTTGG ATCAGTACAA GAAGAACTTG GTATTGTCGA TGACGATTCT CAAACGACTT CAAATGCTTA TAAATGCCAA CAATACGCTC ATACAGGTGA TGTCTTCACA CAATTACCAG GAGATTCTAC TGCTTTTTAG CGTTATTAAG GAATTATTGG GCTTTTTCAA ACCATACAAA TCGATAGATG AAATCAATCA GTTGAACCTA ATGGTTGTCA GTACCCAAAA TAAGCTCATA GATGATATCT TCATAGATTT TGAGGATTTT TCGACACAGA GACTACAAGA CCGCGAGGAT CAGTTGATCT ACGGCTGTAA GATCTTAGAG TTGATTGACT TGAAGTACAA GGACAAGCTC TTGAACTGGT TTTTCAACTT GCAACTCAAA GATATAAGGT CCATCTTCAA CAACTTGGAT GAGGCAGGCT CCTTGGACAA CTTGAACCGC AGATATATCT ACTTCAACAA CACTTTGAAG AGCGTCCAGG AAAGGTATCT TGAAATCTTC CCCAAGGACT GGAAGGTCGA CTTGGAATTG AGTAAAATCT TTTGCTCCAT GACCAAGCAG GATCTCATAA ATTTGCTCAC ATCGTCAAAT GTCAAATCTA ATACCCTCTT GGACAACTTG ACAGCAACAT TAGATTTAGA AAAACTCTTG AACGATACCT TCAAAACGAG CGAGTTCACA CTGATCATAT CCCTGGTGTT TGAACCTTAT TTGTTGATAT GGATAAATGA ACAGGACAAA CTTCTCAGCT CCAAATTCGC AGAATTTATG TCGATTTCAC AATTGCCATC AGAATTAAAC GAGAAGGACG ACTTCTTAAC GGTATTGAAG GTTAATAATG TTCCTAACAT CGCAAACTCT TCAACTGAAT TGTTCAAGAA TTTCCAAAAG ATTCTTACCC TGATTTTGAA ACTCAGCAAT GGTGAGATTT TGATAGAGTT GAGTAAATTA TTCATCAAGT ACTTGTATGA TTTCCACAAT AAGATATTGG CACCGATGGT TCCAAAAAAC GATGACGAAT TGGGAGGTGG TATAGAGCCT TTGAAGTACT TGACTATGTT ACTCAATACT GGGGATTATG TTATAAATAA TATTGACGAT CTAGCTGACA AATTCAAAAC TTTAATCAAG GATCAATACG AGCAGAGGTT GCCTTCCTAC GAAAACGTCA AAGACATATA CTTTAAGTTG ATCAACAAGT CCATTTCCAA TCTCTTGATC AAGATCTCCA ATGATCTCAA GTTCAGCTGG AGACAATTCT TGAATATAAA CTGGTCCAAT TTAGACACGA TTAATGATGT TTCCAGCTAT ATGTTGGAAT TGAAGAAGCA GATTATCACC AATTTACAGG TGATTCTCCC TTTGATTATA AGAGAAAGTT ATATAAGAAA TTTCAACGAT AAACTTGTTG AATTGCTTAT AACTACACTC AGCAACAACC TCAAGTTTGT AAAGCCTTTG AATATGATTA GTTTGGAGCA GATCCTTCTT GACATTACCA ATCTCAAAGA CGTTTGTTTG 
ACATTCCCAT TGTATTCCGA TCCTAATTAT TCCGAATCTA AGAATACAAC TAGTAGCTCA CCATCATACC AGAAGTTTGT CAGCAACCAG TTCCACAGCT TTGAAAGCTT GTTGAAGGTC TTGATGGTTC CTGAACTTCC AATAGAGAAT ATAATAGAGA GTTACTTTGA ATTGATAGGA GACAAGTCTA TTCGCAACTT CATGAAGATT CTCAACTTGA AGAATATCGA CAAGTCGGCT CAATCAAAGT ATATTGAAAA TTTCAAACTA CAGCTCACTC TAGATGATGG TACCTTGACA AACCAAAACC AGTTACTATC TAATTTAGAA GACGAAGAAG AGTCGGGATC GGTTAGTATT TCTCAGGTCA GCACACCTAC CCCAGACTTT AAGTCACCAA AATTGCTTCC TACGAAAATC AACAACTTTG AGAAAAATCT CCGAGAATTT GCTATCACTG GAGAAAGCCA TGTCAATAAG TTAAACGAGA ACTTCAAGAA CTTTGGTAAG TTCTTCAGAA AGGATAACGA CTAG
|
Protein sequence | MVTYEYDPVP DLRRIFISPN TLDELPQLLD YTNSYKQKLD EEIQQDISEY NSSRPNGLND DICNLVDLIK GIKLDSDATR LSIVAMTGSI QNLDQYKKNL VLSMTILKRL QMLINANNTL IQVMSSHNYQ EILLLFSVIK ELLGFFKPYK SIDEINQLNL MVVSTQNKLI DDIFIDFEDF STQRLQDRED QLIYGCKILE LIDLKYKDKL LNWFFNLQLK DIRSIFNNLD EAGSLDNLNR RYIYFNNTLK SVQERYLEIF PKDWKVDLEL SKIFCSMTKQ DLINLLTSSN VKSNTLLDNL TATLDLEKLL NDTFKTSEFT LIISLVFEPY LLIWINEQDK LLSSKFAEFM SISQLPSELN EKDDFLTVLK VNNVPNIANS STELFKNFQK ILTLILKLSN GEILIELSKL FIKYLYDFHN KILAPMVPKN DDELGGGIEP LKYLTMLLNT GDYVINNIDD LADKFKTLIK DQYEQRLPSY ENVKDIYFKL INKSISNLLI KISNDLKFSW RQFLNINWSN LDTINDVSSY MLELKKQIIT NLQVILPLII RESYIRNFND KLVELLITTL SNNLKFVKPL NMISLEQILL DITNLKDVCL TFPLYSDPNY SESKNTTSSS PSYQKFVSNQ FHSFESLLKV LMVPELPIEN IIESYFELIG DKSIRNFMKI LNLKNIDKSA QSKYIENFKL QLTLDDGTLT NQNQLLSNLE DEEESGSVSI SQVSTPTPDF KSPKLLPTKI NNFEKNLREF AITGESHVNK LNENFKNFGK FFRKDND
|
| |
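The gene and protein sequences above can be related by translating the gene sequence codon by codon. The sketch below is a hedged illustration, not the database's own method: it assumes the standard genetic code (the Translation table field in this record is blank), uses only the first 30 nucleotides of the gene sequence to stay short, and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
# Assumption: the record leaves "Translation table" blank, so the
# standard code is used here purely for illustration.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 until a stop codon or the end of the sequence."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-letter blocks of the gene sequence listed above.
prefix = "ATGGTCACAT ACGAGTACGA CCCGGTGCCC"
print(translate(prefix))    # prints: MVTYEYDPVP
print(gc_fraction(prefix))  # GC fraction of this 30 nt prefix only
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the whole-gene value to compare against the listed GC content.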