Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80203 |
Symbol | VPS72 |
ID | 4851234 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1273547 |
End bp | 1276578 |
Gene Length | 3032 bp |
Protein Length | 896 aa |
Translation table | |
GC content | 41% |
IMG OID | 640392942 |
Product | vacuolar targeting protein |
Protein accession | XP_001387886 |
Protein GI | 126274220 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.397004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0921162 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCATAACCGC TTTCTTAAGA CTGTCCGTAT CCTCGTCCAT CCATTCAACT GACACTCTGT CCCACACTGA ACCTTCGCAT ATATAGTACC GCTAAATCTC CCTGCAACTA AGTCCCCACA GTTTCTATGG CACCAGAAAC AGGCCCAGAC CCAGGCGAAA ACAGCACAGA GTCTATCGCC AGAAACTTGG CTGCTGAAGT CCCGAACGTC GTCTCGGAAA ACTCCCAGTC TGTTTCTAAC GGCATCTCTA GTCGCACATC TGAGAATCCA TTTGCCAATG TAGATGATAC GACTCCACTC TTGAACAGCA ACGAATCCTA CACTAATGAA AACAACGAAG ACGAAAACAA TACGACATGT GACACCAATT TCAATAACAA TGATGACGAA TATGACACGC AGTCTAAGTT TGACTCCGAG TCTGTCCTCA AGAAGATCAA AAGACCGTTC TGGTGGTTTT TCGCTTTGGG AATAGTGGCT ATCATCATCT TCGAGCTTTC CTTTCTTCCT CGGACTTCTC TCAGTAGAGA CTTCAGAAGA TGGTACGGGC TACATCTCAC ACGTTCAGAC GTCAAGAGAC ACTTTATCTT ATTTTCAGGA ATTGGAAATT CTCATGACAG TTTGACCACT GAAGAGTACA TCAACACCTG GCTCACAAAC TTGACTGCTA TTAACAGCAA GAGTCCAGCC AACATTATTG CCGACGACAA CATAGAGCTA GTTTCACTTG TAGAGAAAAC ATTCAAGAAG TTCGGCTTCA AGACTTCATC TCATTCTTAC GATGTTCCCT TTTTGCAGAG ACCACAATCT CTGTCTGTGT CACTTGTAGA CTCTCTGAAT GGGAATGTAG TTTATAATGC CAACTTGAAG GAACCTCACT ACAAGACTCC TGCCTTTTAT GCTTTTGGAG CTAATTCGTC CGCAGCGGGC GATTACATCT TTGTCAACGA AGGTACTATC TCGGACTATC TCACTCTCAC GGCTCGCAAT TATGACATAA ATGGCAAAAT TGTCATCGTG AAGTCAGTCT TGAATTCAAA CATATCTGTA GCAGAAAAAG TATTGATTGC TGAGAAGTTC GGAGCGATAG GCTTCATCAA CTATTACGAC TTGCAACTGG AAAACAACAA GGAAAGTGAG TTGCAATTGA ATATAGCCAT TTCTCGCGAC AACGTAGTTA CTGGTCATAT TGGCAATTGG AAGCGACCTT CGATCCCGGC CATTCCATTA AGTCGTAAGG CTGTCAATCC CATTTTGGGC ACTTTGGCGA AGAGTAAACA GATGGAAGTG GTTTCTGAAT GGGAGTACAA TCCGACTAAT ATCGGAGGAT CGCTTACACT CAACATTTCA GCGGTATTTG AAGATACAAA GACGCGTAGA TTGACCAACA TAGTAGGAAC ATTAAAGGGT GTGATGAATG ATGGTAATAT TATTATTGGA GCCAGGAGAG ATTCGTTGAC ATCTTCGAAT CCTTCCAGTG GTCACGCGGT GTTGTTTGAA ATCATGAGAA ACTATCAACG TTTGACCATC AAAGGGTGGA AACCGTTGAG AACAATAAAG TTCATTTCTT GGGATGGTTC TTCCTCTGGC GTACTTGGAT CTCAGTTGTT GATAAATGAC ACGAATGTTT TGGACCCTAA GCAATCTGTT ATTGCCTATA TTAATATCGA TGGCGATGCT GTCACGGGTT CCCGATTCAA AGTTGATTCT AACCCGTTGT TCAATCATCT CTTGAGAAAA ACAGCCAAGT ATGTTCCCAT TCCGAAAACT GCTGCTTCGT ATAAGACGTT GTCTGAAGTG GACAAGGAAA AGTTTTTCAA AAGTTTGGAC GACACGGCTA CCAATCAAGC TGACGAAATG ATGAAGATAT TCAAACTCAC ACAGGACGAT GTAACTGCAG ATGATGACGA TGCTGATGAC GATAATGACA ACGACAACGA CGATGACGGA GATGACGAGG ACGGCTACAC TACTTTACAC AAGTATTGGT CCAAACAGGA TAACAATACG ATCCATGGAA TATCGGGACC GGAATTGACA CATTCTGAGG CTTTCATTTT CCAAGGTCAC TTGAGTACAC CTTCTATCAA TATCAAGTTT GATAATGATG CCAAGCGTGA CTCTTCGTTG TATGTTCCTA ACTCTAATTA CTATTCGTAC GATTGGCTTG TCAAGAGACA AATAGACAAT GACCTACTTT TGCATGGTCT GTTGATTCGT TTCATAGGTT TGTTGGCGAT CTCTTTGAGT GAACATGAAA TGGTAGAAGT CAGAACCAGA TATTACTATC GTGATATCAA TCGATTCTTT TCTTTCTTCC TCATTGAAAA CCAACCCCAA TTATCGAAGT GGGGTCAAGA CAAAGTTTCC TCGTATCTTA TAAACAAATC GTACATTTTA CTGGATCTCA AGCGAGATTT GAAAGACGAA CCTACAGTGA GATTCGTTGA CTTGCTTTCG CAATTTCAGG TGTTACTCAA CGACTTGACA CACCAATCGT TGATTTTCGA CAAGTGGAAT AAAAAAGTTC AAGAGGGATT AATCGAAGAT TACCCTTGGT ATAGATGTTA TAAAAAGTTT GCTCATTTTG CCCAGTTCAA GGTATCCAAT CACAAGTTGC TCCATTTGGA GCGTGAGCTT ACTTTGAACC CGAGAGATTA CCAGTTTCTC CAGAATGGCA ATGGAAATGA CGAGAAACAG AAAGAAGCAT ACTTCAACCA TGTTATCTAT GGGCTTCCCA AGTTCTCTGT CAACTCTAGT ACAGATTATC TTAACAGCCG ATTCAAATAC AGCACATTCA CCAATCTCCA TGAATCGGTA CAAGAGAGTG ATTTCGAGCT AACCGTCAAA TGGCTAGCAG TTACTTATGA TAAGTTACGA AACTTAAATT ACAAAATGAC ATAAACAGGT TTTCAGTTCG GTACTATAGT TTTGGAAGTA ATATACTAGC TATCTATTTC TTTATCTATT TATTTATTAG TGTATTTATT GCTTTATTTG TGTGTTTATT TCATTTCTTA TCCGATTCAA AA
|
Protein sequence | MAPETGPDPG ENSTESIARN LAAEVPNVVS ENSQSVSNGI SSRTSENPFA NVDDTTPLLN SNESYTNENN EDENNTTCDT NFNNNDDEYD TQSKFDSESV LKKIKRPFWW FFALGIVAII IFELSFLPRT SLSRDFRRWY GLHLTRSDVK RHFILFSGIG NSHDSLTTEE YINTWLTNLT AINSKSPANI IADDNIELVS LVEKTFKKFG FKTSSHSYDV PFLQRPQSLS VSLVDSLNGN VVYNANLKEP HYKTPAFYAF GANSSAAGDY IFVNEGTISD YLTLTARNYD INGKIVIVKS VLNSNISVAE KVLIAEKFGA IGFINYYDLQ LENNKESELQ LNIAISRDNV VTGHIGNWKR PSIPAIPLSR KAVNPILGTL AKSKQMEVVS EWEYNPTNIG GSLTLNISAV FEDTKTRRLT NIVGTLKGVM NDGNIIIGAR RDSLTSSNPS SGHAVLFEIM RNYQRLTIKG WKPLRTIKFI SWDGSSSGVL GSQLLINDTN VLDPKQSVIA YINIDGDAVT GSRFKVDSNP LFNHLLRKTA KYVPIPKTAA SYKTLSEDDV TADDDDADDD NDNDNDDDGD DEDGYTTLHK YWSKQDNNTI HGISGPELTH SEAFIFQGHL STPSINIKFD NDAKRDSSLY VPNSNYYSYD WLVKRQIDND LLLHGLLIRF IGLLAISLSE HEMVEVRTRY YYRDINRFFS FFLIENQPQL SKWGQDKVSS YLINKSYILL DLKRDLKDEP TVRFVDLLSQ FQVLLNDLTH QSLIFDKWNK KVQEGLIEDY PWYRCYKKFA HFAQFKVSNH KLLHLERELT LNPRDYQFLQ NGNGNDEKQK EAYFNHVIYG LPKFSVNSST DYLNSRFKYS TFTNLHESVQ ESDFELTVKW LAVTYDKLRN LNYKMT
|
| |