Gene PICST_80203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80203 
SymbolVPS72 
ID4851234 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1273547 
End bp1276578 
Gene Length3032 bp 
Protein Length896 aa 
Translation table 
GC content41% 
IMG OID640392942 
Productvacuolar targeting protein 
Protein accessionXP_001387886 
Protein GI126274220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.397004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0921162 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCATAACCGC TTTCTTAAGA CTGTCCGTAT CCTCGTCCAT CCATTCAACT GACACTCTGT 
CCCACACTGA ACCTTCGCAT ATATAGTACC GCTAAATCTC CCTGCAACTA AGTCCCCACA
GTTTCTATGG CACCAGAAAC AGGCCCAGAC CCAGGCGAAA ACAGCACAGA GTCTATCGCC
AGAAACTTGG CTGCTGAAGT CCCGAACGTC GTCTCGGAAA ACTCCCAGTC TGTTTCTAAC
GGCATCTCTA GTCGCACATC TGAGAATCCA TTTGCCAATG TAGATGATAC GACTCCACTC
TTGAACAGCA ACGAATCCTA CACTAATGAA AACAACGAAG ACGAAAACAA TACGACATGT
GACACCAATT TCAATAACAA TGATGACGAA TATGACACGC AGTCTAAGTT TGACTCCGAG
TCTGTCCTCA AGAAGATCAA AAGACCGTTC TGGTGGTTTT TCGCTTTGGG AATAGTGGCT
ATCATCATCT TCGAGCTTTC CTTTCTTCCT CGGACTTCTC TCAGTAGAGA CTTCAGAAGA
TGGTACGGGC TACATCTCAC ACGTTCAGAC GTCAAGAGAC ACTTTATCTT ATTTTCAGGA
ATTGGAAATT CTCATGACAG TTTGACCACT GAAGAGTACA TCAACACCTG GCTCACAAAC
TTGACTGCTA TTAACAGCAA GAGTCCAGCC AACATTATTG CCGACGACAA CATAGAGCTA
GTTTCACTTG TAGAGAAAAC ATTCAAGAAG TTCGGCTTCA AGACTTCATC TCATTCTTAC
GATGTTCCCT TTTTGCAGAG ACCACAATCT CTGTCTGTGT CACTTGTAGA CTCTCTGAAT
GGGAATGTAG TTTATAATGC CAACTTGAAG GAACCTCACT ACAAGACTCC TGCCTTTTAT
GCTTTTGGAG CTAATTCGTC CGCAGCGGGC GATTACATCT TTGTCAACGA AGGTACTATC
TCGGACTATC TCACTCTCAC GGCTCGCAAT TATGACATAA ATGGCAAAAT TGTCATCGTG
AAGTCAGTCT TGAATTCAAA CATATCTGTA GCAGAAAAAG TATTGATTGC TGAGAAGTTC
GGAGCGATAG GCTTCATCAA CTATTACGAC TTGCAACTGG AAAACAACAA GGAAAGTGAG
TTGCAATTGA ATATAGCCAT TTCTCGCGAC AACGTAGTTA CTGGTCATAT TGGCAATTGG
AAGCGACCTT CGATCCCGGC CATTCCATTA AGTCGTAAGG CTGTCAATCC CATTTTGGGC
ACTTTGGCGA AGAGTAAACA GATGGAAGTG GTTTCTGAAT GGGAGTACAA TCCGACTAAT
ATCGGAGGAT CGCTTACACT CAACATTTCA GCGGTATTTG AAGATACAAA GACGCGTAGA
TTGACCAACA TAGTAGGAAC ATTAAAGGGT GTGATGAATG ATGGTAATAT TATTATTGGA
GCCAGGAGAG ATTCGTTGAC ATCTTCGAAT CCTTCCAGTG GTCACGCGGT GTTGTTTGAA
ATCATGAGAA ACTATCAACG TTTGACCATC AAAGGGTGGA AACCGTTGAG AACAATAAAG
TTCATTTCTT GGGATGGTTC TTCCTCTGGC GTACTTGGAT CTCAGTTGTT GATAAATGAC
ACGAATGTTT TGGACCCTAA GCAATCTGTT ATTGCCTATA TTAATATCGA TGGCGATGCT
GTCACGGGTT CCCGATTCAA AGTTGATTCT AACCCGTTGT TCAATCATCT CTTGAGAAAA
ACAGCCAAGT ATGTTCCCAT TCCGAAAACT GCTGCTTCGT ATAAGACGTT GTCTGAAGTG
GACAAGGAAA AGTTTTTCAA AAGTTTGGAC GACACGGCTA CCAATCAAGC TGACGAAATG
ATGAAGATAT TCAAACTCAC ACAGGACGAT GTAACTGCAG ATGATGACGA TGCTGATGAC
GATAATGACA ACGACAACGA CGATGACGGA GATGACGAGG ACGGCTACAC TACTTTACAC
AAGTATTGGT CCAAACAGGA TAACAATACG ATCCATGGAA TATCGGGACC GGAATTGACA
CATTCTGAGG CTTTCATTTT CCAAGGTCAC TTGAGTACAC CTTCTATCAA TATCAAGTTT
GATAATGATG CCAAGCGTGA CTCTTCGTTG TATGTTCCTA ACTCTAATTA CTATTCGTAC
GATTGGCTTG TCAAGAGACA AATAGACAAT GACCTACTTT TGCATGGTCT GTTGATTCGT
TTCATAGGTT TGTTGGCGAT CTCTTTGAGT GAACATGAAA TGGTAGAAGT CAGAACCAGA
TATTACTATC GTGATATCAA TCGATTCTTT TCTTTCTTCC TCATTGAAAA CCAACCCCAA
TTATCGAAGT GGGGTCAAGA CAAAGTTTCC TCGTATCTTA TAAACAAATC GTACATTTTA
CTGGATCTCA AGCGAGATTT GAAAGACGAA CCTACAGTGA GATTCGTTGA CTTGCTTTCG
CAATTTCAGG TGTTACTCAA CGACTTGACA CACCAATCGT TGATTTTCGA CAAGTGGAAT
AAAAAAGTTC AAGAGGGATT AATCGAAGAT TACCCTTGGT ATAGATGTTA TAAAAAGTTT
GCTCATTTTG CCCAGTTCAA GGTATCCAAT CACAAGTTGC TCCATTTGGA GCGTGAGCTT
ACTTTGAACC CGAGAGATTA CCAGTTTCTC CAGAATGGCA ATGGAAATGA CGAGAAACAG
AAAGAAGCAT ACTTCAACCA TGTTATCTAT GGGCTTCCCA AGTTCTCTGT CAACTCTAGT
ACAGATTATC TTAACAGCCG ATTCAAATAC AGCACATTCA CCAATCTCCA TGAATCGGTA
CAAGAGAGTG ATTTCGAGCT AACCGTCAAA TGGCTAGCAG TTACTTATGA TAAGTTACGA
AACTTAAATT ACAAAATGAC ATAAACAGGT TTTCAGTTCG GTACTATAGT TTTGGAAGTA
ATATACTAGC TATCTATTTC TTTATCTATT TATTTATTAG TGTATTTATT GCTTTATTTG
TGTGTTTATT TCATTTCTTA TCCGATTCAA AA
 
Protein sequence
MAPETGPDPG ENSTESIARN LAAEVPNVVS ENSQSVSNGI SSRTSENPFA NVDDTTPLLN 
SNESYTNENN EDENNTTCDT NFNNNDDEYD TQSKFDSESV LKKIKRPFWW FFALGIVAII
IFELSFLPRT SLSRDFRRWY GLHLTRSDVK RHFILFSGIG NSHDSLTTEE YINTWLTNLT
AINSKSPANI IADDNIELVS LVEKTFKKFG FKTSSHSYDV PFLQRPQSLS VSLVDSLNGN
VVYNANLKEP HYKTPAFYAF GANSSAAGDY IFVNEGTISD YLTLTARNYD INGKIVIVKS
VLNSNISVAE KVLIAEKFGA IGFINYYDLQ LENNKESELQ LNIAISRDNV VTGHIGNWKR
PSIPAIPLSR KAVNPILGTL AKSKQMEVVS EWEYNPTNIG GSLTLNISAV FEDTKTRRLT
NIVGTLKGVM NDGNIIIGAR RDSLTSSNPS SGHAVLFEIM RNYQRLTIKG WKPLRTIKFI
SWDGSSSGVL GSQLLINDTN VLDPKQSVIA YINIDGDAVT GSRFKVDSNP LFNHLLRKTA
KYVPIPKTAA SYKTLSEDDV TADDDDADDD NDNDNDDDGD DEDGYTTLHK YWSKQDNNTI
HGISGPELTH SEAFIFQGHL STPSINIKFD NDAKRDSSLY VPNSNYYSYD WLVKRQIDND
LLLHGLLIRF IGLLAISLSE HEMVEVRTRY YYRDINRFFS FFLIENQPQL SKWGQDKVSS
YLINKSYILL DLKRDLKDEP TVRFVDLLSQ FQVLLNDLTH QSLIFDKWNK KVQEGLIEDY
PWYRCYKKFA HFAQFKVSNH KLLHLERELT LNPRDYQFLQ NGNGNDEKQK EAYFNHVIYG
LPKFSVNSST DYLNSRFKYS TFTNLHESVQ ESDFELTVKW LAVTYDKLRN LNYKMT