Gene PICST_33956 details


Gene Information

Locus tag: PICST_33956
Symbol: VPS70
ID: 4840855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 862344
End bp: 864746
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table: 12
GC content: 43%
IMG OID: 640392170
Product: membrane protein involved in vacuolar protein sorting
Protein accession: XP_001386580
Protein GI: 150866850
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
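
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch of that check. It assumes 1-based, inclusive coordinates and a CDS that ends with a single stop codon; the function and constant names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields; the names are illustrative.
START_BP = 862344
END_BP = 864746


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2403 800
```

Under those assumptions the result agrees with the listed 2403 bp and 800 aa.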


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACG AAAAGGGTTT CTACACCCCT ATAGAGGTGG CTCCTTCGAG GAAGCAATCG 
TGGCTCAAGA GGTTCGCAGC TGGAGCCATT GGAGCCGCTG CATCTCTCTA TCTTCTTAGT
TCTGCTGCTC TGTACTTGAG TACTGAAATC TCAGAGACCG AGGCTAAGAA GATCATTTTG
GATGCGCTTG AAACAAACTT GGCCGGAAAC TGGTCCAAAA TATATACTGC TGAGCCTCAT
TTGGCAGGAA CCAACTTTGG CTTGGTGCAA TTCACCAGAG ATAAATTTGA AGAGTACGGT
TTGGATGCTT CTATCGATAC TTACGAGATT TATGTCAGTT ACCCTAACGA CCATGACTTG
AACTTATTGG ATGCGAAAAC TGAGGAAATC TTGTACAAGG CTCCATTAAA GGAAGACGTA
CTTAAAGAAG ATGAGACCAC CCACGGAAAT GATACTGTTC CTACCTTTTT GGCTTATGCT
GCTAATGGTA ATGTTACTGG TCAATATGTC TACGCAAACT ACGGTACCAA GAAGGACTTT
GAACAGTTAC AAGAATGGGG CGTAGATGTA AAGGGCAAGA TTGTCATTGT CAGATACGGT
GCTATTTTCC GTGGTTTGAA GGTGAAGTTT GCCCAAGAGA ATGGTGCCAT TGGTGTATTG
ATCTACTCTG ATCCTGGCGA TGACCATGGA ATTACAGAGA AAAACGGCTA CAAGCAGTAT
CCCCACGGTC CTGCTAGACA AGAAAGCTCA GTACAAAGAG GTAGTGTACA ATTCTTAGGA
GGAGTAGGTG CTACTCCTGG AGATCCTACT ACTCCTGGTT ATGCTTCTAA GCCTGGTGTA
GAACGTAAAG ACCCACACGC TAGTATTGGT AAGATTCCAG CCTTGCCAAT CTCATACCGT
GAAGTAAAGC CAATCTTGGA AAAGTTGAAC GGTGAAGGTC ACCATGCTAA GGAAGATTCT
TGGATTGGTG AATTAGAAGG CTTCAAGTAC TATACTGGCC CAAGCAAGAA TTATACATTG
AACTTGTACA ACGATCAGAT CTACAACATC TCCACTTTGT ACAACGTCTA TGGTGAAATT
CCTGGCGAAA AGAGGGATGA AGTTATCATA ATTGGTAACC ACAGAGATGC TTGGATCAAG
GGTGGTGCTG GAGACCCCAA CTCCGGTTCT GCTACCATCG TTGAAGTGGC TAGAGCTTTG
GGTGAGTTGA AAAAAGCCGG TTACAAGTTC AAGAGAACTA TAGTCTTCCA AAGTTACGAC
GGTGAAGAAT ACGGTCTTTT AGGCTCTACT GAGCAAGGTG AATACTTTGC AGACAAGTAC
AAGCGTACTG TAGTTGCTTA CTTGAACTTG GATGCATCCA CTACAGGCAG TCACTTGAAG
TTGGGTGCTT CTCCATTGTT GAACAACATC CTCAGAGAAG CTGCTAAGGA ATTGGAATAT
CCAGCCAAAG GTGTTGGCTC CTTGTACGAT CACTTTGTTC AGGAAACAGG TGACAACATC
AAGAACCTTG GATCTGGTTC GGACTACACT GTCTTCTTGG AACATTTAGG TATCCCTTCT
GTTGATTTGG GTTTTGGCCA AGGAAAGGGT GACCCTATCT ACCATTACCA TTCTAACTAC
GATTCCTTCT ACTGGATTTC CGAGTTTGCC GACAAAGGTT TTGTCTACCA TAACTTGGCT
GCTAAGTACT TGTCTCTCAT TGTTTTCGGC TTGGGTGAAC GTGAATTAAT CAACTTCAAG
TTGACTGATT ACTCTAAGGA CCTCATCAAA TACTACAATG AGACTATTAA GGCAGTTCCA
AAGAAGTGGT TCCACCAAGA AATCTCGCAA GAGACCTGGG ATAAATACTT GAACCATGAT
ATTGCTGATA TCGAGTTTAT TTCTAAGGCT ACTTCAAGAA CTTACCTTCC AAGAAGATGG
TTCCCTTTCC CAGAGTTGTT GAACGTGAGA GACTGTAAAA TGAAGGGTCA CTTGATGTAC
GACTCCAAGC ACCACAAGAA TGCTACTTTG GAGGACCTCC TTTTGAAGAC TCTTGAAGAT
ATTAACACTT TACAAAACAG TACTGTCTCA TTCGATTTGA AGACCTTTGA TCTTCAATCG
CAACATGAAA ACAAAGATGA CTTGAACTTC TGGCAGAGAT TTAAATTACA TTTCCAGATC
AAGGGCCACA ATAAGTTACT TCAATATTTC GAAAGAAACT TCTTATACCA CAAGGGCTTG
TACGAGAGAC CATGGTTCAA ACACATTGTG TTTGCCAGTG GTAGATTTAC AGGTTACGAA
GGCCAAACTT TCCCAGGGTT GAAGGAAGCT CTTGAAGATG GCGATTTTGA GAGATTTGTT
GATTGGCTCG GTATTGTGTC CAAGGCTATC AGAAGAATTA ATGCCGAGAT TGCAGTTAAA
TGA
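
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured. As a minimal sketch, the Python helpers below (hypothetical names, not part of the record) strip the formatting and compute a GC fraction. Only the first displayed line is used here; the whole 2403 bp sequence would have to be passed in to compare against the 43% GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only, not the full gene.
first_line = "ATGTCTGACG AAAAGGGTTT CTACACCCCT ATAGAGGTGG CTCCTTCGAG GAAGCAATCG"

print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))          # 0.5
```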
 
Protein sequence
MSDEKGFYTP IEVAPSRKQS WLKRFAAGAI GAAASLYLLS SAASYLSTEI SETEAKKIIL 
DALETNLAGN WSKIYTAEPH LAGTNFGLVQ FTRDKFEEYG LDASIDTYEI YVSYPNDHDL
NLLDAKTEEI LYKAPLKEDV LKEDETTHGN DTVPTFLAYA ANGNVTGQYV YANYGTKKDF
EQLQEWGVDV KGKIVIVRYG AIFRGLKVKF AQENGAIGVL IYSDPGDDHG ITEKNGYKQY
PHGPARQESS VQRGSVQFLG GVGATPGDPT TPGYASKPGV ERKDPHASIG KIPALPISYR
EVKPILEKLN GEGHHAKEDS WIGELEGFKY YTGPSKNYTL NLYNDQIYNI STLYNVYGEI
PGEKRDEVII IGNHRDAWIK GGAGDPNSGS ATIVEVARAL GELKKAGYKF KRTIVFQSYD
GEEYGLLGST EQGEYFADKY KRTVVAYLNL DASTTGSHLK LGASPLLNNI LREAAKELEY
PAKGVGSLYD HFVQETGDNI KNLGSGSDYT VFLEHLGIPS VDLGFGQGKG DPIYHYHSNY
DSFYWISEFA DKGFVYHNLA AKYLSLIVFG LGERELINFK LTDYSKDLIK YYNETIKAVP
KKWFHQEISQ ETWDKYLNHD IADIEFISKA TSRTYLPRRW FPFPELLNVR DCKMKGHLMY
DSKHHKNATL EDLLLKTLED INTLQNSTVS FDLKTFDLQS QHENKDDLNF WQRFKLHFQI
KGHNKLLQYF ERNFLYHKGL YERPWFKHIV FASGRFTGYE GQTFPGLKEA LEDGDFERFV
DWLGIVSKAI RRINAEIAVK
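
The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code only in reading CTG as serine rather than leucine. The sketch below is a hand-built Python illustration of that table, not the database's own tooling, and its names are hypothetical. It translates two excerpts of the gene sequence above: the first displayed line, and codons 41 to 50, which contain a CTG codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 12 in TCAG codon order.
# Identical to the standard code except that CTG encodes Ser (S).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str, table: dict = CODON_TABLE_12) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts copied from the gene sequence listing above.
first_line = "ATGTCTGACG AAAAGGGTTT CTACACCCCT ATAGAGGTGG CTCCTTCGAG GAAGCAATCG"
codons_41_50 = "TCTGCTGCTC TGTACTTGAG TACTGAAATC"

print(translate(first_line))    # MSDEKGFYTPIEVAPSRKQS
print(translate(codons_41_50))  # SAASYLSTEI
```

Both outputs match the corresponding residues of the protein sequence above. In the second excerpt the fourth codon is CTG, and the listed protein has S at that position (residue 44), as table 12 predicts.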