Gene PICST_80500 details

Gene Information

Locus tag: PICST_80500
Symbol: VPS29
ID: 4850972
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 597943
End bp: 599146
Gene Length: 1204 bp
Protein Length: 249 aa
Translation table:
GC content: 39%
IMG OID: 640392680
Product: protein involved in endosome to golgi protein transport
Protein accession: XP_001387345
Protein GI: 126273933
COG category: [R] General function prediction only
COG ID: [COG0622] Predicted phosphoesterase
TIGRFAM ID: [TIGR00040] phosphoesterase, MJ0936 family


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0205498
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTTTATCTCA TTCTTCATGC TTACATTAGC CATTGGTGAT CTCTACATTC CTGAGCGAGC 
TCTCGATTTG CCGGCCAAAT TCCGCAAGTT GTTGTGCCCC AATCCTCAAA GTATCCCTAC
CAATAGTAAA ATATCTGAGG TGATATGTCT TGGGAACATC ACCAATTCTG TTGATACGTT
GAAGTTTTTG CATGATTTAT CACCTTCGTT GCATTTAGTG AAAGGTGAGT TTGACGACTT
GCCAATATTG TCACAGCAGT TGTCGCTGGT GAGCAAGAAA GATGAGAATG TTGGTATATA
TGGGGTAATA ACTCATGATA ACTTGAGAAT CGGATTCACC AACGGCTACC AGGTAGTACC
CAAGAACGAC CCGTTGGCAT TGCTGACGTT GGCGAGAGAA TTGGATGTAG ATGTATTGAT
TTGGGGAGGA ACTCACAAAG TAGAAGCATA TACCTTAGAT GGCAAGTTCT TCGTGAATCC
TGGAAGCGGA ACGGGTGCTT TCAGTTTTGA TTGGCCCGAA TGGTACGAAG AAGAAGAGAA
CGCAAAGGAA GAAGAAATAA AGGAAAATGA AGACGAGGCA AAGCCAGAAG TAAACGAGGA
AGAAAAAGAA GAACCTACTG TGGAAACGAA AGAAGAAAAG AAAGATGAAA CTGAGGATGT
AGAAAAGAAA GACGCAGAAG TTACAGAAGT TTCAGAAAAC AAAGCAAATC TATTCGATGA
AGGACAAACG GAAACTGATG CACAAGATAC ACATGGTGAT CCTTTACTGG ATACCAGGGC
CAATGTAATA GACGAACACA TCTTAAGTGA AGTTACCGAG CTCAATGCCA TAGTTCCGTC
TTTCTGTCTT CTTGATACTT TTGGATCTAC TTGTACTTTG TATATCTACA CACATTTGAA
CGGCGAGGTT AAAGTGGACA AAGTGTCCTA CACTAAGGAA TAAGATGCTT GCACAACGAA
TTCCGGGTAC TAGTGTTGTA AATGCCTAAA ACTATTTACG CAACTATTAT AACATTGTGG
GCATACACTA TCAATACAAT ACCTGAGATA GAGTTAATTT TGAGTATTCA CTCAGAAAGA
AATTTACAGG TAGTTGCTTC TAAGATATCA AGTTCTATGA ATTCTTCGTT CCTTCATGAG
TAGTCATGGT GATAATACTA TAATATATAC ATTACAAGTA AAAATACAGT GGCAGCAATC
ATCT
 
Protein sequence
MLTLAIGDLY IPERALDLPA KFRKLLCPNP QSIPTNSKIS EVICLGNITN SVDTLKFLHD 
LSPSLHLVKG EFDDLPILSQ QLSLVSKKDE NVGIYGVITH DNLRIGFTNG YQVVPKNDPL
ALLTLARELD VDVLIWGGTH KVEAYTLDGK FFVNPGSGTG AFSFDWPEWY EEEENAKEEE
IKENEDEAKP EKLQKANVID EHILSEVTEL NAIVPSFCLL DTFGSTCTLY IYTHLNGEVK
VDKVSYTKE
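
The listed gene length follows from the start and end coordinates if they are read as 1-based and inclusive (an assumption, though it matches the 1204 bp shown). The sketch below illustrates that arithmetic and one way a GC fraction could be computed from a sequence displayed in space-separated blocks of ten. The function names are hypothetical and are not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group residues into blocks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates copied from the record above.
print(span_length(597943, 599146))

# First two 10-base blocks of the gene sequence, as displayed in the record.
sample = clean_sequence("CTTTATCTCA TTCTTCATGC")
print(len(sample), gc_fraction(sample))
```

Running `gc_fraction` on the full gene sequence after `clean_sequence` would give a value that can be compared with the rounded GC content listed in the record. Because the gene is spliced, the gene length is not expected to equal three times the protein length.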