Gene PICST_63364 details


Gene Information

Locus tag: PICST_63364
Symbol: VAN1
ID: 4840555
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 1012996
End bp: 1014273
Gene Length: 1278 bp
Protein Length: 403 aa
Translation table: 12
GC content: 46%
IMG OID: 640391870
Product: Mannan polymerase I complex VAN1 subunit (M-pol I subunit VAN1) (Vanadate resistance protein)
Protein accession: XP_001386380
Protein GI: 150866702
COG category:
COG ID:
TIGRFAM ID:
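
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (the record does not state the convention) and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1012996
end_bp = 1014273
gene_length_bp = 1278
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the protein length does not count the stop codon,
# so the coding sequence is (residues + 1 stop codon) * 3 bases.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that the coding sequence does not account for.
non_coding_bp = gene_length_bp - coding_bp

print(span_bp == gene_length_bp)
print(coding_bp, non_coding_bp)
```

Under these assumptions the coordinate span equals the Gene Length field, and the 66 bp not accounted for by coding sequence is consistent with the record marking the gene as spliced.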


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAAGATAGCT ACGAGTTTTT CCTGGTGACC AAAGATGGTG TCACACAGCA CAGTCCTGGC 
GATAACGGCC AGAATGTGGA CTTGTCCAAC GTGGCTGGTG GTGAAAATGA TGCTGGAGTC
CGGAATCTGG GCTCAGAGCC TGTAGATCAT TACGCCTCAG AAGTTGAATA TTTTGACTTG
CAGGACTACA GAGGCTCGGC TGATGGCCTT TCTCAAGGCG ATGTAGTTTT GTTTCTTATG
CCTTTGAGAA ATGCCGAAGC AGTATTGCCC ATGGCTTTCT ATAACCTCAT GAACTTGACC
TACGACCATA GTTTGATCGA CATAGCGTTT TTGGTATCCG ACTGCTCGCC AGACGACAGG
ACATTGGAAA CAGTTTTTGA CTACTCTGTT TCCTTGCAGA ACGGAACATT GGTAGACAAG
TTAAGAGCAG AGGACGAAGA AAGAAAGAAG AGTGATGTTC GTGGCTCATC TGACTTATAC
AAAAACTACA TGGACAAGAA CTATATGGAT GGAGTCCGTA GAGCTTATCT GAATCCCGAA
CATCACAAGG GCTATAGAAA GCCGTTCAGA TCGGTGTCGA TCTTCAAGAA GGACTTTGGT
CAAATCATTG GCCAAGGCTT CAGCGATAGA CATGCTGTTA AGGTCCAGGG TATTCGTCGT
AAGTTGATGG GAAGAGCCAG AAACTGGTTG ACTTCAAGTT CTTTGAAGCC GTACCATTCA
TGGGTCTATT GGAGAGATGT GGATATCGAA ACATGCCCCG GTAATGTGAT CGAAGAGTTG
ATGCAACACG ACTACGATGT GATGGTACCT AATGTGTGGA GACCTTTACC CACTTTCTTG
GATGAGCAGG AACAGGCCTA TGACTTGAAC TCGTGGATAG AATCAGACCC AGCATTGGAA
TTGGCCAAAA AATTGGACGA AGACGACGTA ATTGTAGAAG GGTACGCCGA GTATCCAACA
TGGAGAGTAC ATTTAGGATT TATAAGAGAC GCCAATGGAA ATCCTAAAGA AGTAGTCGAT
TTGGATGGTG TTGGTGGGGT GTCGATATTG GCACGAGCCC AGATCTTCAG ACAGGGAGTT
CATTTCCCAG CGTTCACATT CTTGAACCAC GCCGAGACAG AAGCATTTGG TAAGATGGCC
AAAAAAATGG GTTTCAGAGT CGGCGGTTTG CCGCATTACA CTTTATGGCA TATCTACGAA
CCAAGTGAAG ATGATCTTGA GAAAGTCTCC AAGTTGGAAA GAAAGAAGAG GAGACAGAGG
TGGAAGATCT CCAGCTGA
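
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields. The sketch below is one hedged way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied as shown.
first_line = "GAAGATAGCT ACGAGTTTTT CCTGGTGACC AAAGATGGTG TCACACAGCA CAGTCCTGGC"
seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```

Passing the full gene sequence through the same two functions gives values that can be compared with the Gene Length and GC content fields.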
 
Protein sequence
EDSYEFFSVT KDGVTQHIRN SGSEPVDHYA SEVEYFDLQD YRGSADGLSQ GDVVLFLMPL 
RNAEAVLPMA FYNLMNLTYD HSLIDIAFLV SDCSPDDRTL ETVFDYSVSL QNGTLVDKLR
AEDEERKKSD VRGSSDLYKN YMDKNYMDGV RRAYSNPEHH KGYRKPFRSV SIFKKDFGQI
IGQGFSDRHA VKVQGIRRKL MGRARNWLTS SSLKPYHSWV YWRDVDIETC PGNVIEELMQ
HDYDVMVPNV WRPLPTFLDE QEQAYDLNSW IESDPALELA KKLDEDDVIV EGYAEYPTWR
VHLGFIRDAN GNPKEVVDLD GVGGVSILAR AQIFRQGVHF PAFTFLNHAE TEAFGKMAKK
MGFRVGGLPH YTLWHIYEPS EDDLEKVSKL ERKKRRQRWK ISS
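
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG encodes serine rather than leucine. The sketch below illustrates that difference on the first 30 and last 18 bases of the gene sequence. It builds the codon tables by hand instead of using a library, and `build_table` and `translate` are hypothetical helper names.

```python
BASES = "TCAG"

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(ctg_as_serine: bool) -> dict:
    """Build a codon table; optionally reassign CTG to Ser as in table 12."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(seq: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


TABLE_1 = build_table(ctg_as_serine=False)
TABLE_12 = build_table(ctg_as_serine=True)

first_30 = "GAAGATAGCT ACGAGTTTTT CCTGGTGACC"  # start of the gene sequence
last_18 = "TGGAAGATCT CCAGCTGA"                # end of the gene sequence

print(translate(first_30, TABLE_12))
print(translate(first_30, TABLE_1))
print(translate(last_18, TABLE_12))
```

With table 12 the first 30 bases give the first ten residues of the listed protein sequence, while the standard code puts a leucine at the CTG position. Because the record marks the gene as spliced, translating the full gene sequence end to end is not expected to reproduce the protein sequence. The exon boundaries would be needed for that, and this record does not list them.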