Gene PICST_52299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_52299 
SymbolVPH1 
ID4850909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp409298 
End bp411673 
Gene Length2376 bp 
Protein Length791 aa 
Translation table 
GC content43% 
IMG OID640392617 
ProductV0 domain of vacuolar H+ATPase 
Protein accessionXP_001387315 
Protein GI126273870 
COG category[C] Energy production and conversion 
COG ID[COG1269] Archaeal/vacuolar-type H+-ATPase subunit I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.303606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTG TCCAGCTATA TGTGCCTACA GAAGTGTCCC GTGACATTAT CTACAAGATT 
GGGCAATTGA ACTTAATTCA GTTTAGAGAT TTGAATCTGA AAGTCAATGA GTTCCAGCGT
TCGTTTGTCA AGGAATTGAG ACGGTTAGAC AATGTAGAAC GTCAATTCAA TCGTTTCAAG
AAGGAATTGG ATCAAAGAGA CATTCCTGTC AAGACTTTTC CCTATGAATC GCTGCCCATT
GTGCCTCAGT CAGATATTGA TGAGCATGTA GAGAATGCTC AGATCTTGGA GGATCGACTT
TTACAATTGA TTGACTCGAC CAATTCGCTC TATGAAAAGC AGAAGGAATT GAAACAGTTC
AAAGCAACAA TTCAGGGTGT AGACAATTTC TTTGTAGTCA ATGCTGGGCC TCAGCTGGAG
ACCAGTGAAG AATCAGCGTT ACTTTCGCAA TTGGAATCGC AGGCGCAAGA AGCTTCACAT
GGCTCGTTTA TCAGTGGAGT TATCTCTCGT GAAAAGGTTG GAACTTTACA GCAGATCTTG
TGGAGAATTT TGAGAGGTAA CTTGTACTAT CATAGTGAAG AGTTGGCTGA GCCAGTATAC
GAGGTTCATT CCAACGAATA TGTGAACAAG AACTCTTTTA TCATCTTTTC GCATGGTGCT
ATCATTTACG ACCGTATAAA GAAGATCTGC GAGTCGTTGG ATGCGGACAT CTACGATGTG
GATGCCACTG TATCGTTGCG TTCCGACCAG TTGGCTGAAA CAAACATGAA GTTGGCTGAC
TTATCTGCGG TTCTCACTCA ATCTGAAAAC GCATTATCTT CTGAATTGAT TGCTATATCA
AGAGACTTGG CTAAATGGTG GGAAGTTATT GCTCGCGAAA AGGCCGTTTA TCAATCTATG
AATCTTTGCG ATTACGACGA CTCCCGAAAG ACGTTAGTAG CTGAGGGCTG GATTCCTACG
GATGAGATCT CGAACTTGAC GACTACAATC AAGGGTTCCG ACGATTCACA ATCGATCCCT
ACCATCATCA ACGTTCTTGA AACAACGAGA ACGCCTCCTA CTTTCCATCG AACCAACAAG
TTCACTGATG CCTTCCAGAA CATATGTGAC GCCTACGGTA TAGCCACGTA CCGTGAAGTG
AATCCTGGTT TACCTACCGT CATCACATTT CCATTTATGT TTGCTATCAT GTTTGGAGAT
TTGGGCCACG GTTTCATCTT GACATTAGTA GCTTTAGCAT TGGTGTTGAA CGAAAAGAAA
TTAGGAGCTT CCAAGCATGA CGAGATCTTT GACATGGCCT TCAGCGGTCG TTACATCTTG
TTGTTGATGG GAATCTTCTC GATGTATACC GGTTTGTTAT ACAACGATAT CTTTTCTAGA
TCTATGACGC TTTTCAGCTC CGGCTGGGAG TGGCCCGAGA AGTTTGCTAT TGGTGAAACT
GTGTTGGCCA AACAGGTTGG AACGTACATC TTTGGTTTGG ATCCAGCATG GCATGGATCT
GAGAATGCGT TGTTGTTTTC TAATTCCTAC AAGATGAAGT TGTCGATATT AATGGGTTAC
ACCCACATGT CGTACTCGTA CATCTTCTCA TTGGTGAACT ACATCCATTT CAAGAGTGTC
ATAGACATTG TTGGAAACTT CATTCCAGGC TTGTTATTCA TGCAAGGTAT ATTTGGGTAC
TTGTCGTTGT GCGTAGTATA CAAATGGACT GTAAACTGGT ATGCCATCGA TAAACAGCCC
CCAGGGTTGT TGAACATGTT GATTTCCATG TTCTTGTCAC CTGGAAATGT TGCTGAGCCA
TTGTACGAAG GTCAAGCCAG TATTCAAGTT TTTCTCTTGT TGGTAGCTTT GATTTGTGTT
CCATGGTTAT TGTTACTTAA GCCCTTGTAC TTGAAGAGAC AATTGGACAA AGCAGCTGCC
GAGTACCAGG AATTGCCTAC TGACGAAGAT GAATTGGAGG AAGGTGATGC TGCAGCTCAC
GATGACGATG AACCTCATGA AGAACACAAC TTTGGCGATA TCATGATTCA TCAGGTCATT
CACACAATCG AGTTCTGTTT GAATTGTGTA TCTCATACGG CATCTTATTT GAGATTGTGG
GCATTGTCTT TGGCCCATGC TCAATTGTCT ACTGTCTTGT GGACTATGAC CATTGGAGGC
TCTTTTGGTG CTACTGGTGC TCTAGGAGTA TTCATGACTG TCTTCTTATT TGCAATGTGG
TTCTCATTGA CGGTCTGTAT CTTGGTAGTA ATGGAAGGAA CTTCGGCCAT GTTGCACTCT
TTGAGGTTGC ACTGGGTCGA GTCTATGTCC AAGTTCTTCC AGGGTGAAGG TACATTATAC
GAGCCATTTG GCTTTAAGAA CTTGATTGAC CTCTAG
 
Protein sequence
MSLVQLYVPT EVSRDIIYKI GQLNLIQFRD LNLKVNEFQR SFVKELRRLD NVERQFNRFK 
KELDQRDIPV KTFPYESLPI VPQSDIDEHV ENAQILEDRL LQLIDSTNSL YEKQKELKQF
KATIQGVDNF FVVNAGPQLE TSEESALLSQ LESQAQEASH GSFISGVISR EKVGTLQQIL
WRILRGNLYY HSEELAEPVY EVHSNEYVNK NSFIIFSHGA IIYDRIKKIC ESLDADIYDV
DATVSLRSDQ LAETNMKLAD LSAVLTQSEN ALSSELIAIS RDLAKWWEVI AREKAVYQSM
NLCDYDDSRK TLVAEGWIPT DEISNLTTTI KGSDDSQSIP TIINVLETTR TPPTFHRTNK
FTDAFQNICD AYGIATYREV NPGLPTVITF PFMFAIMFGD LGHGFILTLV ALALVLNEKK
LGASKHDEIF DMAFSGRYIL LLMGIFSMYT GLLYNDIFSR SMTLFSSGWE WPEKFAIGET
VLAKQVGTYI FGLDPAWHGS ENALLFSNSY KMKLSILMGY THMSYSYIFS LVNYIHFKSV
IDIVGNFIPG LLFMQGIFGY LSLCVVYKWT VNWYAIDKQP PGLLNMLISM FLSPGNVAEP
LYEGQASIQV FLLLVALICV PWLLLLKPLY LKRQLDKAAA EYQELPTDED ELEEGDAAAH
DDDEPHEEHN FGDIMIHQVI HTIEFCLNCV SHTASYLRLW ALSLAHAQLS TVLWTMTIGG
SFGATGALGV FMTVFLFAMW FSLTVCILVV MEGTSAMLHS LRLHWVESMS KFFQGEGTLY
EPFGFKNLID L