Gene PICST_32083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32083 
SymbolVCP1 
ID4839105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp640696 
End bp643194 
Gene Length2499 bp 
Protein Length832 aa 
Translation table12 
GC content40% 
IMG OID640390420 
ProductAAA ATPase 
Protein accessionXP_001384792 
Protein GI150865535 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.190339 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAGA AACGGCCTGT GACGAATAGC CTTGACGTGA AGATATACGA GTTAATTTGT 
GAATTGCTCG ATGAGACTAC AGCTGAAAAT TTGAAGAGCT TACCTCAATC TGATGACAAC
ATCTCATACA TTGCACTCGC CAAAGATATT ACTATTTCCC AGGCACTTGA TTATGTTCAG
TCTAAGGACT ATAAACTAAA GAGGATAAAG AAGAATTTGT TGGAAAAGTC AATAATAGCG
TCTCTTCAAA CACTACGAAC TGAAGAGGAA GAGACTATGG GGACAATTGA TAGTTCAACA
AAGACAGATG ATAGTGATGC CGAACTTTAT GGTGAAGGTA GATTGATGGA AGTAAAAGAT
TCTAATTCAT TGAACAAGTC TGTTGTAACT GCTTGGAAAT TGAATAATTC AATTGCAAAA
AGTGAGTTAG CCAACGAAGT GCTTGATCCA AATGCAGAAG AGCTGGTAGA AGATGTTGCA
AAACAATCGA AAAAGAGAAC AAAAGATTCA TCCAAACAAG TTACTAAGAA ACACAGGAGC
AAAATAGATC ATACTCCTCC AAGTTTAACA TTGTCTTCCT TAGGTGGATT GGACAGTACG
ACTACTCAAC TAATGGAATT GATTGGGTTG CCTATCCTTC ATCCGGAGAT CTACACGTCT
ACTGGTGTAG AACCTCCTCG TGGTGTCTTG CTCTACGGTC CTCCTGGATG CGGTAAGACC
ACTATTGCCA ATGCACTTGC TGGTGAATTG CAGGTCCCAT TTATAAACAT TTCAGCTCCA
TCTATAGTGT CTGGGATGTC TGGCGAATCA GAAAAAAAAT TAAGGGAAAT ATTTGATGAA
GCTAAAACGT TGGCGCCTTG CATCATCTTT ATGGATGAAA TTGATGCAAT CACACCTAAA
AGAGATGGAG GTGCTCAGAG GGAAATGGAA AGGCGAATTG TTGCTCAATT ATTGACTTTG
ATGGATGAAT TATCGTTAGG CAAGACCGAA AAACCAGTGA TTGTAGTAGG AGCTACTAAT
CGACCCGATT CGCTTGATGC TGCCTTGAGA AGAGCCGGCA GATTTGATAG AGAAATATGT
CTCAATGTCC CTAACGAGGA CGAAAGACTC TCGATATTGA AAAAGATGAC CAGCAATATT
AAGTTAGAAA ATGGAGACTT CAATTTCCGA GAGTTAGCAA AAATGACTCC AGGATATGTA
GGAGCTGATT TGAAGAGTTT GGTGACTGCT GCTGGAATTT CCGCAATAAA GAGAATATTC
GAAAGTCTCA GTGAGCAGCA AGAGGAATTA CTAGTTGCAG ATTCCAATGC AAACAGTTTT
GATTCAATGG ATGTTGATAG TGCAACTTTC ATTCCTGACA ACCAGTCGCT CGTAAAATTT
GAAGGAAAGA CGGATACCGA AAAATTATCT ACGATTCAAA AGTTTTTAAT TAAGCATCCC
AACCCATTGA GCGACGAACA ATTAAACCCA TTAGCAATTT CTTATTCTGA TTTCTTGGAA
GCACTTCCAA CGGTACAACC AACAGCCAAG AGGGAAGGAT TTGCTACTGT ACCAGATGTG
ACATGGAAGA GTGTCGGAGC CTTACACAAG GTGAGAATGG AGCTACACAT GTGTGTGGTT
CAGCCTATTA AAAAACCAGA ACTTTACTTA AAAGTTGGTA TCACCGCCCC AGCCGGTGTC
TTGATGTGGG GACCGCCAGG TTGTGGGAAG ACATTATTAG CAAAAGCTGT CGCTAATGAA
TCTCGAGCTA ATTTCATATC TATCAAGGGT CCAGAATTAC TCAACAAGTA CGTAGGTGAA
TCCGAGAGAG CAATTAGACA AGTCTTCCTG AGAGCAAGAG CTTCCATTCC GTGTATTATA
TTTTTTGATG AATTGGATGC TTTAGTTCCA AGAAGGGATG CATCTTTATC TGAATCTAGT
TCACGAGTAG TGAACACATT ACTTACCGAA TTGGACGGGT TGAATGATCG AAAGGGTGTG
TTTGTAATTG GAGCCACTAA CAGACCAGAC ATGATAGATC CTGCCATGTT GCGTCCCGGC
AGATTGGACA AGACTTTGTA TATCGAGCTC CCTTCTGCTG AAGAAAGACT TGAAATATTG
AAAACTCTTA TTAATGCTAA CAAGACCCCC GTAAGTGTTG ACGTGGACTT GAATTCAATA
GCAAATGATA ACAGGTGCAG AAACTTCTCC GGTGCAGATT TGTCTTCTCT AGTAAGAGAA
GCTGGAGTCT TAGCTCTTAA AAAGAAATTC TTTCAAAATC AAAAAATCGA CGATTTGGAT
GCATCTGGCT ACTACGAGAA TGAAAATGTT GATGACCAAG TGGAAGTGAC TCAACAAGAC
TTTAATAGAG CCCTTTCCAA TGTTCATCCC AGTGTGAGCG ACAAAGATAG AATGAAATAC
GAAAAGTTAA ACAAGAGAAT GGGGTGGAGT GTTATCGAGG ATTCTGAAGT TATAGAGCCA
ATAGAGCCTA CAGCGAACAC TCCAAATGCA AGTGTATAA
 
Protein sequence
MSKKRPVTNS LDVKIYELIC ELLDETTAEN LKSLPQSDDN ISYIALAKDI TISQALDYVQ 
SKDYKLKRIK KNLLEKSIIA SLQTLRTEEE ETMGTIDSST KTDDSDAELY GEGRLMEVKD
SNSLNKSVVT AWKLNNSIAK SELANEVLDP NAEESVEDVA KQSKKRTKDS SKQVTKKHRS
KIDHTPPSLT LSSLGGLDST TTQLMELIGL PILHPEIYTS TGVEPPRGVL LYGPPGCGKT
TIANALAGEL QVPFINISAP SIVSGMSGES EKKLREIFDE AKTLAPCIIF MDEIDAITPK
RDGGAQREME RRIVAQLLTL MDELSLGKTE KPVIVVGATN RPDSLDAALR RAGRFDREIC
LNVPNEDERL SILKKMTSNI KLENGDFNFR ELAKMTPGYV GADLKSLVTA AGISAIKRIF
ESLSEQQEEL LVADSNANSF DSMDVDSATF IPDNQSLVKF EGKTDTEKLS TIQKFLIKHP
NPLSDEQLNP LAISYSDFLE ALPTVQPTAK REGFATVPDV TWKSVGALHK VRMELHMCVV
QPIKKPELYL KVGITAPAGV LMWGPPGCGK TLLAKAVANE SRANFISIKG PELLNKYVGE
SERAIRQVFS RARASIPCII FFDELDALVP RRDASLSESS SRVVNTLLTE LDGLNDRKGV
FVIGATNRPD MIDPAMLRPG RLDKTLYIEL PSAEERLEIL KTLINANKTP VSVDVDLNSI
ANDNRCRNFS GADLSSLVRE AGVLALKKKF FQNQKIDDLD ASGYYENENV DDQVEVTQQD
FNRALSNVHP SVSDKDRMKY EKLNKRMGWS VIEDSEVIEP IEPTANTPNA SV