Gene PICST_83848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83848 
SymbolPEX1 
ID4839167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1634637 
End bp1637830 
Gene Length3194 bp 
Protein Length1053 aa 
Translation table12 
GC content42% 
IMG OID640390482 
ProductAAA ATPase, peroxisomal biogenesis 
Protein accessionXP_001385011 
Protein GI150865688 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.979611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATTTAGTGAA TAACATCCAC TACTGTTGCA ATATGGAGGG CCATCGTGGA AGTATCAAGT 
TTAGGCCAAT TAAGTCCAAC TTGGTCAATC TTCCGGCCAA TTTGTCCAAT TTGCTCTACA
CTGCTAATAT CCAAATTCAG GATGTCATCA TAGAAATCGT ATCCAATGCT ACAAAGAAGA
AGCTGTATGC TGGCTGGACT GGTATGCTGT CTGCTGTTGT CCAGACTGTT GAGATCGACC
CTGTTTTTGC AGGAGCTCTT GGCTTGAAGG ACGACGAGAA AATCACCCTC AACTTGAAGA
TCGGCAACTT TGAAGCAGGT AACATCAACT TGGAGCCGGT AACGAGTTCT GATTGGGAAC
TTGTAGAGCT TCATGCTCAG GCCATAGAAG ACACATTATT GCTGCAAACT AGATGTGTTT
CTGTGGGCCA GATTTTGGTC GTGTATCCTA ATCAGACGAC TTCAGCAAAG TTGGTAGTTG
TCGATATCGG TTCTAAGGAC CATGTATACG CTAAGATTTC GCCTTATTGT GAGATAGCGA
TTGCTCCAAA AGTAAGAGAA AAGAAAGAAA CCGTAAAATC GCTGAAGAAT TTGAGTAGCT
CTAAGTCGAA TGGAGCTGCA TCTCGTGAAG ACTACTCTTC TATGCCTGTA GCTCTCAAGA
GGGGAATTTC TCTTCCTCAT AACTTGTTTT CTGATACTCA AAATAGGGGC TACGAAGTGT
ATGCCAATCT CAGTGAATTT AACCCGACTG TTGTCAGCGA CTATGTTGCT ATCTCGGTTA
TTCCTGGTCC TAACGATAAG CTACTGAAAG CCGCTCCTCA GAGTACAGAC GATTCTGATA
ACCGAGTAGC TTCTTTGAGA GAGAATAAGC GCGTCATAGC TCGCCTAGTT GACTTTCCAT
CCAGTCCCGC CAATAACGTG GGACTTTCAA CCAAGTTGGC CATAGCCTTG AACATCGAGG
GCCAGATTGG AAACATCATA GCCATAAAGC CTGCTGTAAA AAATCTTCCC AAAAGACCTA
CTACTTTCAT TGTACATCCA TATATTACAC AAACGAAGAA ATCAGGAGAA ATAAGCATCA
ACACGACAGA GAAACTGGAG TTGCAAACTA GGGCAAAGCT GTTGTCAGAG TACTTGTATA
ACGAAGGAGC TATTTATAGT TCTTCTGCAA CGAACTACAC CAAGATCCCC ATAATTCCCA
AGCTATTACC CAATGGTGGG CTCTTAAAGT TCAGAAAAAA TGACGACATC AATGCCTGGA
TCAAACCATA TAACATCGAA TCCAAGAAGC CTATCAAAAT TGAACTTGGT GAAGAGTTGT
TGCGTGCTGG TAGTTTTGTC CAACAAGACC AGAAGGAAGA AGAGCTAGAA GTAATTGGCT
TAGATGGAGT CATAGACGAA ATTATCGAGT CTTTTACCAC ATCTAAGAAT ACAGGAACCT
TGGTTTATGG TAACTCTGGC AGTGGAAAAT CACTTGTGCT CAAGTTGGCG AGCAGGAAGA
TTGCGGCCGA ACATGGGTTC TATACGAAAT ACGTCTCTTG TGATTCGCTT ATGAATGAGA
GCTTCAATCT GTTATCCAAA AACCACATTT TCAAGTGGTT GCAACAATGC TCGTGGAATA
AGCCTTCGTT GTTAATCTTG GACAATGTAG ACAAGATTCT CAGTGTTGAA AGGGAACACT
TGGACGCTTC CAAGTCTAAT CAACTAACCG AATACTTGAT TTCCAACTTA GAAAAGATTC
ACAATCAGCA TAACAGTAAT TTGTCGATAT TACTTTCTGC TTCTTCAAAG GAAGCCATCA
ATAAATTATT GATGCAATGT CATCTTATAG AAAACTTCCA CCACTTGAGC CCCCCCGACA
AGGCACTCAG ATTAGATATC CTAGACAACT ACATTGTCAA CAAGCTTGGA TGTAAGATCG
ACTTTGATCT AATGGACTTG GTAACCGAAA CCGAAGGCTA CTTGCCCAAC GATCTCAAGA
TCTTGAGTGA CAGAATCTAC CATGAAGTAC TTTTTTCTTC TCAAGATCCT TCGGCAGAGT
TGACGGTGAC TAAACAGCAC ATTGAAAAGT CTATCCAAGG ATATACTCCT TCCAATTTGC
GTGGTGTCAA GTTACAGAAA TCGACTATTA GCTGGTCTGA TATTGGAGGC TTGAAAGAGG
CTAAGAACAT CTTGTTGGAG ACTCTTGAAT GGCCAACAAA GTATGCGCCA ATCTTTGCTA
ACTGTCCGTT ACGTTTACGT TCTGGTATCT TACTCTATGG CTATCCTGGT TGCGGTAAGA
CCTTACTTGC TAGTGCCATT GCTGGTCAGT GTGGGTTGAA TTTCATTTCC ATTAAAGGTC
CAGAAATCTT GAATAAGTAT ATTGGTGCTT CAGAACAGTC TGTTCGTGAG CTCTTTGAAC
GAGCACAGGC TGCAAAGCCA TGTATTTTGT TCTTTGACGA ATTCGACTCT ATTGCACCCA
AGAGAGGCCA CGATTCTACT GGTGTTACCG ATAGAGTTGT TAACCAGATG TTGACTCAGA
TGGATGGTGC TGAAGGTCTC GATGGTGTTT ACGTCTTGGC CGCGACTTCC AGACCGGACT
TGATCGATTC TGCGTTGTTA AGACCTGGTA GATTGGACAA AAGTGTCATT TGTGACATGC
CGGACTACGA CGACAGATTG GATATCTTGA AGAGTATTAC TGATAAAATG GACTTGGCTG
ACGATGTCAA TTTGGAAGAA ATCGCTGAGA AGACTTCAGG CTTTTCAGGT GCTGATATGC
AAGGTTTGGG GTACAACGCC TACTTGAAGG GTGTCCACGT TAAGTTGGCC AAGTTGGAAC
AGGAAAACAG TGAACCTATC AAGTCTTCTG ACAATCAGGA CACAATAGAG TTTTTCCAAG
TAAACTCTGA GAAGTTAAAG AATGCTAAGT TGAGACCAGC TGATCGAATC AAGCTCTTGA
ACCAGATACA GCAATTGTTC ACAAAAGAAG AACAAAATGA AGCTGCTTCT TCAAAGAAGG
CTCAGGACGA TTCAAAAGTC TATATAACTC ATGAGAATTT CAGAGAATCG TTAGTAGAAA
CAAAACCCTC GATTTCGTAT TCTGAAAAGA GAAAGCTTGA ACGAATCTAT AGTCAGTTTG
TCTCTGCTCG AGATGGCAAT ATGCCCGATG GCAGCGCCAG TAACGAAATC GGAGGACGAA
CCACTTTAAT GTGA
 
Protein sequence
MEGHRGSIKF RPIKSNLVNL PANLSNLLYT ANIQIQDVII EIVSNATKKK SYAGWTGMSS 
AVVQTVEIDP VFAGALGLKD DEKITLNLKI GNFEAGNINL EPVTSSDWEL VELHAQAIED
TLLSQTRCVS VGQILVVYPN QTTSAKLVVV DIGSKDHVYA KISPYCEIAI APKVREKKET
VKSSKNLSSS KSNGAASRED YSSMPVALKR GISLPHNLFS DTQNRGYEVY ANLSEFNPTV
VSDYVAISVI PGPNDKLSKA APQSTDDSDN RVASLRENKR VIARLVDFPS SPANNVGLST
KLAIALNIEG QIGNIIAIKP AVKNLPKRPT TFIVHPYITQ TKKSGEISIN TTEKSELQTR
AKSLSEYLYN EGAIYSSSAT NYTKIPIIPK LLPNGGLLKF RKNDDINAWI KPYNIESKKP
IKIELGEELL RAGSFVQQDQ KEEELEVIGL DGVIDEIIES FTTSKNTGTL VYGNSGSGKS
LVLKLASRKI AAEHGFYTKY VSCDSLMNES FNSLSKNHIF KWLQQCSWNK PSLLILDNVD
KILSVEREHL DASKSNQLTE YLISNLEKIH NQHNSNLSIL LSASSKEAIN KLLMQCHLIE
NFHHLSPPDK ALRLDILDNY IVNKLGCKID FDLMDLVTET EGYLPNDLKI LSDRIYHEVL
FSSQDPSAEL TVTKQHIEKS IQGYTPSNLR GVKLQKSTIS WSDIGGLKEA KNILLETLEW
PTKYAPIFAN CPLRLRSGIL LYGYPGCGKT LLASAIAGQC GLNFISIKGP EILNKYIGAS
EQSVRELFER AQAAKPCILF FDEFDSIAPK RGHDSTGVTD RVVNQMLTQM DGAEGLDGVY
VLAATSRPDL IDSALLRPGR LDKSVICDMP DYDDRLDILK SITDKMDLAD DVNLEEIAEK
TSGFSGADMQ GLGYNAYLKG VHVKLAKLEQ ENSEPIKSSD NQDTIEFFQV NSEKLKNAKL
RPADRIKLLN QIQQLFTKEE QNEAASSKKA QDDSKVYITH ENFRESLVET KPSISYSEKR
KLERIYSQFV SARDGNMPDG SASNEIGGRT TLM