Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83848 |
Symbol | PEX1 |
ID | 4839167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1634637 |
End bp | 1637830 |
Gene Length | 3194 bp |
Protein Length | 1053 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390482 |
Product | AAA ATPase, peroxisomal biogenesis |
Protein accession | XP_001385011 |
Protein GI | 150865688 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.979611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTTAGTGAA TAACATCCAC TACTGTTGCA ATATGGAGGG CCATCGTGGA AGTATCAAGT TTAGGCCAAT TAAGTCCAAC TTGGTCAATC TTCCGGCCAA TTTGTCCAAT TTGCTCTACA CTGCTAATAT CCAAATTCAG GATGTCATCA TAGAAATCGT ATCCAATGCT ACAAAGAAGA AGCTGTATGC TGGCTGGACT GGTATGCTGT CTGCTGTTGT CCAGACTGTT GAGATCGACC CTGTTTTTGC AGGAGCTCTT GGCTTGAAGG ACGACGAGAA AATCACCCTC AACTTGAAGA TCGGCAACTT TGAAGCAGGT AACATCAACT TGGAGCCGGT AACGAGTTCT GATTGGGAAC TTGTAGAGCT TCATGCTCAG GCCATAGAAG ACACATTATT GCTGCAAACT AGATGTGTTT CTGTGGGCCA GATTTTGGTC GTGTATCCTA ATCAGACGAC TTCAGCAAAG TTGGTAGTTG TCGATATCGG TTCTAAGGAC CATGTATACG CTAAGATTTC GCCTTATTGT GAGATAGCGA TTGCTCCAAA AGTAAGAGAA AAGAAAGAAA CCGTAAAATC GCTGAAGAAT TTGAGTAGCT CTAAGTCGAA TGGAGCTGCA TCTCGTGAAG ACTACTCTTC TATGCCTGTA GCTCTCAAGA GGGGAATTTC TCTTCCTCAT AACTTGTTTT CTGATACTCA AAATAGGGGC TACGAAGTGT ATGCCAATCT CAGTGAATTT AACCCGACTG TTGTCAGCGA CTATGTTGCT ATCTCGGTTA TTCCTGGTCC TAACGATAAG CTACTGAAAG CCGCTCCTCA GAGTACAGAC GATTCTGATA ACCGAGTAGC TTCTTTGAGA GAGAATAAGC GCGTCATAGC TCGCCTAGTT GACTTTCCAT CCAGTCCCGC CAATAACGTG GGACTTTCAA CCAAGTTGGC CATAGCCTTG AACATCGAGG GCCAGATTGG AAACATCATA GCCATAAAGC CTGCTGTAAA AAATCTTCCC AAAAGACCTA CTACTTTCAT TGTACATCCA TATATTACAC AAACGAAGAA ATCAGGAGAA ATAAGCATCA ACACGACAGA GAAACTGGAG TTGCAAACTA GGGCAAAGCT GTTGTCAGAG TACTTGTATA ACGAAGGAGC TATTTATAGT TCTTCTGCAA CGAACTACAC CAAGATCCCC ATAATTCCCA AGCTATTACC CAATGGTGGG CTCTTAAAGT TCAGAAAAAA TGACGACATC AATGCCTGGA TCAAACCATA TAACATCGAA TCCAAGAAGC CTATCAAAAT TGAACTTGGT GAAGAGTTGT TGCGTGCTGG TAGTTTTGTC CAACAAGACC AGAAGGAAGA AGAGCTAGAA GTAATTGGCT TAGATGGAGT CATAGACGAA ATTATCGAGT CTTTTACCAC ATCTAAGAAT ACAGGAACCT TGGTTTATGG TAACTCTGGC AGTGGAAAAT CACTTGTGCT CAAGTTGGCG AGCAGGAAGA TTGCGGCCGA ACATGGGTTC TATACGAAAT ACGTCTCTTG TGATTCGCTT ATGAATGAGA GCTTCAATCT GTTATCCAAA AACCACATTT TCAAGTGGTT GCAACAATGC TCGTGGAATA AGCCTTCGTT GTTAATCTTG GACAATGTAG ACAAGATTCT CAGTGTTGAA AGGGAACACT TGGACGCTTC CAAGTCTAAT CAACTAACCG AATACTTGAT TTCCAACTTA GAAAAGATTC ACAATCAGCA TAACAGTAAT TTGTCGATAT TACTTTCTGC TTCTTCAAAG GAAGCCATCA ATAAATTATT GATGCAATGT CATCTTATAG AAAACTTCCA CCACTTGAGC CCCCCCGACA AGGCACTCAG ATTAGATATC CTAGACAACT ACATTGTCAA CAAGCTTGGA TGTAAGATCG ACTTTGATCT AATGGACTTG GTAACCGAAA CCGAAGGCTA CTTGCCCAAC GATCTCAAGA TCTTGAGTGA CAGAATCTAC CATGAAGTAC TTTTTTCTTC TCAAGATCCT TCGGCAGAGT TGACGGTGAC TAAACAGCAC ATTGAAAAGT CTATCCAAGG ATATACTCCT TCCAATTTGC GTGGTGTCAA GTTACAGAAA TCGACTATTA GCTGGTCTGA TATTGGAGGC TTGAAAGAGG CTAAGAACAT CTTGTTGGAG ACTCTTGAAT GGCCAACAAA GTATGCGCCA ATCTTTGCTA ACTGTCCGTT ACGTTTACGT TCTGGTATCT TACTCTATGG CTATCCTGGT TGCGGTAAGA CCTTACTTGC TAGTGCCATT GCTGGTCAGT GTGGGTTGAA TTTCATTTCC ATTAAAGGTC CAGAAATCTT GAATAAGTAT ATTGGTGCTT CAGAACAGTC TGTTCGTGAG CTCTTTGAAC GAGCACAGGC TGCAAAGCCA TGTATTTTGT TCTTTGACGA ATTCGACTCT ATTGCACCCA AGAGAGGCCA CGATTCTACT GGTGTTACCG ATAGAGTTGT TAACCAGATG TTGACTCAGA TGGATGGTGC TGAAGGTCTC GATGGTGTTT ACGTCTTGGC CGCGACTTCC AGACCGGACT TGATCGATTC TGCGTTGTTA AGACCTGGTA GATTGGACAA AAGTGTCATT TGTGACATGC CGGACTACGA CGACAGATTG GATATCTTGA AGAGTATTAC TGATAAAATG GACTTGGCTG ACGATGTCAA TTTGGAAGAA ATCGCTGAGA AGACTTCAGG CTTTTCAGGT GCTGATATGC AAGGTTTGGG GTACAACGCC TACTTGAAGG GTGTCCACGT TAAGTTGGCC AAGTTGGAAC AGGAAAACAG TGAACCTATC AAGTCTTCTG ACAATCAGGA CACAATAGAG TTTTTCCAAG TAAACTCTGA GAAGTTAAAG AATGCTAAGT TGAGACCAGC TGATCGAATC AAGCTCTTGA ACCAGATACA GCAATTGTTC ACAAAAGAAG AACAAAATGA AGCTGCTTCT TCAAAGAAGG CTCAGGACGA TTCAAAAGTC TATATAACTC ATGAGAATTT CAGAGAATCG TTAGTAGAAA CAAAACCCTC GATTTCGTAT TCTGAAAAGA GAAAGCTTGA ACGAATCTAT AGTCAGTTTG TCTCTGCTCG AGATGGCAAT ATGCCCGATG GCAGCGCCAG TAACGAAATC GGAGGACGAA CCACTTTAAT GTGA
|
Protein sequence | MEGHRGSIKF RPIKSNLVNL PANLSNLLYT ANIQIQDVII EIVSNATKKK SYAGWTGMSS AVVQTVEIDP VFAGALGLKD DEKITLNLKI GNFEAGNINL EPVTSSDWEL VELHAQAIED TLLSQTRCVS VGQILVVYPN QTTSAKLVVV DIGSKDHVYA KISPYCEIAI APKVREKKET VKSSKNLSSS KSNGAASRED YSSMPVALKR GISLPHNLFS DTQNRGYEVY ANLSEFNPTV VSDYVAISVI PGPNDKLSKA APQSTDDSDN RVASLRENKR VIARLVDFPS SPANNVGLST KLAIALNIEG QIGNIIAIKP AVKNLPKRPT TFIVHPYITQ TKKSGEISIN TTEKSELQTR AKSLSEYLYN EGAIYSSSAT NYTKIPIIPK LLPNGGLLKF RKNDDINAWI KPYNIESKKP IKIELGEELL RAGSFVQQDQ KEEELEVIGL DGVIDEIIES FTTSKNTGTL VYGNSGSGKS LVLKLASRKI AAEHGFYTKY VSCDSLMNES FNSLSKNHIF KWLQQCSWNK PSLLILDNVD KILSVEREHL DASKSNQLTE YLISNLEKIH NQHNSNLSIL LSASSKEAIN KLLMQCHLIE NFHHLSPPDK ALRLDILDNY IVNKLGCKID FDLMDLVTET EGYLPNDLKI LSDRIYHEVL FSSQDPSAEL TVTKQHIEKS IQGYTPSNLR GVKLQKSTIS WSDIGGLKEA KNILLETLEW PTKYAPIFAN CPLRLRSGIL LYGYPGCGKT LLASAIAGQC GLNFISIKGP EILNKYIGAS EQSVRELFER AQAAKPCILF FDEFDSIAPK RGHDSTGVTD RVVNQMLTQM DGAEGLDGVY VLAATSRPDL IDSALLRPGR LDKSVICDMP DYDDRLDILK SITDKMDLAD DVNLEEIAEK TSGFSGADMQ GLGYNAYLKG VHVKLAKLEQ ENSEPIKSSD NQDTIEFFQV NSEKLKNAKL RPADRIKLLN QIQQLFTKEE QNEAASSKKA QDDSKVYITH ENFRESLVET KPSISYSEKR KLERIYSQFV SARDGNMPDG SASNEIGGRT TLM
|
| |