Gene Information |
Locus tag | PICST_45980 |
Symbol | |
ID | 4838390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1592320 |
End bp | 1595862 |
Gene Length | 3543 bp |
Protein Length | 1180 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389705 |
Product | predicted protein |
Protein accession | XP_001384609 |
Protein GI | 150865406 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
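The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not say so, but this assumption reproduces the reported gene length) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1592320
end_bp = 1595862

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported "Gene Length" and "Protein Length" fields.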
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00372447 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0355887 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCT ACAAAAACCC CAAGTCTCCG GCTAAGTTGA AACAGCAGAT CGTACTTCCC ACCTGTAAAC TCGACTCCAA CTTGGTTTTG TTGCCAGGGA TTATATACAA CGTTACGTTT TCTCGTTTCA AAGCTGCTGC GTTGTTGTAC CGTTACAAGG ATCTTGTATC GCAAGTCTCA ATAATAAACA ACTTGCTCAA CGAATATGAG TTCAATCCGT CTAAAGATTC GGACTCAGTT GAAGAAGACG ATATGGTCAC AAGTCCTATC ACTATCTCAA AGGTCGCTGT GGAAGGTATT GAACAGTTTT TCAAATACGA AGCAGCATTC AAGAACAGCC AGGGCTTAGT TTCGGAAAAG GATATTGCTG AAGTTGCACC CTCCAACGAG TTTGACTGGT TGACCCTTGC TATCAAGCCC AATTTGGAAA AGATCAAAGA GCCTAGTAAT GCTCAAATTG ATCCAACTGA ACACAACAGT GTGGTCACTA TAGCCAGAAT CGTAGGAATC GTAGACGACA CCACCAACAT CAAGTTGACG TTGCAGGCTA TAACCAGAGG ATTGAAGATT GCACCCAAGA AGAAAACAAG GCCAAACGAA CAACTCTTAG AAGTTGACTG GAGCTCAGAC ATCCCCGAGT TGAGGCGCCA TTTTAAGTCT TTGAAGGATA GCAGTCTCGA TTTGTTCAAG GTAATCGACA AATTTATTGT AGACTATCGT CAAGCTTTGA GCATCAACTC TGCCAATGGA AACAAGTCAA ACCTCCAGAT AACTAAACCG GGATCAAGAT ATAAAGGAGC CAATGGATCC AGCCAGAAAC CTGGTGATTT GCTTACGTTG AATCCTTTGG CCAATGCTTT GTATCTCCAG TTGGCTGGTT CGAAGGATTT CTCCAAGGCA TTTCTTAGTT TGCAGAAGTT ATATGGCCAG TTTGCTTCTG ATGAAAACTT GAAGGTTGAC ACGAAGTCAT ATTTGAGACT ACTTGATTTG ACTTGTGGAA TCTTGCCATT TCCCAACCAC GAGAAGTTGA AATTGCTCCA TAAAATTAGC ATTGATGACA GAGGCAATGA GTTGATCAAC ATGATAAACC AGCTAATCAA GATCTTTGAC ACTTTGGATG GTAACAACTC GTTTGTGAAC CATTGGTTCT ATAACGAAGC AACCAATATC CAGAAAGCCA ATGTCGTAGC CAACCAGCTC AAATCTATCA GGTTGCTCTT AGAAGGAATG ACGAATAAGA CCAGACCAAT CAGCAACAGA GGCAATATCA AATCGTTCAA CAACAGTGAA AATGGCAATA ACAATAAAAC TAATGGTTCT GGAATCACAA GTCGTCGTCC TAAGTCAAAT GAGGATGGAG GAGAAGTCTA TGATGAAGAA GATGATGATG AAGAAGATGA TGAGTTAAGA GCCATCACTA ACTTCATCAA GTACAAGTTG CCAAATATCA CGACTTTAAG TCCTGACTCC AAGAGATTGA TTATCAAGGA TTTCAAAAGA ATTAGAGCCT CTTCACAATC TCCTGGAGGA GGTGGTAACT CTGATTTCCA CGTGATTAGA AACTATTTGG AGATCGTCAT GGATATTCCT TGGGACAAGT ACGTCACTAA GTTCAAGTCC AACAAAGATA TTGACTTGAA CTTTGCCAAG AAACAGTTGG ACGATGACCA CTACGGTTTG GAACATGTCA AGAAGAGATT GATTCAGTAC TTGGTTGTGT TGAAGCTTTT AGGAATTAAC GCGGAAAAGC AGATTAGCGA TTTCAGGAAA GAAAATCAAG TTCCTCTGCC TTCTTCTAGT 
GGACTGAATC TAGCCACTCA AAATAGTCTT GTGCCAGCTT CTTCTATTGT GATTGCCAAC AACGACGAAA CCTCATTTGC TCACAAACAA GCACAGAACA AGGTAAAGAC TTCGATCAAG GAAAGCAATA TAGAAAATCA AACCAACCAG TCGATTCAAG TTACCAAATA CAACAAGTCG CCTATTATAA TGTTGGCTGG GCCACCTGGT ACAGGTAAGA CTTCGTTGGC CAAATCTATT GCCAGTTCTT TAGGAAGAAA CTTCCAGAGG ATCTCGTTGG GTGGTGTCAA GGACGAGAGC GAGATCAGAG GTCATAGAAG AACCTATGTA GGAGCCATGC CTGGTTTGAT TATCCAAGCT TTGAGAAAAT CTAGATCAAT GAACCCAGTG ATTTTACTTG ATGAAATCGA CAAAGTCATT GGTGGAAGCT CTGGAGTGAA CAAGTTCAAT GGAGATCCAT CAGCTGCACT TTTGGAGGTT TTGGATCCAG AACAGAACAC TTCGTTTATT GATCACTACC TTGGTTTCCC CGTAGACTTG TCTCAGGTGA TTTTTATCTG CACTGCCAAT GAGCCTCATA ACTTGACCAG GCCGTTGTTG GATCGTCTTG AGATGATTGA AGTTAGCGCC TATGACTACA ACGAAAAGTT AATTATTGGA AGAAAGTACT TGTTACCAAG GCAGGTGAAG AGAAATGGGT TTCCTGCTTC TGATCGTATC GAGGAGTTTG TCAACATTGA CGATGCCCTG ATGAAGAAGA TCATTGTAGA TTATACGAGA GAAGCCGGAG TGAGAAACCT CGAGCGAAAG TTGGGAACGA TTTGTCGTTT TAAAGCAGTG GAATACTGTG AAGGACTTTC TGGCAAGAGT TTCTACAACC CCAATGTTGA GGAGGCAGAC TTGCCTAAGT ATTTGGGTAT TCCATACAGC TCTGGAGACT TTTCTTCTAT AGAAACAACC ATTTCGAACA ACAGTAGAGT TGGTATAGTT AATGGTTTGT CATATAACTC GGACGGATCT GGATCGGTCT TGGTATTTGA GACTATTGGT TTCGACAAGA GAGTGGGAAA TCCAAATAGC TCCAACACAG GCTGCTCTTT GGTTATGACA GGTAGATTGG GTGAAGTTCT CATGGAAAGT GGAAAAATTG GTTTGACTTT CATCAAACTG TTGATCTATA AGAACTTGAT ACAAGCCAAA GAGCAACCCG ACGATAAGTA TTTGATTGAA AAGTTCAACA ACTTAGAGCT TAACTTACAT GTACCAATGG GATCGATTTC TAAAGATGGT CCTTCTGCCG GTATTACAAT GGCAACACTG TTCTTGTCGG TAATACTTGA TAAGCCAGTT CCAGCGGATG TCGCCATGAC TGGAGAAATC ACCTTGAGAG GTTTGGTCTT ACCCATTGGT GGTGTTAAGG AAAAGATGAT GGGTGCTCAT TTGAATGGAA ACATCAGGCG AATGATTGTG CCTCGTGAAA ATCGAAAAGA TTTGATTGAA GAGTTCAGCA GAAGTGTAGA AGAGGCTGGA GACGTTGTTG ATTCCAACTT GATGAACGAA TTGTTGAAAG ACAATGAGGA AGCTGATTTC AAGATGGATA AGGTTGAAAA ATTCTATTTG AAGAGGTATG GCATTCAAAT CTTTTATGCC CGTGAATTCT ATGATGTAAT GAAGATTCTC TGGGGTGAAG ACGATTTGTT GACGAAACCG AAGTCCAACA GAATCCTCGA ATACCACTTG TAA
|
Protein sequence | MARYKNPKSP AKLKQQIVLP TCKLDSNLVL LPGIIYNVTF SRFKAAALLY RYKDLVSQVS IINNLLNEYE FNPSKDSDSV EEDDMVTSPI TISKVAVEGI EQFFKYEAAF KNSQGLVSEK DIAEVAPSNE FDWLTLAIKP NLEKIKEPSN AQIDPTEHNS VVTIARIVGI VDDTTNIKLT LQAITRGLKI APKKKTRPNE QLLEVDWSSD IPELRRHFKS LKDSSLDLFK VIDKFIVDYR QALSINSANG NKSNLQITKP GSRYKGANGS SQKPGDLLTL NPLANALYLQ LAGSKDFSKA FLSLQKLYGQ FASDENLKVD TKSYLRLLDL TCGILPFPNH EKLKLLHKIS IDDRGNELIN MINQLIKIFD TLDGNNSFVN HWFYNEATNI QKANVVANQL KSIRLLLEGM TNKTRPISNR GNIKSFNNSE NGNNNKTNGS GITSRRPKSN EDGGEVYDEE DDDEEDDELR AITNFIKYKL PNITTLSPDS KRLIIKDFKR IRASSQSPGG GGNSDFHVIR NYLEIVMDIP WDKYVTKFKS NKDIDLNFAK KQLDDDHYGL EHVKKRLIQY LVVLKLLGIN AEKQISDFRK ENQVPSPSSS GSNLATQNSL VPASSIVIAN NDETSFAHKQ AQNKVKTSIK ESNIENQTNQ SIQVTKYNKS PIIMLAGPPG TGKTSLAKSI ASSLGRNFQR ISLGGVKDES EIRGHRRTYV GAMPGLIIQA LRKSRSMNPV ILLDEIDKVI GGSSGVNKFN GDPSAALLEV LDPEQNTSFI DHYLGFPVDL SQVIFICTAN EPHNLTRPLL DRLEMIEVSA YDYNEKLIIG RKYLLPRQVK RNGFPASDRI EEFVNIDDAS MKKIIVDYTR EAGVRNLERK LGTICRFKAV EYCEGLSGKS FYNPNVEEAD LPKYLGIPYS SGDFSSIETT ISNNSRVGIV NGLSYNSDGS GSVLVFETIG FDKRVGNPNS SNTGCSLVMT GRLGEVLMES GKIGLTFIKS LIYKNLIQAK EQPDDKYLIE KFNNLELNLH VPMGSISKDG PSAGITMATS FLSVILDKPV PADVAMTGEI TLRGLVLPIG GVKEKMMGAH LNGNIRRMIV PRENRKDLIE EFSRSVEEAG DVVDSNLMNE LLKDNEEADF KMDKVEKFYL KRYGIQIFYA REFYDVMKIL WGEDDLLTKP KSNRILEYHL
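The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below shows how that affects translation of the gene sequence. It is an illustration rather than the database's own pipeline: the `translate` helper and `codon_table` name are hypothetical, the table is built from the standard code with only the CTG reassignment applied, and the two test fragments are short excerpts copied from the sequences above.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
codon_table = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), STANDARD)
}

# Translation table 12: CTG codes for serine instead of leucine.
codon_table["CTG"] = "S"


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = codon_table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCGCGCT ACAAAAACCC CAAGTCTCCG"))

# In-frame fragment of the gene sequence containing a CTG codon.
print(translate("GTTCCTCTGCCT"))
```

The first call reproduces the opening residues of the protein sequence, and the second yields a serine at the CTG codon, as in the corresponding stretch of the listed protein.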
|
| |