Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_64463 |
Symbol | |
ID | 4840951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 529427 |
End bp | 532234 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392266 |
Product | predicted protein |
Protein accession | XP_001386694 |
Protein GI | 150866932 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.658129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGCAATAACA ATAATAACAA TAACAACAAC AATAACGATA ACGATGAGCC TAACGAGATA GTAACGAACG CCGGTACAGG CTTGTATCCT CCGCTATTGG CCATTCCAAT GAAGGATCGT CCCCCTCTTC CCGGACGTCC TTTTGCTATC AATATTACGG ATCCAGAAGT GATCAGGTCG ATCTACACCA TCATCGACAA GAGGGAGCCC TACTTTGTAT TGTTCCACGT CAAAGATCCA AATGAAGGAG ATACTGATGT CATCAATAGT AAGGATTCAG TGTACAACAT TGGTGTACAC TGTCAAATTA TCAGACACAC GACTCCAAGA CCAGGAGTGT TCAATGTCTT GGGGTATCCT CTTGAAAGAT GTCTGTTGGC TGACCTTAGC ACTCCCAGTG AGAAGAAGGG CGAAACTGAG ACCAGAAAGG AGGGAGAAAA CTTTCCTACT TCTTATTTGA AAGGTCTCAA AGTGTCCTAT GCTACCGTGA AACCTGTCAA AGATGAGCCT TTCGATAAGA CTTCGACCGA TATCAAGTCA TTGGTAGAAT CCTTAAAGGC TCTTTTGTCG AAGATGGGCG CAAAGAATCC CCTTGAAAAG CTCCAGATCA AAGAAGGTAC AGAATTGGTG AACGATCCAC CCAGATTTGC CGATTTTGTA GGCTCCACTA TTCATGGAGA CCCCAAGAAG ATCCAAGAGA TCTTGGAATC ATTGAACATC CAGACAAGAT TATCAAAGGC TTTGGAATTG TTGAAAGTTG AACTCAAGGC AAGCTTAATT AAAGAGAACA CCATCCATAA CTTGAGTACC AAGGCCGATG AATACCAAAC GAGACTCTTC ATAAAGGAAT TTATCAAGGA ATTGCAAAAG CGTGCTGGAA TTGTAGAGTC TGATGACAAA AAGACGTCGA AATTTGATGA GCGTCTCAAA CATTTGAAGA TGACAGAAGA GGCTCTTGAA GCATACAATG CCGAAAAGGC AAAAATGGAA AGTCAGAACG AACACTCGAG TGAGCTTGGT GTTAGTGAGA GATACTTGGA TTGGTTGACT TCGATTCCCT GGGGAATCTA TTCTAAGGAT CGCTTTAATA TCAAGCAGGC CAGAGAGATC TTGGACAGGG ACCACTATGG GTTGAAAGAT GTCAAGGACA GAATCTTAGA GTTCATCTCT ATGGGCAGAG TTTCAGGAAA AGTCGATGGG AAGATATTGT GTTTGACAGG CCCACCCGGT ACTGGTAAAA CATCCATAGC CAAGTCTATT GCCGAGTCAT TGAACCGTAA GTATGTTAGA ATCGCCATGG GTGGTATCCA GGATGTTCAC GAAGTTAAAG GTCATAGAAG AACATATGTT GGATCAATTC CTGGTCGTAT CATTTCTGCG TTGAAGCAAG CCAAAACGTC CAATCCATTG ATGTTGATTG ATGAAATTGA CAAGTTGGAC TTAAGTCGTA GTGGGGGTGC CTCTTCAGCC TTTTTGGAGA TCTTGGACCC TGAACAGAAT AATGCCTTTG TTGACAACTA CATTGATGTC AAGGTCGATT TGTCCAAGGT GTTGTTTGTT TGTACTGCTA ATTATTTGGG CAACATTTCT CCTCCGTTGA GAGACCGTAT GGAAATCATT GAAGTCAATG GTTACACCAA CAATGAGAAA ATTGAGATTG CCAAAAGACA CTTGATTCCA GATGCAGCCA AAAAAGCTGG ATTGGAAGGT GGGCATGTTG TAATTGAGAC GAAGACCATT TCTAGATTGA TAGAGAAGTA CTGTCGTGAA AGTGGATTGA GAAACATCAA AAAGCTTATC ACCAGAATCT TCAGCAAGGC CTCTCTCAAG ATCGTGGAAG AAGTTGAAGC TAGAGAAGGC GAATCAAAAT CGAAATCTGA AGAAGCTAAG TCAGAAGCTA TCACTGGTTC TGTTACTGAG ATTTCTGTTG AAGATGCTAC AGTAAAGGCC CAGTCCATTG AAGAACCTAG TGTAGAATCT GCTTCTCAGA AGGTTGACGA AGCCAAACCT GTCGAATCAG AAGAACTTAA ATCAGATGAA GAAGAAGAGG AAGTCGTGAA GTTGGAAATT CCAGATGACA TAAAGTTGGA AATCACTTCT GCCAACTTGA AGGATTATGT TGGACCAGAG ATTTATACTA GGGACCGTGT CTACGACATC CCTCCTCCTG GTGTTGCTAC TGGTCTTTCG TATAGTACTT CTGGTAATGG AGATGCATTG TACATTGAAT CTATCTTAAC ACACTCTATT GGATCAGGTT CGGGACATGC TAGTATTCAT GTTACTGGTA GCCTCAAGGA TGTCATGAAG GAATCTGCTT CCATCGCTTA TTCTTTTGCC AAACTGTACA TGGTCAAGAA CTACCCAGAA AACAGATTCT TTGAAGCTGC TGAGATTCAT GTTCACTGTC CTGACGGTGC TATTCCAAAG GATGGTCCTT CCGCTGGTAT TTCCTTCACA TCTTCATTGA TTTCATTAGC TCTTCAAAAG CCTTTGCCTC CTACAATTGC CATGACAGGT GAGATCACTG TTACTGGTAG GGTATTGGCC GTTGGAGGTT TAAGAGAAAA GATCTTAGGT GCTAAGAGAT ACGGATGTAA CACCATTATC TTCCCCAAGG ATATTGAAAA CGAACTTGAA GAAATCCCTG AAGAAGTAAA GGAGGGTGTT AAATTTATCC CCGTCGAATG GTACCAGGAT GTATTTGACG AAATATTCCC CAACTTGTCT AGTGATGAAG GTAACGAGGT ATGGAAGGAA GAGTTCAACA AATTGGATAA GAAGAAGGCT AGCAACAAGA AGAAATGA
|
Protein sequence | SNNNNNNNNN NNDNDEPNEI VTNAGTGLYP PLLAIPMKDR PPLPGRPFAI NITDPEVIRS IYTIIDKREP YFVLFHVKDP NEGDTDVINS KDSVYNIGVH CQIIRHTTPR PGVFNVLGYP LERCSLADLS TPSEKKGETE TRKEGENFPT SYLKGLKVSY ATVKPVKDEP FDKTSTDIKS LVESLKALLS KMGAKNPLEK LQIKEGTELV NDPPRFADFV GSTIHGDPKK IQEILESLNI QTRLSKALEL LKVELKASLI KENTIHNLST KADEYQTRLF IKEFIKELQK RAGIVESDDK KTSKFDERLK HLKMTEEALE AYNAEKAKME SQNEHSSELG VSERYLDWLT SIPWGIYSKD RFNIKQAREI LDRDHYGLKD VKDRILEFIS MGRVSGKVDG KILCLTGPPG TGKTSIAKSI AESLNRKYVR IAMGGIQDVH EVKGHRRTYV GSIPGRIISA LKQAKTSNPL MLIDEIDKLD LSRSGGASSA FLEILDPEQN NAFVDNYIDV KVDLSKVLFV CTANYLGNIS PPLRDRMEII EVNGYTNNEK IEIAKRHLIP DAAKKAGLEG GHVVIETKTI SRLIEKYCRE SGLRNIKKLI TRIFSKASLK IVEEVEAREG ESKSKSEEAK SEAITGSVTE ISVEDATVKA QSIEEPSVES ASQKVDEAKP VESEELKSDE EEEEVVKLEI PDDIKLEITS ANLKDYVGPE IYTRDRVYDI PPPGVATGLS YSTSGNGDAL YIESILTHSI GSGSGHASIH VTGSLKDVMK ESASIAYSFA KSYMVKNYPE NRFFEAAEIH VHCPDGAIPK DGPSAGISFT SSLISLALQK PLPPTIAMTG EITVTGRVLA VGGLREKILG AKRYGCNTII FPKDIENELE EIPEEVKEGV KFIPVEWYQD VFDEIFPNLS SDEGNEVWKE EFNKLDKKKA SNKKK
|
| |