Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85730 |
Symbol | |
ID | 4840916 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 576114 |
End bp | 579066 |
Gene Length | 2953 bp |
Protein Length | 871 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640392231 |
Product | predicted protein |
Protein accession | XP_001386506 |
Protein GI | 150866793 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.144196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTTGGC AGTGTTCAGA TCGAATACAA GAGTGCCGAG GAACCTGGTT CTTTTTTTTT TTAGACGAAT ATCGCATTCA TGTCTGGTAA GCTTACCTTC CATATCCAAG ACACTCCAAG CACAGACTGC TAGAAACGAA GAACCCATAG CTGAACAACT TTTAGGAGCT ACACTTGACT GGGACCGTGG AGGACGAACA AAAAAGAATG GCTTAACTCC TCTATATTCA CTGATGAAAA AAATCATCGA TCTGAACCAA GGATGCGTTT GTTTAATTCA AGTAGGAAGT TTCTATGAGC TCTATTTTGA GCAAGCTCTG GACTACGGTC CCCAATTGGG ACTCAAAGTT TCCACAAGAA AAACCAACAA CTACACCATA CCTATGGCAG GGTTTCCTAC TTATCAATTG CAGAAATTTG TTAAGATTTT GGTGCAAGAC TTGGGTGAAA ATGTAGCCAT TATAGACCAA TTCCCCACAA GAAAGATCTC AGAAACGATA ATACACCGCA AAATATCACG AATTGTTTCC CCAGGGACTC TTGTAGATGA AACATTCATG AACTATAACC AGAACAACTT CTTGCTAGCA ATCTCGTTTC CTGCTAATTG TACAAAGGTT CCAGCCGATC CTGAGACTGC GGTAGGACTC TCATGGATTG ATGTAAGTGT TGGGGAGTTC TATGTTCAGA ACACAACTTT GGGGAATATG ATTTCTGACA TTTCCAGGGT TAATCCCAGT GAAATCATCA TCTCCAAAGA GTTTCAGGAT ATGAACATTA TCGACGGTAA TTGGTATCCA CCTCTTCAGG AATTGCGTCG GTTCTTCTTA CGCTACCACA AGACGACATA CAACGATCTG AAGTTGAAGT TCAAAAGTGG GTTACAAACC ACAAGAAAGA TGTTGGAAAG CTTTACCGTT AGAGAAGAGG CAGCAATGAA CATGATCTTG TCGTATATTG ATGTAAATTT ACCTGAATCC AATCCTTCCT TGGATCACCC CATCACATAC TGGAATCAGA GTTGTCTACA GATGGATGCC CGAACTCGTG AAGCACTAGA ATTGACCGAA AGATCCACCA GTGGCAGATC TTCTGTTGTA GGATCTCTCT TGACGACTGT TAAGCGAACT ATCACACCTT CTGGATCTAG ATTGTTAACT CAATGGATAA AGTCTCCAAT TCTCGATGTT AACGAAATTC GCCGCAGACA AGGTTTTGTC CAGACTTTCC TTGAGAACCA CCAAGTGACA ACCTCTTTGA GGTACCAGTT GCTGCAGCTT GGTGACTTTA TTAGGTCGTT ACAAAGACTA GCGTTCGGTG CAGGTGATAG CGTTACCCAC TTGCTAGCAA TTGCCGATAG CATAGCAAAG TTACAGGAAA TAGAAGTATT CTTAAGAACA GAACATTCCA ATAACAAAAA GGGATTGAAA ATTTTGGACA AATTTTTGAA GGAGTTTGTT GTTCCTTCAG ACATTTCAGA AGAGATCATA TCAACACTCC ATATTCAGAT AAATGACGTT AATTCAAGTT TCCTGGACAT GTCAGAGGAA GTAATAGGAG AATTCGAAGA TTCTGAAGAT TCTGAAGAGT ATCCACATAT TGATACAGGT TCTTATAGCA ACAAATCTAT TGACAAGTAT AGATTCGTAC CAAAACCTAA AGGGGAAAGT GTATTCAGCT TCTCAGTGAG GCGTGATTAT AACAAATCTT TGTTAGACTT GCACAACCTG ATGGATATCT TGAAAGATAA AGAGGATAAC ATGATATCAG CTGTTAGAGA AGAATTGGGC AAGATCGATC CAAAACTACT TGTTTCCAAA AAAGAACAAC ATGGAAGGTA TCTGAATATT TTGCACATTT CTGGTAAACA AAAACTGATA GAAGAGGTCT ATGTTCATCT TGGTAATGAT GTCAGAGACA AGAAGAAGGC TTCACTCTTG TATAAGCCAA CTGAATGGAA CAACTTGCAG GTGATCATTG AAGAAAAGAA AGAGCATATA AGAGAATTGG AACGCCAAAT CGTTGATCTG CTTAGACAGA AAGTGCTAGA TAAAGCATCT GATATCAGAA AAGTCAGTAA GATGGTTGAT TTCTTGGATG TGACATCTTC TTTTGCAATT TTGGCGGAAG AGAATAACTT AGTATGTCCA AAATTTGTGA AGACTTCTCT GATTAACATT GAGAATGGCA GACATTTTGT GGTTGAATCA GGCTTGAAGT CAGTTGGTAA GATGTTTACA CCTAATGATA CCAAGATCAC TTCCAGTGCC AACCTTTGGG TCGTTTCAGG ACCCAATATG GGAGGTAAGA GTACGTTCTT AAGACAGAAT GCAATTATAG TCATTCTAGC ACAAATTGGT TCTTTTGTTC CTGCTTCAAA AGCCAATCTC GGAATTGTAG ATAAGATATT TACCAGAATA GGAGCCTCAG ACGATTTATT CAACGACTTA AGTACTTTCA TGGTTGAGAT GGTAGAGACT AGCAATATCT TGCGCAATGC AACCTCTCAT TCGTTAGCCA TTGTTGATGA AATTGGAAGA GGAACCAGTG GAAAAGAAGG ACTAGCGCTT GCATATGCTA CTTTGTACAA CTTGTTGCTG GTCAACAAAT GCCGTACACT CTTTGCAACT CATTTTGGTA AGGAATTAGA GCAATTATTG AAAGCAAATA AAGTGAGCCA AAGTAAGATA CGCTACTTCC GTACCAGAGT AATTCAAGAT GATGACGACA AAAACCCCTC AGGACTTGGT CTTGTCATAG ACCATACTTT GGAAAAGGGT ATCAGCGAGA GATCGTATGC ACTTGAAGTG GCCCAGATGG CAGGATTCCC GCCAGAAGCA TTAAAAAATG CTCGAATGGC ACTTGATCTA CTAGATTAAG ATTTAGACAG ATAGCTTAGC ATATACCATA TAGCATACAT GCCATCAGAG CATATATTTA CAGAATCTAT AAT
|
Protein sequence | MKKIIDSNQG CVCLIQVGSF YELYFEQASD YGPQLGLKVS TRKTNNYTIP MAGFPTYQLQ KFVKILVQDL GENVAIIDQF PTRKISETII HRKISRIVSP GTLVDETFMN YNQNNFLLAI SFPANCTKVP ADPETAVGLS WIDVSVGEFY VQNTTLGNMI SDISRVNPSE IIISKEFQDM NIIDGNWYPP LQELRRFFLR YHKTTYNDSK LKFKSGLQTT RKMLESFTVR EEAAMNMILS YIDVNLPESN PSLDHPITYW NQSCLQMDAR TREALELTER STSGRSSVVG SLLTTVKRTI TPSGSRLLTQ WIKSPILDVN EIRRRQGFVQ TFLENHQVTT SLRYQLSQLG DFIRSLQRLA FGAGDSVTHL LAIADSIAKL QEIEVFLRTE HSNNKKGLKI LDKFLKEFVV PSDISEEIIS TLHIQINDEV IGEFEDSEDS EEYPHIDTGS YSNKSIDKYR FVPKPKGESV FSFSVRRDYN KSLLDLHNSM DILKDKEDNM ISAVREELGK IDPKLLVSKK EQHGRYSNIL HISGKQKSIE EVYVHLGNDV RDKKKASLLY KPTEWNNLQV IIEEKKEHIR ELERQIVDSL RQKVLDKASD IRKVSKMVDF LDVTSSFAIL AEENNLVCPK FVKTSSINIE NGRHFVVESG LKSVGKMFTP NDTKITSSAN LWVVSGPNMG GKSTFLRQNA IIVILAQIGS FVPASKANLG IVDKIFTRIG ASDDLFNDLS TFMVEMVETS NILRNATSHS LAIVDEIGRG TSGKEGLALA YATLYNLLSV NKCRTLFATH FGKELEQLLK ANKVSQSKIR YFRTRVIQDD DDKNPSGLGL VIDHTLEKGI SERSYALEVA QMAGFPPEAL KNARMALDLL D
|
| |