Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_51649 |
Symbol | MLH3 |
ID | 4851505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2002714 |
End bp | 2004618 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393213 |
Product | DNA mismatch repair |
Protein accession | XP_001388013 |
Protein GI | 126274683 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0197688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCGA GTGGTAGGAT TCGAAAACTA AATCCACAGG TTCTGAGTGA ATTGAGATCG CAAACTATCT TCAACTCTTT AGCCTCGGTA GTTCAGGAAT TGTTAAGAAA TAGCTTGGAT GCTCAAGCCA AGGTGGTAGA AATACGTCTA GACTTGGATT CGCTCGCTGT ACAGGTTGCC GACAATGGAG TTGGAATTCC TGCTGATGAT ATGCTGATGG TAGGCGAGAG ATATCACACT TCAAAGTTGA AGCAAATTAA GGACTTGCCT TTTATCTCTA CATATGGCTA TAGGGGAGAA GCATTATACG CTTTAGGTTT GGTATCTCGA CTTTCTATTG TGTCTAAAAG TGACTCTGGA GACGTTACTT TTGTTCGAAT GATTTCGTAT AATTCGAATA CAGAAGTCTA TGATTATCAA AATTATACGA ACGATGGCTT CTTTCGGGTA GAGCCGATCA AAAAGAACGG AACCATAGTC ACGGCAACAG GATTGTATTG CAACCTTCCT GTTCGTCGAC AACAGATTAG AGCAGTTTCG CAATTCAAGA TTATTGATGA AATTAGACAT ATTGTATTTC AGAGTCTTGT CAAATTTCCA GATGTGAGTA TCAAAGTACT ACGACTTGAC CATGATTCCT TGAATCCAGA TATTCTCATA AACTATTCTC CCAGTAACAC ACGAAAGACT GATAATTTTG CTTGTATATT CAGGAACATC TATGGAAAGT CTGTTCTACC AAAATTTCAC ACCCTAGAAG CTGAACAAAG AGGATTGCAG TTGACAGGAT TTGTAGGAAC GGACCCTGTA AGCTCGAAGA GGTTCCAATA CATATTCTTC AATGGTTCTC TCCAAGGTTA TGAGACCACA CGTATAGCTG TAAACCAAAC TTTCAAAGAA TCGAGATTTG GAGATATACC TGAGTCTGTT TCTTATCGTA CAGGGGAATC TCCTTCTAAG AAGCGATCTA GGTTGCTGTG GGTTTGGCCA GTCTTTCTTG TTTGCATAGA AAGCGTTAAA AAAGGGGCTA CTATTGAAGC CGATGAAATC ACTAAGTTTG TGGTGAAAGT CTTCAAGCAA TTTTTGGTCT CTCAAGGGTT TCAGGTTGAT TCTGGGCCAT CCGTATTTGG CTCTCCCCGG ATATCATTGT CTCCATCGAA ACGAAGAAAG ACATCTCCCA GTGGAGATGA GGAACCTAGA GTGAAGAAAA GTGATCAGCT TGATAGTACT TTTCTAAATG TTGCAGAATC AGGTTTAACT TCGGGCAACT ATAGAATTGT GAGACAACTT GATAGCAAGT TTATCTTGGT GAGCAGTTCG AATAATCTTG GAGGCAAAGT ACTTCTAGTT ATTGACCAAC ATGCTTGCGA TGAAAGAATA AAGGTAGAGG CACTCTTTAA AGACTTCATA TTTCTTGTCT TAGATGCTCA CACCAATTTG CTGCTACGAG TGGTTGAGCC TGTAACGTTT GCTGTTAGTA GCGTAGAAGT TCAGTTGTTC GAGGAATATG CGGAGAATCT TAACAAGTTT GGAATTAGGT TCATCATTGA AGGTCTAACT ATAGTCGTAA CCCACATGCC CCAAATAATT TTGGAGAAGT CAGACATAGA TGCTGATATC TTGAGGAGGT GGTTGTTGCT GCATGTAAAC GATTTAAAAG AAGAAAGCAA GTCGGCAATC GTAGATACAT ATTCTATTAA TGATTGGTTT CCTTTTGTTC GTCACTTGCC CACCTTCTTG ATCGATATTA TCAATTCTAA GGCATGCCAT TCTTCTGTGG TTTTCGGAGA GGTATTGGAG TATTCTGAAA TGGAGAAAAT GGTACGGCAG CTCTTGCACT GTCGTCTACC GTTTCAATGT GCTCACGGAC GGCCATCTAT AGTTCCATTA GTAAACATAC AGTAA
|
Protein sequence | MLSSGRIRKL NPQVLSELRS QTIFNSLASV VQELLRNSLD AQAKVVEIRL DLDSLAVQVA DNGVGIPADD MLMVGERYHT SKLKQIKDLP FISTYGYRGE ALYALGLVSR LSIVSKSDSG DVTFVRMISY NSNTEVYDYQ NYTNDGFFRV EPIKKNGTIV TATGLYCNLP VRRQQIRAVS QFKIIDEIRH IVFQSLVKFP DVSIKVLRLD HDSLNPDILI NYSPSNTRKT DNFACIFRNI YGKSVLPKFH TLEAEQRGLQ LTGFVGTDPV SSKRFQYIFF NGSLQGYETT RIAVNQTFKE SRFGDIPESV SYRTGESPSK KRSRLLWVWP VFLVCIESVK KGATIEADEI TKFVVKVFKQ FLVSQGFQVD SGPSVFGSPR ISLSPSKRRK TSPSGDEEPR VKKSDQLDST FLNVAESGLT SGNYRIVRQL DSKFILVSSS NNLGGKVLLV IDQHACDERI KVEALFKDFI FLVLDAHTNL LLRVVEPVTF AVSSVEVQLF EEYAENLNKF GIRFIIEGLT IVVTHMPQII LEKSDIDADI LRRWLLLHVN DLKEESKSAI VDTYSINDWF PFVRHLPTFL IDIINSKACH SSVVFGEVLE YSEMEKMVRQ LLHCRLPFQC AHGRPSIVPL VNIQ
|
| |