Gene PICST_51649 details

Gene Information

Locus tag: PICST_51649
Symbol: MLH3
ID: 4851505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2002714
End bp: 2004618
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table:
GC content: 40%
IMG OID: 640393213
Product: DNA mismatch repair
Protein accession: XP_001388013
Protein GI: 126274683
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
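
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2002714, 2004618)
print(length, protein_length(length))
```

With the values listed for PICST_51649, this gives 1905 bp and 634 aa, matching the Gene Length and Protein Length fields.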


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0197688
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTCGA GTGGTAGGAT TCGAAAACTA AATCCACAGG TTCTGAGTGA ATTGAGATCG 
CAAACTATCT TCAACTCTTT AGCCTCGGTA GTTCAGGAAT TGTTAAGAAA TAGCTTGGAT
GCTCAAGCCA AGGTGGTAGA AATACGTCTA GACTTGGATT CGCTCGCTGT ACAGGTTGCC
GACAATGGAG TTGGAATTCC TGCTGATGAT ATGCTGATGG TAGGCGAGAG ATATCACACT
TCAAAGTTGA AGCAAATTAA GGACTTGCCT TTTATCTCTA CATATGGCTA TAGGGGAGAA
GCATTATACG CTTTAGGTTT GGTATCTCGA CTTTCTATTG TGTCTAAAAG TGACTCTGGA
GACGTTACTT TTGTTCGAAT GATTTCGTAT AATTCGAATA CAGAAGTCTA TGATTATCAA
AATTATACGA ACGATGGCTT CTTTCGGGTA GAGCCGATCA AAAAGAACGG AACCATAGTC
ACGGCAACAG GATTGTATTG CAACCTTCCT GTTCGTCGAC AACAGATTAG AGCAGTTTCG
CAATTCAAGA TTATTGATGA AATTAGACAT ATTGTATTTC AGAGTCTTGT CAAATTTCCA
GATGTGAGTA TCAAAGTACT ACGACTTGAC CATGATTCCT TGAATCCAGA TATTCTCATA
AACTATTCTC CCAGTAACAC ACGAAAGACT GATAATTTTG CTTGTATATT CAGGAACATC
TATGGAAAGT CTGTTCTACC AAAATTTCAC ACCCTAGAAG CTGAACAAAG AGGATTGCAG
TTGACAGGAT TTGTAGGAAC GGACCCTGTA AGCTCGAAGA GGTTCCAATA CATATTCTTC
AATGGTTCTC TCCAAGGTTA TGAGACCACA CGTATAGCTG TAAACCAAAC TTTCAAAGAA
TCGAGATTTG GAGATATACC TGAGTCTGTT TCTTATCGTA CAGGGGAATC TCCTTCTAAG
AAGCGATCTA GGTTGCTGTG GGTTTGGCCA GTCTTTCTTG TTTGCATAGA AAGCGTTAAA
AAAGGGGCTA CTATTGAAGC CGATGAAATC ACTAAGTTTG TGGTGAAAGT CTTCAAGCAA
TTTTTGGTCT CTCAAGGGTT TCAGGTTGAT TCTGGGCCAT CCGTATTTGG CTCTCCCCGG
ATATCATTGT CTCCATCGAA ACGAAGAAAG ACATCTCCCA GTGGAGATGA GGAACCTAGA
GTGAAGAAAA GTGATCAGCT TGATAGTACT TTTCTAAATG TTGCAGAATC AGGTTTAACT
TCGGGCAACT ATAGAATTGT GAGACAACTT GATAGCAAGT TTATCTTGGT GAGCAGTTCG
AATAATCTTG GAGGCAAAGT ACTTCTAGTT ATTGACCAAC ATGCTTGCGA TGAAAGAATA
AAGGTAGAGG CACTCTTTAA AGACTTCATA TTTCTTGTCT TAGATGCTCA CACCAATTTG
CTGCTACGAG TGGTTGAGCC TGTAACGTTT GCTGTTAGTA GCGTAGAAGT TCAGTTGTTC
GAGGAATATG CGGAGAATCT TAACAAGTTT GGAATTAGGT TCATCATTGA AGGTCTAACT
ATAGTCGTAA CCCACATGCC CCAAATAATT TTGGAGAAGT CAGACATAGA TGCTGATATC
TTGAGGAGGT GGTTGTTGCT GCATGTAAAC GATTTAAAAG AAGAAAGCAA GTCGGCAATC
GTAGATACAT ATTCTATTAA TGATTGGTTT CCTTTTGTTC GTCACTTGCC CACCTTCTTG
ATCGATATTA TCAATTCTAA GGCATGCCAT TCTTCTGTGG TTTTCGGAGA GGTATTGGAG
TATTCTGAAA TGGAGAAAAT GGTACGGCAG CTCTTGCACT GTCGTCTACC GTTTCAATGT
GCTCACGGAC GGCCATCTAT AGTTCCATTA GTAAACATAC AGTAA
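
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be recomputed. The following is a minimal sketch, assuming the sequence has been pasted into a Python string. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, as printed on this page.
first_line = "ATGTTGTCGA GTGGTAGGAT TCGAAAACTA AATCCACAGG TTCTGAGTGA ATTGAGATCG "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to the full sequence gives values that can be compared with the Gene Length and GC content fields in the Gene Information section.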
 
Protein sequence
MLSSGRIRKL NPQVLSELRS QTIFNSLASV VQELLRNSLD AQAKVVEIRL DLDSLAVQVA 
DNGVGIPADD MLMVGERYHT SKLKQIKDLP FISTYGYRGE ALYALGLVSR LSIVSKSDSG
DVTFVRMISY NSNTEVYDYQ NYTNDGFFRV EPIKKNGTIV TATGLYCNLP VRRQQIRAVS
QFKIIDEIRH IVFQSLVKFP DVSIKVLRLD HDSLNPDILI NYSPSNTRKT DNFACIFRNI
YGKSVLPKFH TLEAEQRGLQ LTGFVGTDPV SSKRFQYIFF NGSLQGYETT RIAVNQTFKE
SRFGDIPESV SYRTGESPSK KRSRLLWVWP VFLVCIESVK KGATIEADEI TKFVVKVFKQ
FLVSQGFQVD SGPSVFGSPR ISLSPSKRRK TSPSGDEEPR VKKSDQLDST FLNVAESGLT
SGNYRIVRQL DSKFILVSSS NNLGGKVLLV IDQHACDERI KVEALFKDFI FLVLDAHTNL
LLRVVEPVTF AVSSVEVQLF EEYAENLNKF GIRFIIEGLT IVVTHMPQII LEKSDIDADI
LRRWLLLHVN DLKEESKSAI VDTYSINDWF PFVRHLPTFL IDIINSKACH SSVVFGEVLE
YSEMEKMVRQ LLHCRLPFQC AHGRPSIVPL VNIQ
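
The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. This is a sketch that assumes the standard genetic code (NCBI translation table 1), since the Translation table field on this page is empty. Under that assumption, the first 30 bases reproduce the first block of the listed protein sequence. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence on this page.
print(translate("ATGTTGTCGA GTGGTAGGAT TCGAAAACTA"))
```

The printed peptide can be compared with the first block of the protein sequence, `MLSSGRIRKL`.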