Gene Mlab_1168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1168 
SymbolmutL 
ID4795836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1188725 
End bp1190491 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content61% 
IMG OID640099841 
ProductDNA mismatch repair protein 
Protein accessionYP_001030604 
Protein GI124485988 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.985109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.444607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGG TCAAAATCCT CGACGAGGAG ACGATCAGCC ACATCGCGGC GGGCGAAGTG 
GTCGAGCGTG CGGCGTCCGT CGTGAAAGAG CTCGTCGAAA ACGCCGTGGA TGCGGACGCC
CAAATCATCC GGATCGGCAT ATCGGCCGAC AAAACCGGGA TAACCAAAAT CTCCGTCACG
GACGACGGGA TCGGGATGGA CTTCGACGAC GCTCTTCTGG CATTCCGCCA GCACGCAACA
AGCAAGATAT CCCGCCCTGA GGATCTCGAT GGGATCACCA CGCTCGGGTT CCGCGGCGAG
GCTCTTGCAA GCATCGCGGC GATCTCGAAG GTGACCTTCA CGACAAAGGA ACGCGGCTCC
CCTTCGCCCG AAGCGGCCCG CGTGGTGATC CACGGCGGCG AGCTGATCTC TCACTCGGCT
GTTGGTGCGC CGGAAGGAAC GAGCGTTCTT ATCGACGCTC TCTTTTACAA CACTCCCGCC
CGGCGCAAGT TCCAGAAGTC CGTTCCAACG GAGTTGTCCC ACGTCTACGA CATGGTCGAG
CGGATCGCCC TTTCGAACAG GAACATCTCG TTTGTTCTGC TGTACAACGG CAAAGAGCGG
TTCCAGACCT TTGGGACAGG CTCGTATCCG GACGTGATCG CCGCGGTGTT CGGCTCCACC
TTTTCCAAAG AGCTGACCCC GGTCTCCGGC AGTTTCGGGC CGGTGAAAAT CGACGGCTGG
ATCACGCGTC CCGGCTCGGA GATGAAGACG ACCCAGACGC GGTTTTATCT CTCGATAAAC
GGCCGGCAGG TGACGTCCCG CCAGCTGCAG TGGGCGATCC GCGAAGGATA CGGCACGCTT
CTGCCAAAGG GCATGTACCC TGCGGCGTTT CTTGATATCG TCCTCGATCC CCGGGACGTG
GATGTGAACG TGCATCCGAC AAAGCGGGAG GTCCGCCTCT CCCGCGAGAG GGAAGTGATG
CGGTGCGTTC AGGATGCGGT CTATACATCG CTGCATGAAG AGCGGGTCTT TTCCACCGCC
CCCATGCCTA CCCTCGCCCG CGAGACTATC ACGACCCTTC CGGTAGAGAT CGTCGGCGAG
CCGGTGCCTG TATATGCCGG GAAGCAGGAG ATGCATGAGG CAAGACAGGC CCCTCTCAAA
CAGACGGAGA AGCAGCTTCG GCGGACCGAG TCTGCGGATC TGCCGGAGAC CGATCTGTTC
GTCCCGGAAG TCCTCGGGCA GATCGGGGAC ACCTACATTC TTGCGAAGAA TGAATCCGGC
GACCTTATCG TCGTGGATCA GCATGCGGCC CACGAGCGGA TCATGTACGA TCAGCTGCTC
GCCCGCTCCT CGTCAGCGGA GGCCGGGCAG GAACTGATCG TCCCCCAGCC GATCACTCTC
TCGAAAAAGG AGACCGCCGC CCTTCCCGAT TTGCTGGATG TCCTCGCCGC CGCGGGATAC
CTTCTAGAAC CGTTTGGAAA AGACGTCTGG ATGGTCCGCT CCGTCCCCGT CGTCTCCTCA
ACGCTCGGCG ACCCGGACAC CATCCATGCG ATCCTGGACG CGGCGCTGGA CGGGGTGGGG
AACACCGACG AGGTCCTTGA TCGGGTGCTG AAGACCGCTG CGTGCCGTGC TGTTGTGAAG
GGGAACACGC CGCTGACGAT CGAACAGATG CAGCGTCTCT TACGCCAGCT TATGGCGACA
AAATCCCCGT ACACCTGCCC TCACGGTCGC CCTACGACGA TCGTTCTCTC GAAATCGCGG
TTGGCCGGGA TGTTTCTGAG AACATAA
 
Protein sequence
MSRVKILDEE TISHIAAGEV VERAASVVKE LVENAVDADA QIIRIGISAD KTGITKISVT 
DDGIGMDFDD ALLAFRQHAT SKISRPEDLD GITTLGFRGE ALASIAAISK VTFTTKERGS
PSPEAARVVI HGGELISHSA VGAPEGTSVL IDALFYNTPA RRKFQKSVPT ELSHVYDMVE
RIALSNRNIS FVLLYNGKER FQTFGTGSYP DVIAAVFGST FSKELTPVSG SFGPVKIDGW
ITRPGSEMKT TQTRFYLSIN GRQVTSRQLQ WAIREGYGTL LPKGMYPAAF LDIVLDPRDV
DVNVHPTKRE VRLSREREVM RCVQDAVYTS LHEERVFSTA PMPTLARETI TTLPVEIVGE
PVPVYAGKQE MHEARQAPLK QTEKQLRRTE SADLPETDLF VPEVLGQIGD TYILAKNESG
DLIVVDQHAA HERIMYDQLL ARSSSAEAGQ ELIVPQPITL SKKETAALPD LLDVLAAAGY
LLEPFGKDVW MVRSVPVVSS TLGDPDTIHA ILDAALDGVG NTDEVLDRVL KTAACRAVVK
GNTPLTIEQM QRLLRQLMAT KSPYTCPHGR PTTIVLSKSR LAGMFLRT