Gene SbBS512_E4701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4701 
SymbolmutL 
ID6273284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4392961 
End bp4394808 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content56% 
IMG OID641728466 
ProductDNA mismatch repair protein 
Protein accessionYP_001882861 
Protein GI187733395 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0915673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATTC AGGTCTTACC GCCACAACTG GCGAACCAGA TTGCCGCAGG TGAGGTGGTC 
GAGCGACCTG CGTCGGTAGT CAAAGAACTG GTGGAAAACA GCCTCGATGC AGGTGCGACG
CGTATCGATA TTGATATCGA ACGCGGTGGG GCGAAACTTA TCCGCATTCG TGATAACGGC
TGCGGTATCA AAAAAGATGA GCTGGCGCTG GCGCTGGCTC GTCATGCCAC CAGTAAAATC
GCCTCTCTGG ACGATCTCGA AGCCATTATC AGCCTGGGCT TTCGCGGTGA GGCGCTGGCG
AGTATCAGTT CGGTTTCCCG CCTGACGCTC ACTTCACGCA CCGCAGAACA GCAGGAAGCC
TGGCAGGCCT ATGCCGAAGG GCGCGATATG AACGTGACGG TAAAACCGGC GGCGCATCCT
GTGGGGACGA CGCTGGAGGT GCTGGATCTG TTCTACAACA CCCCGGCGCG GCGCAAATTC
CTGCGCACCG AGAAAACCGA ATTTAACCAC ATTGATGAGA TCATCCGCCG CATTGCGCTG
GCGCGTTTCG ACGTCACGAT CAACCTGTCG CATAACGGTA AAATTGTGCG TCAGTACCGC
GCAGTGCCGG AAGGCGGGCA AAAAGAACGG CGCTTAGGCG CGATTTGCGG CACTGCTTTT
CTTGAACAAG CGCTGGCGAT TGAATGGCAA CACGGCGATC TCACGCTACG CGGCTGGGTG
GCCGATCCAA ATCACACCAC GCCCGCACTG GCAGAAATTC AGTATTGCTA CGTGAACGGT
CGCATGATGC GCGATCGCCT GATCAATCAC GCGATCCGCC AGGCCTGCGA AGACAAACTG
GGGGCCGATC AGCAACCGGC ATTTGTGCTG TATCTGGAGA TCGATCCGCA TCAGGTGGAC
GTCAACGTGC ACCCCGCCAA ACACGAAGTG CGTTTCCATC AGTCGCGTCT GGTGCATGAT
TTTATCTATC AGGGCGTGCT GAGCGTGCTA CAACAGCAAC TGGAAACGCC GCTACCGCTG
GACGATGAAC CCCAACCTGC ACCGCGTTCC ATTCCGGAAA ACCGCGTGGC GGCGGGGCGC
AATCACTTTG CAGAACCGGC AGCTCGTGAG CCGGTAGCTC CGCGCTACAC TCCTGCGCCA
GCATCAGGCA GTCGTCCGGC TGCCCCTTGG CCGAATGCGC AGCCAGGCTA CCAGAAACAG
CAAGGTGAAG TGTATCGCCA GCTTTTGCAA ACGCCCGCGC CGATGCAAAA ATTAAAAGCG
CCGGAACCGC AGGAACCTGC ACTTGCGGCG AACAGTCAGA GTTTTGGTCG GGTACTGACT
ATCGTCCATT CCGACTGTGC GTTGCTGGAG CGCGACGGCA ACATTTCACT TTTAGCCTTG
CCAGTGGCAG AACGTTGGCT ACGTCAGGTA CAACTGACGC CGGGTGAAGC GCCAGTTTGC
GCTCAGCCGC TGCTTATTCC GTTGCGGCTA AAAGTTTCTG GCGAAGAAAA ATCGGCATTA
GAAAAAGCGC AGTCTGCCCT GGCGGAATTG GGTATTGATT TCCAGTCAGA TGCACAGCAT
GTGACCATCA GGGCCGTGCC TTTACCCTTA CGCCAACAAA ATTTACAAAT CTTGATTCCT
GAACTGATAG GCTACCTGGC GAAGCAGTCC GTATTCGAAC CTGGCAATAT TGCGCAGTGG
ATTGCACGAA ATCTGATGAG CGAACATGCG CAGTGGTCAA TGGCACAGGC CATAACCCTG
CTGGCGGACG TGGAACGGTT ATGTCCGCAA CTTGTGAAAA CGCCGCCGGG TGGTCTGTTA
CAATCTGTTG ATTTACATCC GGCGATAAAA GCCCTGAAAG ATGAGTGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG 
CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA
WQAYAEGRDM NVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL
ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV
ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRS IPENRVAAGR
NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKLKA
PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLAL PVAERWLRQV QLTPGEAPVC
AQPLLIPLRL KVSGEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP
ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL
QSVDLHPAIK ALKDE