Gene SeD_A4756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4756 
SymbolmutL 
ID6872371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4614209 
End bp4616065 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content58% 
IMG OID642787649 
ProductDNA mismatch repair protein 
Protein accessionYP_002218243 
Protein GI198245330 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG 
GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC
CGCGTTGATA TCGACATTGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC
TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCCC GTCATGCCAC CAGTAAAATC
GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCG
AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC
TGGCAGGCGT ATGCGGAAGG GCGTGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG
GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTACAATA CGCCCGCCCG GCGCAAATTC
ATGCGTACCG AAAAAACGGA ATTTAATCAT ATCGATGAGA TCATCCGTCG TATTGCATTG
GCCCGTTTTG ACGTCACGCT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC
GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG CACGCCGTTT
CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC
GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGCTA TGTGAATGGC
CGCATGATGC GCGACCGCTT GATCAACCAT GCCATTCGCC AGGCCTGTGA AGATAAGCTG
GGCGCGGACC AACAGCCTGC GTTTGTGTTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT
GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCCCGGCT GGTGCACGAC
TTCATCTATC AAGGGGTGCT GAGCGTCCTG CAACAGCAGA CGGAAACGAC GCTGCCGCTG
GAGGAGATTG CGCCAGCGCC GCGGCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC
CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC
GGCGGCGCAT CGGGCGGCAA CGGCGGGCGT CAGTCCGCGG GTGGTTGGCC GCACGCTCAG
CCAGGTTATC AGAAGCAGCA GGGCGAGGTT TATCGCGCGC TTTTACAGAC GCCGGCGACG
AGCCCCGCGC CGGAGCCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTTGGTCGC
GTACTGACGA TAGTCGGCGG TGACTGTGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC
TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGT
CCGGTTTGCG CGCAGCCGTT GCTGATTCCG CTGCGTTTAA AAGTGAGCGC CGATGAAAAA
GCCGCGCTGC AAAAAGCCCA ATCTTTGTTG GGAGAATTGG GTATTGAATT TCAGTCAGAT
GCGCAGCATG TGACCATTCG GGCAGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC
TTGATTCCTG AACTGATAGG CTACCTGGCG CAACAGACCA CATTTGCAAC GGTCAATATT
GCACAATGGA TAGCGCGTAA TGTGCAGAGC GAACATCCGC AGTGGTCGAT GGCGCAGGCC
ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT
GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG 
CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA
WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL
ARFDVTLNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV
ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EEIAPAPRHV PENRIAAGRN
HFAVPAEPTA AREPATPRYS GGASGGNGGR QSAGGWPHAQ PGYQKQQGEV YRALLQTPAT
SPAPEPVAPA LDGHSQSFGR VLTIVGGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS
PVCAQPLLIP LRLKVSADEK AALQKAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI
LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG
GLLQPVDLHS AMNALKHE