Gene SeHA_C4777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4777 
SymbolmutL 
ID6490107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4655633 
End bp4657489 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content58% 
IMG OID642744829 
ProductDNA mismatch repair protein 
Protein accessionYP_002048402 
Protein GI194450332 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00171491 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.0289843 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG 
GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC
CGCGTTGATA TCGACATCGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC
TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCCC GTCATGCCAC CAGTAAAATC
GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCC
AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC
TGGCAGGCGT ATGCGGAAGG GCGCGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG
GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTATAATA CGCCCGCCCG GCGCAAATTC
ATGCGTACCG AAAAAACGGA ATTTAATCAC ATCGATGAGA TCATCCGTCG TATTGCATTG
GCCCGTTTTG ACGTCACGCT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC
GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG CACGCCGTTT
CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC
GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGCTA TGTGAATGGC
CGCATGATGC GCGACCGCTT GATCAACCAT GCCATTCGCC AGGCCTGTGA AGATAAACTG
GGCGCGGACC AACAGCCTGC GTTTGTGCTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT
GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCCCGGCT GGTGCACGAC
TTCATCTATC AAGGGGTGCT GAGCGTTCTG CAACAGCAGA CGGAAACGAC GCTGCCGCTG
GAGGACATTG CGCCAGCGCC GCGCCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC
CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC
GGCGGCGCAT CGGGCGGTAA CGGCGGGCGT CAGTCCGCAG GTGGTTGGCC GCACGCGCAG
CCAGGTTATC AGAAGCAGCA GGGCGAGGTT TATCGCGCGC TTTTACAGAC GCCGACGACG
AGCCCCGCGC CGGAGGCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTCGGTCGC
GTACTGACGA TAGTCTGCGG TGACTGCGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC
TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGT
CCGGTTTGCG CGCAGCCGTT GCTGATTCCA CTGCGTTTAA AAGTGAGCGC CGATGAAAAA
GCCGCGCTGC AAAAAGCCCA ATCTTTGTTG GGAGAATTGG GTATTGAATT TCAGTCAGAT
GCGCAGCATG TGACCATTCG GGCGGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC
TTGATTCCTG AACTGATAGG CTACCTGGCG CAACAGACCA CATTTGCAAC GGTCAATATT
GCACAATGGA TAGCGCGTAA TGTGCAGAGT GAACATCCGC AGTGGTCGAT GGCGCAGGCC
ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT
GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG 
CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA
WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL
ARFDVTLNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV
ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EDIAPAPRHV PENRIAAGRN
HFAVPAEPTA AREPATPRYS GGASGGNGGR QSAGGWPHAQ PGYQKQQGEV YRALLQTPTT
SPAPEAVAPA LDGHSQSFGR VLTIVCGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS
PVCAQPLLIP LRLKVSADEK AALQKAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI
LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG
GLLQPVDLHS AMNALKHE