Gene SNSL254_A4720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4720 
SymbolmutL 
ID6484080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4600870 
End bp4602726 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content58% 
IMG OID642739936 
ProductDNA mismatch repair protein 
Protein accessionYP_002043614 
Protein GI194445394 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTC AGGTTCTGCC GCCGCAGCTT GCGAACCAAA TCGCCGCTGG CGAAGTGGTG 
GAACGCCCTG CGTCGGTTGT TAAAGAGCTG GTAGAGAATA GTCTGGATGC AGGCGCCACC
CGCGTTGATA TCGACATCGA GCGTGGCGGC GCGAAGCTTA TTCGTATTCG CGACAATGGC
TGCGGCATTA AAAAAGAGGA GCTGGCGCTG GCGCTGGCCC GTCATGCCAC CAGTAAAATC
GCCTCGCTTG ACGATCTGGA AGCGATTATC AGTCTGGGAT TTCGCGGCGA AGCGCTGGCC
AGTATCAGTT CGGTCTCGCG TTTGACGCTA ACGTCGCGCA CGGCGGAGCA GGCGGAAGCC
TGGCAGGCGT ATGCGGAAGG GCGCGACATG GACGTGACGG TAAAACCCGC CGCGCACCCG
GTCGGCACCA CCCTGGAAGT TCTGGATCTC TTTTATAATA CGCCCGCCCG GCGCAAATTC
ATGCGTACCG AAAAAACGGA ATTTAATCAC ATCGATGAGA TCATCCGTCG TATTGCATTG
GCCCGTTTTG ACGTCACTTT TAACCTGTCG CACAACGGCA AATTGGTACG GCAGTATCGC
GCTGTCGCAA AGGACGGGCA AAAAGAGCGC CGGTTAGGCG CCATCTGCGG CACGCCGTTT
CTCGAACAGG CACTGGCGAT CGAGTGGCAG CATGGCGATC TGACCCTGCG CGGCTGGGTC
GCCGATCCGA ATCACACCAC CACGGCGTTA ACGGAGATCC AGTACTGTTA TGTGAATGGC
CGCATGATGC GCGACCGCTT GATCAACCAT GCCATTCGCC AGGCCTGTGA AGATAAACTG
GGCGCGGACC AACAGCCTGC GTTTGTGTTG TATCTGGAGA TTGACCCGCA TCAGGTGGAT
GTCAATGTTC ATCCCGCCAA GCACGAAGTG CGTTTTCATC AATCGCGGCT GGTGCACGAC
TTCATCTATC AAGGGGTGCT GAGCGTTCTA CAACAGCAGA CGGAAACGAC GCTGCCGCTG
GAGGAGATTG CGCCAGCGCC GCGGCATGTC CCGGAAAACC GTATCGCCGC CGGGCGCAAC
CATTTTGCTG TACCCGCCGA GCCAACTGCG GCGCGCGAGC CCGCGACACC GCGTTATTCC
GGCGGCGCAT CGGGCGGTAA CGGCGGGCGT CAGTCCGCAG GTGGTTGGCC GCACGCGCAG
CCAGGTTATC AGAAGCAGCA GGGCGAGGTT TATCGCGCGC TTTTACAGAC GCCGACGACG
AGCCCCGCGC CGGAGCCGGT TGCGCCTGCG CTTGACGGAC ATAGCCAGAG TTTCGGTCGC
GTACTGACAA TAGTCGGCGG TGACTGCGCG TTGCTGGAAC ACGCGGGGAC TATCCAGCTC
TTGTCGCTGC CGGTTGCGGA GCGTTGGCTG CGTCAGGCGC AGCTTACACC GGGTCAAAGC
CCGGTTTGCG CGCAGCCGTT GCTGATTCCG CTGCGTTTAA AAGTGAGCGC CGATGAAAAA
GACGCGCTGC AACAAGCCCA ATCTTTATTG GGAGAATTGG GTATTGAATT TCAGTCAGAT
GCGCAGCATG TGACCATTCG GGCGGTGCCT TTACCCTTAC GACAACAAAA TTTACAAATC
TTGATTCCTG AACTGATAGG CTACCTGGCG CAACAGACCA CATTTGCAAC GGTCAATATT
GCACAATGGA TAGCGCGTAA TGTGCAGAGC GAACATCCGC AGTGGTCGAT GGCGCAGGCC
ATATCGCTGC TGGCGGATGT TGAGCGGCTA TGTCCGCAGC TGGTAAAAGC GCCGCCGGGT
GGCCTGTTAC AACCTGTTGA TTTACATTCG GCGATGAACG CCCTGAAGCA TGAATGA
 
Protein sequence
MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RVDIDIERGG AKLIRIRDNG 
CGIKKEELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQAEA
WQAYAEGRDM DVTVKPAAHP VGTTLEVLDL FYNTPARRKF MRTEKTEFNH IDEIIRRIAL
ARFDVTFNLS HNGKLVRQYR AVAKDGQKER RLGAICGTPF LEQALAIEWQ HGDLTLRGWV
ADPNHTTTAL TEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD
VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQTETTLPL EEIAPAPRHV PENRIAAGRN
HFAVPAEPTA AREPATPRYS GGASGGNGGR QSAGGWPHAQ PGYQKQQGEV YRALLQTPTT
SPAPEPVAPA LDGHSQSFGR VLTIVGGDCA LLEHAGTIQL LSLPVAERWL RQAQLTPGQS
PVCAQPLLIP LRLKVSADEK DALQQAQSLL GELGIEFQSD AQHVTIRAVP LPLRQQNLQI
LIPELIGYLA QQTTFATVNI AQWIARNVQS EHPQWSMAQA ISLLADVERL CPQLVKAPPG
GLLQPVDLHS AMNALKHE