Gene Smed_0428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0428 
SymbolmutL 
ID5321262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp461286 
End bp463103 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content66% 
IMG OID640789363 
ProductDNA mismatch repair protein 
Protein accessionYP_001326120 
Protein GI150395653 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.329312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATAA AGCAGCTTTC CGAAACGCTC ATCAATCAGA TTGCCGCCGG CGAAGTCATC 
GAACGGCCGG CAAGCGCCGC GAAGGAACTG ATCGAGAATG CGCTCGACGC CGGCGCGACG
AGGATCGAGA TCGCGACGGC GGGGGGCGGC AAGACTCTGC TCAGGGTCAC CGATAACGGT
CTTGGAATGT CGCCGGCCGA TCTCGAACTG GCGATCCGCC GCCATTGCAC CTCCAAGCTC
GACGGCAGCC TCGCCGACAT CCGCACGCTT GGTTTTCGCG GCGAGGCGCT GCCCTCGATC
GGCTCGGTAG CGCGACTTTC GATCACCACG AGGACGGCAG AGGCGCGAGA AGGGGCAACG
ATCACGATCA CCGGCGGCAG AAGCGACCCT GTCCGCCCCT CGGCGGCCGT TGTCGGTACC
GTGGTCGAGG TGCGCGAGCT CTTCTTCGCG ACACCTGCAC GGCTGAAGTT CATGAAGTCT
GAAAGGGCCG AGACGGCCGC GATCTCGGAA GTCGTCCGGC GCATGGCCAT CGCCTTTCCG
AAAGTGCGCT TCGTGCTTTC GGGCTCCGAT CGCTCGGCAC TCGAGTTCCC GGCAACCGGA
GACGATCGGC TGGCGCGGAT GGCGCAGGTG CTCGGCAGGG ACTTTCGCGA CAATGCCATC
GAGATCGATG CCGAGCGCGA GGGTGCGCGG CTTACCGGGT TCGCCGGCGT GCCGACTTTC
AATCGCGGCA ACTCGCTTCA GCAATATGCC TTCGTCAACG GACGCCCGGT GCAGGACAAG
CTGATCATGT CCGCCATACG CGCCGCCTAT GCCGAAACCG TACCGCAGGG GCGCTATCCT
GTCGCGGTGC TGTCGCTTAC GATCGATCCG GCCCTTGTCG ACGTCAACGT TCATCCGGCG
AAATCGGATG TGCGCTTCCG CGACCCCGGC CTGATCCGCG GCCTCATCAT AGGCGCAATC
CGCGAGGCAC TGATGCGTGA GGGAGACCGG GCCGCGACGA CCGGGGCGCA AGGTTTGATG
CGCGCCTTCC GCCCTGAATT CCACAGGGGC GACCAGCAAC GGCCGCAGGA ACCCTGGACG
GCCGCGACCT CCCCCTATCG ACCGTTCAGC CCTGGCGGGG CGGCCCGCGG TTTTGCCGAG
ACGCCTCAGG CGGCGTTCTC GGATTTCGCA CAGCCGTCTG CGCGCAACGC TGCCGTCCCG
GTCGATAGCA TACAGGCCGC CGATGGACAG GCGGCCTCCT TCCCGCTTGG TGCTGCCCGC
GCTCAGCTTC ACGAAAACTA TATCGTCGCA CAGACCGACG ACGGGCTCGT GATCGTCGAC
CAGCACGCCG CGCATGAGCG TCTCGTCTTC GAAACGATGC GAACGGCGCT CCACGCCCGC
CCCGTACCGG CGCAGGCGCT GCTGATCCCC GAGATCGTCG ACCTGCCGGA GGAGGATTGC
GACAGGCTGG TTGCGCATGC CGGCGAATTC ACGCGGCTGG GTCTCGCCAT CGAACGCTTC
GGCCCCGCAG CGATCGCGGT CCGCGAAACA CCGGCGATGC TCGGCGAGAT GGATGCCGCC
GGACTGGTGC GGCAGCTTGC CGACGAGCTT GCGGAATGGG ACACAGCCGA CGGCCTTGCC
GGGCGGCTCG AATATCTGGC CGCCACCATG GCCTGTCACG GTTCCGTGCG CTCGGGCCGC
CGCTTGCGCA CCGAGGAAAT GAACGCTCTG CTTCGTCGGA TGGAGGCCAC GCCCGGCTCC
GGGCAATGCA ACCACGGCCG GCCGACCTAT ATCGAACTCA AGCTCGCCGA CATCGAACGG
CTCTTCGGGC GGAGCTGA
 
Protein sequence
MAIKQLSETL INQIAAGEVI ERPASAAKEL IENALDAGAT RIEIATAGGG KTLLRVTDNG 
LGMSPADLEL AIRRHCTSKL DGSLADIRTL GFRGEALPSI GSVARLSITT RTAEAREGAT
ITITGGRSDP VRPSAAVVGT VVEVRELFFA TPARLKFMKS ERAETAAISE VVRRMAIAFP
KVRFVLSGSD RSALEFPATG DDRLARMAQV LGRDFRDNAI EIDAEREGAR LTGFAGVPTF
NRGNSLQQYA FVNGRPVQDK LIMSAIRAAY AETVPQGRYP VAVLSLTIDP ALVDVNVHPA
KSDVRFRDPG LIRGLIIGAI REALMREGDR AATTGAQGLM RAFRPEFHRG DQQRPQEPWT
AATSPYRPFS PGGAARGFAE TPQAAFSDFA QPSARNAAVP VDSIQAADGQ AASFPLGAAR
AQLHENYIVA QTDDGLVIVD QHAAHERLVF ETMRTALHAR PVPAQALLIP EIVDLPEEDC
DRLVAHAGEF TRLGLAIERF GPAAIAVRET PAMLGEMDAA GLVRQLADEL AEWDTADGLA
GRLEYLAATM ACHGSVRSGR RLRTEEMNAL LRRMEATPGS GQCNHGRPTY IELKLADIER
LFGRS