Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2758 |
Symbol | mutL |
ID | 4898056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2899154 |
End bp | 2901004 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640113360 |
Product | DNA mismatch repair protein |
Protein accession | YP_001044632 |
Protein GI | 126463518 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCC TCAGCCCCAA GATAGGCGCC GCCCGCCCGG TGATCCGGCA GCTCGACGAA GCCGCCATCA ACCGCATCGC CGCGGGCGAG GTGGTCGAGC GGCCGGCCTC GGCGGTGAAG GAACTGGTCG AGAATGCGCT CGACGCGGGC GCCCGGCGCA TCGCCGTGGA CATCGCCTGC GGCGGCAAGA CCCTGATCCG CGTCACCGAT GACGGCTGCG GCATGACGGC CGAGGACCTG CCGCTCGCGC TCTCGCGCCA TGCCACCTCG AAGATCGACG GGTCCGACCT TCTCGACATC CGCAGCTTCG GCTTCCGCGG CGAGGCGCTG CCTTCGCTGG CGGCCGTGGG GCGGCTCACC ATCACCTCGC GGGTGACCGA GGGAGAGGGC GCGCAGATCG CCGTCAGCGC GGGCCGGATC GAGCCGGTGC GTCCGGCGGC GCTCGGGGCG GGCACGGTGG TCGAACTGCG CGACCTCTTC TTCGCCACGC CCGCGCGGCT CAAGTTCCTG CGCACCGACC GCGCCGAGAC CCAGGCCATC GCCGAGGTGG TGCGCCGTCT GGCGCTGGCC GAGCCCGAGG TGGGCTTCAC CCTCACCTAC CATTCTGCGG GCGAGCCGCG GCTTCTGTTT CGCGCCGAGG CCGAAGGGGG CGATCTGTTC GACGCCCTCC ACCGCCGCGT GGCGCGGGTG GTGGGGGCGG AGTTTGCCGA GAATGCGCTC CGGATCGATG TGGCGCGCGA AGGGCTGCGC CTGCAGGGCT ATGCCGCGCT GCCGACCTAT TCCCGCGGCT CGGGCGTGGC GCAGTTTCTG TTCGTGAACG GCCGCCCGGT GCTCGACCGG ATGCTGCTCG GCGCGCTCCG GGCAGGCTAC ATGGATGTGC TGAGCCGCGA CCGCTATCCG GCGGCGGTGC TGAACCTGAT CTGCGATCCG CAGCGGGTCG ACGTGAACGT GCATCCGGCC AAGGCCGAGG TGCGCTTCCG CGAGGCGGGC GAGGTGCGCG GGCTGATCGT CACCGCGCTG CGTCAGGCGC TGGCGGGGGC GGGGCACCGG GCCTCGACCA CCGTCGCGGG CGAGACGCTC GAGGCCTTCC GGCCCGAGAT GCCTGCGGCG GCGAGCCCGG CCCCCGCGAC GCGGATCTAT CAGATGGACC GGCCCTCGGC CGCCGCGGTG GCGCGCAGCT TCGCCTTTCA GGCGCCCGAG CCCGCGATGC CCGGTCTGGC CGAGGCGCCC GCCGCGAGGG TCGAAGCGCC GGTGGCCGAG GAGGCCCACG ACCGCCCTCT CGGCGCCGCG CGGGCGCAGA TCCACGGCAA CTGGATCCTT GCCCAGACGG CGAGCGGCCT CGTGATCGTG GACCAGCATG CCGCACACGA ACGGCTGGTC TACGAGAAGC TCAAGCGCCA GCGCGACGAG ACGGGGATCG CCCGGCAGGC GCTGCTGATC CCCGAAATCG TCGAACTGTC GCCCACCGAT GCCGCCCGGC TGCTGGAGGC CGCGGACGAG CTGGCCTCCG CGGGCCTCGT GATCGAGCCG TTCGGCGGCG GTGCCGTCGC GGTGCGTGAG GTGCCGGCGA TCCTCGGCAA GGTCGAGGCC GCGCCGCTCC TGCGCGACAT CCTCGACGAT CTGGCCGATC TCGGCAGCTC GGACCGGCTT CAGGCCCGGA TGGATGCGGT CCTCTCGCGC ATGGCCTGCC ACGGATCGGT GCGCTCGGGC CGGGCGCTGA GGGCCGAAGA GATGAACGCG CTTCTGCGCG AGATGGAGGC CACGCCGCTC TCGGGCCAAT GCAACCACGG CCGGCCCACC TATGTCGAGC TGAAGCTGGC CGACATCGAG CGGCTCTTCG GCCGGCGATG A
|
Protein sequence | MTLLSPKIGA ARPVIRQLDE AAINRIAAGE VVERPASAVK ELVENALDAG ARRIAVDIAC GGKTLIRVTD DGCGMTAEDL PLALSRHATS KIDGSDLLDI RSFGFRGEAL PSLAAVGRLT ITSRVTEGEG AQIAVSAGRI EPVRPAALGA GTVVELRDLF FATPARLKFL RTDRAETQAI AEVVRRLALA EPEVGFTLTY HSAGEPRLLF RAEAEGGDLF DALHRRVARV VGAEFAENAL RIDVAREGLR LQGYAALPTY SRGSGVAQFL FVNGRPVLDR MLLGALRAGY MDVLSRDRYP AAVLNLICDP QRVDVNVHPA KAEVRFREAG EVRGLIVTAL RQALAGAGHR ASTTVAGETL EAFRPEMPAA ASPAPATRIY QMDRPSAAAV ARSFAFQAPE PAMPGLAEAP AARVEAPVAE EAHDRPLGAA RAQIHGNWIL AQTASGLVIV DQHAAHERLV YEKLKRQRDE TGIARQALLI PEIVELSPTD AARLLEAADE LASAGLVIEP FGGGAVAVRE VPAILGKVEA APLLRDILDD LADLGSSDRL QARMDAVLSR MACHGSVRSG RALRAEEMNA LLREMEATPL SGQCNHGRPT YVELKLADIE RLFGRR
|
| |