Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0102 |
Symbol | mutL |
ID | 4078687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 107657 |
End bp | 109591 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638005389 |
Product | DNA mismatch repair protein |
Protein accession | YP_612097 |
Protein GI | 99079943 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACAC TCGACCCCCA AATCAGCGAA AAACCACCGA TTGAGGCGCC GGCCAAACCC GCGCGCCCCG TGATCCGGCA ACTGGATGAC GGCGCCATCA ACCGCATTGC GGCTGGTGAG GTGGTCGAGC GTCCGGCCTC GGCCGTCAAG GAACTTGTGG AGAACGCCAT CGACGCCGGT GCGACCCGCA TCACGGTAGA GATCGCCGAT GGTGGTAAGA CGCTGATCCG GGTGATCGAC AATGGCTGCG GGATGACACC AGAGGACCTG CCGTTGGCGC TGTCGCGTCA TGCGACTTCC AAGATTGATG GCTCTGATCT GTTGAACATT CACACCTTTG GCTTTCGGGG CGAGGCGCTG CCGAGCCTTG GCGCGGTGGG GCGGCTGGCG ATCACCAGCC GGGCCGAAGG ACATGACGCA GCCCAGATCC GCGTGTCAGG CGGCCATATG GAGCCTGTGA GGCCCGCGGC GCTCCGGCAG GGTAGCATCG TGGAGCTGCG CGATTTGTTT TTTGCAACAC CTGCGCGGCT CAAGTTCATG CGCACCGACC GGGCGGAAAT GCAGGCGATC TCGGACACGG TCAAGCGGCT GGCGATGGCG GAGCCTTCAG TGGGCTTCAC CTTGCGCGAT GTCTCAGGTG GCGGAGAGGG CCGCGTGACC TTCCGCGCAG ATCCCATGAA CGGCGATCTC TTTGATGCGT TGCACGGTCG GCTGGCGCAT GTCATCGGGC GTGAGTTCGC CGAGAACGCC CTGAAAATCG ATGCCACGCG CGAGGGTATC CGGCTTTATG GCTATGCGGC CCTGCCGACC TATTCGCGCG GTGCGGCAGT GACGCAGTTC CTGTTTGTGA ATGCGCGTCC GGTAAAGGAC AAGATGCTCA CAGGGGCGCT GCGGGCGGCC TATATGGATT TCCTCAGCCG GGATCGCCAC CCGGCGGCGG CCCTCTTTAT CGACTGTGAC CCGACGCTGG TGGACGTGAA CGTACATCCG GCAAAATCCG AGGTGCGCTT CCGTGATCCG GGGCTGGCGC GCGGGCTGAT CGTCTCGGCG CTGCGCCACG CGCTCGCCGA GGCCGGCCAT CGTGCCTCCA GCACCGTGGC GGGCGCGACC CTGGGGGCGA TGCGACCGGA ACAGCCGACT GCGACAGGGG CGCCGAGGGT GTATCAGATG GACCGGCCGT CCTTGGGCGC ACGGCACAGC GCCTATGCGG CGCAGACCCC CGCTCAGCCG CAGGCACCGT CCTATGCTGC GCCGCCTCCC TCTGATGCGG TAGGGTTTGC AGAGTTCTCA GGCACTTACA GTGGCCGTCT GGTCGAAGAG ACACCTCTTG AGGAGGCTCA GCCCCCCGCC GAGGATCAGC CCCTTGGCGC GGCACGCGGG CAGGTGCATG AGAATTATAT CATCGCCCAG ACCCGCGACG GGATGGTGAT CGTCGATCAG CACGCCGCCC ATGAACGGCT CGTTTATGAG CGGCTCAAAC GTCAGTTGGC CGAAAATGGC GTCGCGACCC AAGGTCTGCT GATCCCAGAA ATCATCGAAC TCTCGGATGG GGATTGTGCG CGCCTGCTGG AAGTTGCCGA GGATCTTGCC AGGTTGGGAC TGGGCATTGA GGCCTTTGGC GGCAGCGCCG TGGCTGTGCG AGAAACGCCC GCCATTCTGG GCGAGGTCAA TGCCGAGGCT ATGATCCGCG ACATTCTGGA TGAGCTGGCG GATCAGGGTG AGAGCCAGCT GGTGCAGGCG CGCCTTGAGG CAATCCTGTC GCGGGTGGCC TGTCATGGCT CGATCCGTTC CGGACGGCGC ATGCGCGGCG AGGAAATGAA CGCGCTCCTG CGGGAAATGG AACAGACGCC CCATTCCGGC CAGTGCAATC ACGGCAGACC CACCTATGTG GAGCTCAAAC TCGCGGATAT CGAGCGCCTC TTTGGGCGCA GCTAA
|
Protein sequence | MATLDPQISE KPPIEAPAKP ARPVIRQLDD GAINRIAAGE VVERPASAVK ELVENAIDAG ATRITVEIAD GGKTLIRVID NGCGMTPEDL PLALSRHATS KIDGSDLLNI HTFGFRGEAL PSLGAVGRLA ITSRAEGHDA AQIRVSGGHM EPVRPAALRQ GSIVELRDLF FATPARLKFM RTDRAEMQAI SDTVKRLAMA EPSVGFTLRD VSGGGEGRVT FRADPMNGDL FDALHGRLAH VIGREFAENA LKIDATREGI RLYGYAALPT YSRGAAVTQF LFVNARPVKD KMLTGALRAA YMDFLSRDRH PAAALFIDCD PTLVDVNVHP AKSEVRFRDP GLARGLIVSA LRHALAEAGH RASSTVAGAT LGAMRPEQPT ATGAPRVYQM DRPSLGARHS AYAAQTPAQP QAPSYAAPPP SDAVGFAEFS GTYSGRLVEE TPLEEAQPPA EDQPLGAARG QVHENYIIAQ TRDGMVIVDQ HAAHERLVYE RLKRQLAENG VATQGLLIPE IIELSDGDCA RLLEVAEDLA RLGLGIEAFG GSAVAVRETP AILGEVNAEA MIRDILDELA DQGESQLVQA RLEAILSRVA CHGSIRSGRR MRGEEMNALL REMEQTPHSG QCNHGRPTYV ELKLADIERL FGRS
|
| |