Gene Information |
Locus tag | Cag_0145 |
Symbol | |
ID | 3747191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 162849 |
End bp | 164783 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637772672 |
Product | DNA mismatch repair protein |
Protein accession | YP_378466 |
Protein GI | 78188128 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
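The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 162849, End bp 164783
length = gene_length_bp(162849, 164783)
print(length)                     # 1935, matching "Gene Length"
print(protein_length_aa(length))  # 644, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.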
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTATAA TTACCCGATT ACCTGATAGC GTTGCTAATA AAATATCCGC AGGCGAAGTG GTGCAGCGCC CGGCATCGGT AGTTAAAGAG CTGCTTGAAA ATGCTATTGA TGCTGGCGCT ACAAAAATTA GCGTTACTAT TAAAGATGCA GGTAAGGAGC TGATTCGCAT TGCCGATAAT GGCGTTGGTA TGAATCGCGA TGATGCTTTG CTTTGCGTGG AGCGTTTTGC CACCAGTAAA ATTAAAAGTG CAGATGATCT TGATGCGTTG CATACGTTGG GTTTTCGTGG GGAGGCGTTA GCAAGTATTT GTTCGGTCTC TCATTTTGAG CTTAAAACTC GTCAAGCCGA TGCAACGCTT GGCTTGCTGT TTCGCTACGA TGGTGGCTCG TTGGTTGAAG AGTTGGAGGT GCAAGCGGAG CAAGGTACCA GCTTTAGTGT GCGCAATCTT TTTTATAACG TGCCTGCTCG TCGTAAGTTT TTAAAGTCGA ATGCTACCGA GTATCATCAT CTTTTTGAGA TTGTAAAATC TTTTACGCTG GCTTATCCCG AAATTGAGTG GCGTATGGTG AATGATGATG AGGAGCTGTT CAACTTTAAA AACAATGATG TTCTTGAGCG GCTCAATTTT TATTATGGCG ATGATTTTGC AAGCAGCTTA ATTGAGGTTG CTGAGCAAAA CGATTATTTG CCTATTCACG GCTATCTTGG CAAGCCTGCG TTACAAAAAA AGCGCAAGTT GGAGCAATAC TTTTTTATTA ATCGTCGCCT TGTGCAAAAT CGGATGTTGT TGCAAGCGGT GCAGCAGGCG TATGGTGATT TGCTTGTTGA GCGTCAAACA CCGTTTGTGT TGCTTTTTCT GACGATTGAT CCTTCGCGTA TTGATGTGAA TGTGCATCCC GCTAAGCTTG AAATTCGTTT TGATGATGAG CGGCAGGTGC GCTCCATGTT TTATCCCGTT ATAAAGCGAG CGGTGCAGTT GCACGACTTT TCGACCAATA TTTCCGTTAT CGAACCTTTT GCATCGGCTT CTGAACCATT TGTGGGCTCA TCTTCCCAAC CAATATTTTC ATCTACCTCA AGCCAAGCGC CCCGTATGGG TGGGGGAAGT CGTCGTTTTG ATTTGAGTGA TGCGCCTGAG CGTGCAATCA CTAAAAATGA GCTGTATCGC AATTATCGTG AAGGAGCTTT TTCGTCGCCC TCGGTAGCTT CATATGATGC GCCATCTCCA TTGCAACAGG GTGGATTGTT TGCGTTGGCA TCGGCTGAAG AGAGTTTGTT TGGTGCGCAA GCGGTGCATG AGGCAAGCGA AAACATTGAG GCGTTCCAGC TTTCGCCGCT TGACAACATT GTTGAGCATA AAGAGGTTGA GCCAAAAATC TGGCAGTTGC ATAACAAATA TCTTATATGT CAGATTAAAA CGGGGTTAAT GATTATTGAC CAGCATGTGG CGCATGAGCG TGTGCTTTAT GAGCGAGCGT TGGAGGTGAT GCAGCAAAAT GTGCCAAATG CGCAGCAATT GCTTTTTCCG CAAAAAGTGG AGTTTCGTGC TTGGGAGTAT GAAGTGTTTG AGGAGATTCG TGATGACCTT TATCGCCTTG GCTTTAATGT GCGTTTGTTT GGCAACCGCA CGGTGATGAT TGAGGGGGTG CCGCAAGATG TGAAGTCGGG GAGTGAGGTT ACTATTTTGC AGGATATGAT TACGCAATAT CAAGAAAATG CTACCAAGCT GAAGTTGGAG CGGCGCGATA ATTTAGCAAA GTCCTACTCC TGCCGTAATG CCATTATGAC GGGGCAGAAG 
CTTTCGATGG AGGAGATGCG TTCGTTGATT GATAATCTTT TTGCAACACG AGAGCCTTAC ACCTGCCCAC ACGGACGTCC AATTATCATC AAGTTATCGC TTGATCAGCT TGATAAAATG TTTGGGAGGA AGTAA
|
Protein sequence | MPIITRLPDS VANKISAGEV VQRPASVVKE LLENAIDAGA TKISVTIKDA GKELIRIADN GVGMNRDDAL LCVERFATSK IKSADDLDAL HTLGFRGEAL ASICSVSHFE LKTRQADATL GLLFRYDGGS LVEELEVQAE QGTSFSVRNL FYNVPARRKF LKSNATEYHH LFEIVKSFTL AYPEIEWRMV NDDEELFNFK NNDVLERLNF YYGDDFASSL IEVAEQNDYL PIHGYLGKPA LQKKRKLEQY FFINRRLVQN RMLLQAVQQA YGDLLVERQT PFVLLFLTID PSRIDVNVHP AKLEIRFDDE RQVRSMFYPV IKRAVQLHDF STNISVIEPF ASASEPFVGS SSQPIFSSTS SQAPRMGGGS RRFDLSDAPE RAITKNELYR NYREGAFSSP SVASYDAPSP LQQGGLFALA SAEESLFGAQ AVHEASENIE AFQLSPLDNI VEHKEVEPKI WQLHNKYLIC QIKTGLMIID QHVAHERVLY ERALEVMQQN VPNAQQLLFP QKVEFRAWEY EVFEEIRDDL YRLGFNVRLF GNRTVMIEGV PQDVKSGSEV TILQDMITQY QENATKLKLE RRDNLAKSYS CRNAIMTGQK LSMEEMRSLI DNLFATREPY TCPHGRPIII KLSLDQLDKM FGRK
|
| |
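The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to check this, assuming the sequences are pasted as plain strings with the 10-character spacing shown in the record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and alternative start codons permitted by table 11 are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon assignments in the conventional TCAG order. Table 11 uses the same
# amino-acid assignments as the standard code and differs only in which
# codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
print(translate("ATGCCTATAA TTACCCGATT ACCTGATAGC"))  # MPIITRLPDS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to both) would confirm that the two fields are consistent; `gc_content` on the full gene sequence can be compared with the reported 45%.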