Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0699 |
Symbol | |
ID | 3747473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 970440 |
End bp | 972821 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637773233 |
Product | DNA mismatch repair protein MutS-like |
Protein accession | YP_379013 |
Protein GI | 78188675 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCCT CTACGCTAAA AAAACTTGAA TTTACGAAAA TAGCAGCTTA TGCAGCGCAA TTGTGCCTTT CGCCTATGGG GCGCGACCGT TTGCTCAATG CTCGTCCGTT GCGTGAGCGT GAGGCGCTGA TGGCGGAATT GGAGCGCGTG CTTGAGTTGC GCATGTTGTT GCAAGAGGGG CTAACCTTGC CCTTTTCCCA CCTGCCCGAC ACTCGCGTGC TCTTAAAAAA GCTGGAGATT GAGCACCTTG CGCTTGAACC GCTTGAGTTG CTTGATCTCT ATCATTTGCT CTACTCATCG GTGCAGTTGC GCCGTTTTAT GTATGGTAAT CGTGAGCGTT ATGGTCGCTT GAACGATCTT ACCATTATGC TCTGGATGGA GCGAAGCTTG CAAGCAATGA TTCAACGCTG TGTGGATGAG CGTGGCTTGG TGCGCGATAG TGCCAGCGAT GGGCTGTTGC TGATTCGCCA TGATCTTGCT GAAAGTCGGG AGTTGTTGCG CCGCCGCATG GAGCGTTTGC TGCGCCGTGC AAGTGCAAAT GGTTGGTTGA TGGAGGAGAC GGTTGCGGTA AAAAATGGGC GTTTAACTTT AGCCTTAAAG GTTGAGTACA AGTATAAAAT CCCCGGTTAC ATTCAAGATT ATTCAGGCAC GGGGCAGACG GTTTTTATTG AGCCTGCCGA AACGTTGGAA ACCAGCAACC GCATTCAAGA TTTAGAGATT AGCGAGCGGC GCGAGGTGGA GCGCATTTTG CAAGAGGTGA GTGCGGCGTT GCGCGGCGAG CTTGAAAATA TTCACCACAA TCAACAATTG ATGGCTGAGT TTGATGCGCT TTACGCCCGT GCTCGCTTTG CGGTTGAAAC CAACGCCGTG CTGCCTACTG TTACGGAGGG CAACGAGTTG CGCTTAATAA AAGCTTACCA TCCATGGCTT TTGCTCTCGC ACCGTGAGCG CACGGTGCAG CCGCTTGACC TCCATTTAAG CGCTGAAGAG CAGGTTTTAG TGATTTCGGG ACCCAATGCG GGTGGCAAGT CGGTTACTAT GAAAAGCGTG GGATTGCTCT GCTGTATGCT TGTGCATGGC TACCTTCTCC CCTGTAGCGA AAGCTCCTGC ATTCCTCTCT TTAACAATAT TTTTATTGAA ATTGGCGACG ACCAATCCAT TGAGCACGAC CTTTCCACCT TTAGTTCCCA CCTCAGCGCC ATTCGCTCTA TTCTTGAGCG GGCAGGCACG CGCGATTTAG TTTTAATTGA TGAATTGTGC GGCGGCACCG ACGTTGAAGA GGGCGGAGCA ATTGCACGTG CGGTGATTGA AGAGCTTTTG GCATCAGTGG CAAAAAGCAT TGTAACTACG CACCTTGGCG ACCTTAAAGC CTATGCACAC CAGCGCGACG GGGTGGTGAA TGGCGCTATG GCGTTTGACC GTGCTGAGCT GCAACCAACC TTCCGTTTTA TTAAAGGATT GCCCGGCAAC AGCTTTGCCT TTGCCATGAT GCAACGCATG GGCTTTTCGC CCGCTTTGGT GGAGCGAGCA CGCCACTTTA TGGCGCACGA ACGCATTGGC TTAGAGCAAA TGGTGGACGA TTTGAGCCAT ATTATGGAGG AGCAACAACG CCAACGCCAG CAGCTTGACG ACGAGCAACG CACCTTTGCA GAGCGCGAAC GCACGGTGCT GGAGGTTGAA GCGACCCTAA AGCAGCAACA ACGCGAGTTA AAACAACAAA TTTCACGCGC CGTGCAAAAA GAGGTGGAAC ATGCCCGCAA AGAGATTCGC GCCATTGTGC AAGAGGTAAA AGCGGCGCCG ACCAATCCGC AAGTGGTTCA AGCTGCTCGC GAAAAGCTTG GCATCAAGCG TCAAGAGGTT GAAGAGCGCC ATACCACCGC TGCACCCACA ACCGCAAGCG AGCCAACCAT TGATCGCACC ATCACCATTG GCGACATGGT GCGCTTGCTT GACACCAACG CCACGGGCGA AGTTGAACGC TTTAACGGTG ATAACGTGGT GGTACGCTGC GGAACCATTC GTTTGCAGAC GCATCTAAAG AATTTGGAAA AAAGCTCCAA AACCAAAGCA CGCACCGCAC AGCGCGACAC CTCGAATAGC AAGGTACGCT CATGGTCAAC CGTTACAAAC GAGGTCAGCT CAACGCAGCT TGATGTACGA GGCATGAGCG GCAACGAAGC CGTCCCCCAT ATTGAGCGCT TTCTTGATAC ATTGCGCCTG CACCGCATTC ACTTTGCCAC CATTTTGCAC GGCAAAGGCA CAGGCTCACT CCGCAAACGC ACTGCCGAAT GCTTAAAATT GCACACTGCC GTTAAAAGCT TTCGTTTAGG GGGATTAGGG GAAGGTGGGG ATGGGGTTAC GATTGTGGAG TTGGGGGAGT GA
|
Protein sequence | MNPSTLKKLE FTKIAAYAAQ LCLSPMGRDR LLNARPLRER EALMAELERV LELRMLLQEG LTLPFSHLPD TRVLLKKLEI EHLALEPLEL LDLYHLLYSS VQLRRFMYGN RERYGRLNDL TIMLWMERSL QAMIQRCVDE RGLVRDSASD GLLLIRHDLA ESRELLRRRM ERLLRRASAN GWLMEETVAV KNGRLTLALK VEYKYKIPGY IQDYSGTGQT VFIEPAETLE TSNRIQDLEI SERREVERIL QEVSAALRGE LENIHHNQQL MAEFDALYAR ARFAVETNAV LPTVTEGNEL RLIKAYHPWL LLSHRERTVQ PLDLHLSAEE QVLVISGPNA GGKSVTMKSV GLLCCMLVHG YLLPCSESSC IPLFNNIFIE IGDDQSIEHD LSTFSSHLSA IRSILERAGT RDLVLIDELC GGTDVEEGGA IARAVIEELL ASVAKSIVTT HLGDLKAYAH QRDGVVNGAM AFDRAELQPT FRFIKGLPGN SFAFAMMQRM GFSPALVERA RHFMAHERIG LEQMVDDLSH IMEEQQRQRQ QLDDEQRTFA ERERTVLEVE ATLKQQQREL KQQISRAVQK EVEHARKEIR AIVQEVKAAP TNPQVVQAAR EKLGIKRQEV EERHTTAAPT TASEPTIDRT ITIGDMVRLL DTNATGEVER FNGDNVVVRC GTIRLQTHLK NLEKSSKTKA RTAQRDTSNS KVRSWSTVTN EVSSTQLDVR GMSGNEAVPH IERFLDTLRL HRIHFATILH GKGTGSLRKR TAECLKLHTA VKSFRLGGLG EGGDGVTIVE LGE
|
| |