Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0668 |
Symbol | |
ID | 4569822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 761185 |
End bp | 763569 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639765266 |
Product | Smr protein/MutS2 |
Protein accession | YP_911147 |
Protein GI | 119356503 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.529959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCCG TCAGTCTGAA AAAACTTGAA TTCGATAAAG TAGCCAATTA TGCCGCGCAG TTCTGCCTTT CGGCGATGGG GCGCGACAGG CTTCTTGCTG CGGAACCGGA GGTGGGGCGC CGGGAGCTTG TGGCGGAACT TGAACGGGTG CTTGAGTTGC GCAATATGCT TCAGGAGGGG AGTGCGCTTC CTTTTTCATG GTTGCCGGAT ACACGGCCTC TTCTGAAAAA GCTTGAAATC CTTGAGAGTT ATCTTGAGCC GGAGGAGTTG CAGGATATTT ATCATCTCCT TTTTTCATCG GTTCAGTTGC GCAAGTTCAT GTTTTTTAAC CGCGAGGTCT ATCCTCTGCT GAATGAGTTT ACCATCAGGC TCTGGCTTGA AAAGAGCCTT CAGACCTCGA TTCGTCGTAT TATCGATGAG CAGTCGAGGG TGCGCGATAC GGCAAGCGAG GAGCTTCTGT TGATCCGGCG TGAGCTCGGC GGCAGCCGTG AGCTGATTCG AAGGAAGATG GAGCGGCTAC AGAGGCGTTG TCAGGAGAGC GGATGGCTGA TGGAGGATAC GATAGCGATC AAGAACGGGC GCCTGACGCT TGGTCTTCGG GTGGAGTACA AATACAAAAT TGCCGGCTAT ATACAGGATT ACTCCGGTAG CGGACAGACG GTTTTTATCG AGCCTGCCGA AACGCTTGAG ATCAGTAACC GCATTCAGGA TCTGGAGATC AGCGAGCGAA GGGAGATCGA GCGAATTCTG AAGGAGATGT CGGGAGCGTT GCGCCTTGAA CTTGAAAATC TGAGGTATAA CGAGATCATT CTTGGTGATT TTGATTCGCT CTACGCGCGG GCACGCTTTG CCGTTGAAAC GAACTCGGTG CTTCCGGGTA TTGCCGATGG ACAGTCCTTG CGAATTATCA GAGGGTTTCA TCCCTGGCTC CTGATTTCGC ATCATCATAA AGAGGTTATG CCTCTCGATC TTGATCTGGA TGAAACTGAC CGGGTGCTCG TAATTTCCGG TCCCAATGCG GGCGGTAAAT CGGTGGCGAT GAAGACCGCC GGTCTGCTCT GCTGCATGCT GGTGCATGGT TACCTGCTGC CTTGCAGCGA GAGTTCCGTG TTCCCTCTTT TCAGTGATAT TTTTATCGAG ATCGGCGACG ATCAGTCTAT TGAAAATGAT CTCTCCACCT TCAGCTCCCA TCTTGGCGCG ATCAGAACCA TCCTTGACGT TGCAGGGAGC GGCGATCTGG TGCTGATTGA CGAGCTTTGC GCCGGCACGG ATGTTGAGGA GGGCGGGGCC ATTGCTCGAG CAGTGATGGA GGAACTGCTC AATCGCGGGA CAAAAACCAT TGTTACCACT CATCTCGGCG ACCTGAAGGC CTATGCTCAT GAGCGTGAGG GAGTGCTTAA CGGCGCCATG GAGTTTGACC GGGCTGGTCT GGTGCCGACT TTCCGTTTTG TCAAGGGATT GCCGGGTAAC AGTTTTGCCT TTGCGATGAT GAAGCGGATG GGTTTTCCTG TGAAAATGGT TGAGCGGGCT TCGGAATTTA TGATGGATGA GCGTATCGGG CTTGACCGGA TGCTTGATGA CTTGAGTCGT CTCTTTGAAG AGAATCGTCT GCTGAAGCAG CAGCTTGAGG GTGAACGGGC TGATCTTGCT GAACGGGTTA TTGCTCTTCG CGCCGAGGAG GCCGGTGTTG AACGGAAGCA GAGAGAACTG AGACTTGGTG CTGCAAGAGA GTTGCAGAAA GAGGTAGAAC ATGCACGAAA AGAGATCAGG GAGATTGTTC AGGAGGTGAG GAACGCTCCA GCTGATGCAA AAACTGTACA GGATTCGAGA AAAAAACTTG GTCTGAAAAA GCAGGAAGCT GAAAAGAGTG AATCAGTTCT GGATGCTGAA GCTGAGAGTG CAGTTCATCT TGATCGTTCC ATCCGAGAGG GTGATCTGGT CAGGATTCTT GACAGCACGG CCTCAGGCGA GGTCGAGAGC GTCAATGGAG AGAGTGTTGT GGTGCAATGT GGTCATTTCA GGTTGACCAC GTCGCTTAAA AACCTTGAGA AAACTTCGAA AACGCAGGTT AAAAAAAATC TCAGAGAGCC TCTGCTCCGG CAGCAAAAGG GCTCCTGGTC AGCAATCACC TCTGAGGTGG ATTCGACAAA ACTTGACTTG CGGGGGCTCA GTGGTGATGA GGCGATCATG AAAATCGACA GGTTTATCGA TACCATGCGT CTTAATCGTA TTCATTCAGC GATGATTCTT CACGGCAAGG GAACCGGATC GCTGCGGCAG CGAACGGCGG AATTTCTCCA GCAGCATGGC TCGGTCAAAA GTTTTCGACT GGGAGAGTGG GGCGAGGGAG GAGCAGGCGT GACCATCGTC GAGATTGAAT CGTGA
|
Protein sequence | MNPVSLKKLE FDKVANYAAQ FCLSAMGRDR LLAAEPEVGR RELVAELERV LELRNMLQEG SALPFSWLPD TRPLLKKLEI LESYLEPEEL QDIYHLLFSS VQLRKFMFFN REVYPLLNEF TIRLWLEKSL QTSIRRIIDE QSRVRDTASE ELLLIRRELG GSRELIRRKM ERLQRRCQES GWLMEDTIAI KNGRLTLGLR VEYKYKIAGY IQDYSGSGQT VFIEPAETLE ISNRIQDLEI SERREIERIL KEMSGALRLE LENLRYNEII LGDFDSLYAR ARFAVETNSV LPGIADGQSL RIIRGFHPWL LISHHHKEVM PLDLDLDETD RVLVISGPNA GGKSVAMKTA GLLCCMLVHG YLLPCSESSV FPLFSDIFIE IGDDQSIEND LSTFSSHLGA IRTILDVAGS GDLVLIDELC AGTDVEEGGA IARAVMEELL NRGTKTIVTT HLGDLKAYAH EREGVLNGAM EFDRAGLVPT FRFVKGLPGN SFAFAMMKRM GFPVKMVERA SEFMMDERIG LDRMLDDLSR LFEENRLLKQ QLEGERADLA ERVIALRAEE AGVERKQREL RLGAARELQK EVEHARKEIR EIVQEVRNAP ADAKTVQDSR KKLGLKKQEA EKSESVLDAE AESAVHLDRS IREGDLVRIL DSTASGEVES VNGESVVVQC GHFRLTTSLK NLEKTSKTQV KKNLREPLLR QQKGSWSAIT SEVDSTKLDL RGLSGDEAIM KIDRFIDTMR LNRIHSAMIL HGKGTGSLRQ RTAEFLQQHG SVKSFRLGEW GEGGAGVTIV EIES
|
| |