Gene Cpha266_0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0668 
Symbol 
ID4569822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp761185 
End bp763569 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content52% 
IMG OID639765266 
ProductSmr protein/MutS2 
Protein accessionYP_911147 
Protein GI119356503 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.529959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCCG TCAGTCTGAA AAAACTTGAA TTCGATAAAG TAGCCAATTA TGCCGCGCAG 
TTCTGCCTTT CGGCGATGGG GCGCGACAGG CTTCTTGCTG CGGAACCGGA GGTGGGGCGC
CGGGAGCTTG TGGCGGAACT TGAACGGGTG CTTGAGTTGC GCAATATGCT TCAGGAGGGG
AGTGCGCTTC CTTTTTCATG GTTGCCGGAT ACACGGCCTC TTCTGAAAAA GCTTGAAATC
CTTGAGAGTT ATCTTGAGCC GGAGGAGTTG CAGGATATTT ATCATCTCCT TTTTTCATCG
GTTCAGTTGC GCAAGTTCAT GTTTTTTAAC CGCGAGGTCT ATCCTCTGCT GAATGAGTTT
ACCATCAGGC TCTGGCTTGA AAAGAGCCTT CAGACCTCGA TTCGTCGTAT TATCGATGAG
CAGTCGAGGG TGCGCGATAC GGCAAGCGAG GAGCTTCTGT TGATCCGGCG TGAGCTCGGC
GGCAGCCGTG AGCTGATTCG AAGGAAGATG GAGCGGCTAC AGAGGCGTTG TCAGGAGAGC
GGATGGCTGA TGGAGGATAC GATAGCGATC AAGAACGGGC GCCTGACGCT TGGTCTTCGG
GTGGAGTACA AATACAAAAT TGCCGGCTAT ATACAGGATT ACTCCGGTAG CGGACAGACG
GTTTTTATCG AGCCTGCCGA AACGCTTGAG ATCAGTAACC GCATTCAGGA TCTGGAGATC
AGCGAGCGAA GGGAGATCGA GCGAATTCTG AAGGAGATGT CGGGAGCGTT GCGCCTTGAA
CTTGAAAATC TGAGGTATAA CGAGATCATT CTTGGTGATT TTGATTCGCT CTACGCGCGG
GCACGCTTTG CCGTTGAAAC GAACTCGGTG CTTCCGGGTA TTGCCGATGG ACAGTCCTTG
CGAATTATCA GAGGGTTTCA TCCCTGGCTC CTGATTTCGC ATCATCATAA AGAGGTTATG
CCTCTCGATC TTGATCTGGA TGAAACTGAC CGGGTGCTCG TAATTTCCGG TCCCAATGCG
GGCGGTAAAT CGGTGGCGAT GAAGACCGCC GGTCTGCTCT GCTGCATGCT GGTGCATGGT
TACCTGCTGC CTTGCAGCGA GAGTTCCGTG TTCCCTCTTT TCAGTGATAT TTTTATCGAG
ATCGGCGACG ATCAGTCTAT TGAAAATGAT CTCTCCACCT TCAGCTCCCA TCTTGGCGCG
ATCAGAACCA TCCTTGACGT TGCAGGGAGC GGCGATCTGG TGCTGATTGA CGAGCTTTGC
GCCGGCACGG ATGTTGAGGA GGGCGGGGCC ATTGCTCGAG CAGTGATGGA GGAACTGCTC
AATCGCGGGA CAAAAACCAT TGTTACCACT CATCTCGGCG ACCTGAAGGC CTATGCTCAT
GAGCGTGAGG GAGTGCTTAA CGGCGCCATG GAGTTTGACC GGGCTGGTCT GGTGCCGACT
TTCCGTTTTG TCAAGGGATT GCCGGGTAAC AGTTTTGCCT TTGCGATGAT GAAGCGGATG
GGTTTTCCTG TGAAAATGGT TGAGCGGGCT TCGGAATTTA TGATGGATGA GCGTATCGGG
CTTGACCGGA TGCTTGATGA CTTGAGTCGT CTCTTTGAAG AGAATCGTCT GCTGAAGCAG
CAGCTTGAGG GTGAACGGGC TGATCTTGCT GAACGGGTTA TTGCTCTTCG CGCCGAGGAG
GCCGGTGTTG AACGGAAGCA GAGAGAACTG AGACTTGGTG CTGCAAGAGA GTTGCAGAAA
GAGGTAGAAC ATGCACGAAA AGAGATCAGG GAGATTGTTC AGGAGGTGAG GAACGCTCCA
GCTGATGCAA AAACTGTACA GGATTCGAGA AAAAAACTTG GTCTGAAAAA GCAGGAAGCT
GAAAAGAGTG AATCAGTTCT GGATGCTGAA GCTGAGAGTG CAGTTCATCT TGATCGTTCC
ATCCGAGAGG GTGATCTGGT CAGGATTCTT GACAGCACGG CCTCAGGCGA GGTCGAGAGC
GTCAATGGAG AGAGTGTTGT GGTGCAATGT GGTCATTTCA GGTTGACCAC GTCGCTTAAA
AACCTTGAGA AAACTTCGAA AACGCAGGTT AAAAAAAATC TCAGAGAGCC TCTGCTCCGG
CAGCAAAAGG GCTCCTGGTC AGCAATCACC TCTGAGGTGG ATTCGACAAA ACTTGACTTG
CGGGGGCTCA GTGGTGATGA GGCGATCATG AAAATCGACA GGTTTATCGA TACCATGCGT
CTTAATCGTA TTCATTCAGC GATGATTCTT CACGGCAAGG GAACCGGATC GCTGCGGCAG
CGAACGGCGG AATTTCTCCA GCAGCATGGC TCGGTCAAAA GTTTTCGACT GGGAGAGTGG
GGCGAGGGAG GAGCAGGCGT GACCATCGTC GAGATTGAAT CGTGA
 
Protein sequence
MNPVSLKKLE FDKVANYAAQ FCLSAMGRDR LLAAEPEVGR RELVAELERV LELRNMLQEG 
SALPFSWLPD TRPLLKKLEI LESYLEPEEL QDIYHLLFSS VQLRKFMFFN REVYPLLNEF
TIRLWLEKSL QTSIRRIIDE QSRVRDTASE ELLLIRRELG GSRELIRRKM ERLQRRCQES
GWLMEDTIAI KNGRLTLGLR VEYKYKIAGY IQDYSGSGQT VFIEPAETLE ISNRIQDLEI
SERREIERIL KEMSGALRLE LENLRYNEII LGDFDSLYAR ARFAVETNSV LPGIADGQSL
RIIRGFHPWL LISHHHKEVM PLDLDLDETD RVLVISGPNA GGKSVAMKTA GLLCCMLVHG
YLLPCSESSV FPLFSDIFIE IGDDQSIEND LSTFSSHLGA IRTILDVAGS GDLVLIDELC
AGTDVEEGGA IARAVMEELL NRGTKTIVTT HLGDLKAYAH EREGVLNGAM EFDRAGLVPT
FRFVKGLPGN SFAFAMMKRM GFPVKMVERA SEFMMDERIG LDRMLDDLSR LFEENRLLKQ
QLEGERADLA ERVIALRAEE AGVERKQREL RLGAARELQK EVEHARKEIR EIVQEVRNAP
ADAKTVQDSR KKLGLKKQEA EKSESVLDAE AESAVHLDRS IREGDLVRIL DSTASGEVES
VNGESVVVQC GHFRLTTSLK NLEKTSKTQV KKNLREPLLR QQKGSWSAIT SEVDSTKLDL
RGLSGDEAIM KIDRFIDTMR LNRIHSAMIL HGKGTGSLRQ RTAEFLQQHG SVKSFRLGEW
GEGGAGVTIV EIES