Gene Cpha266_1896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1896 
Symbol 
ID4570855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2202604 
End bp2205222 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content54% 
IMG OID639766478 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_912336 
Protein GI119357692 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0824451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTC CCCCGAGAGA ACACTCTCCG ATGATGCGTC AGTATCTCGA TGTCAAGGAG 
CGGTATCCCG ATTACCTGCT GCTCTTCAGG GTGGGCGATT TTTACGAAAC CTTTTTCGAT
GACGCAAAAG AGGTTTCCTC AGCACTGAAC ATCGTGCTCA CAAGGCGTTC GAACGGCTCA
TCTTCAGAGG TTCCCATGGC GGGGTTTCCG CACCATGCAA GCGAAGGCTA TATTGCCAGG
CTGGTTAAAA AAGGGTACAA GGTAGCCGTT TGCGATCAGG TTGAAGATCC TTCTGAGGCA
AAAGGGATCG TCAGGCGTGA AATCACCGAT ATTGTAACGC CGGGAATTAC CTACAGCGAC
AAAATTCTCG ATGACCGGCA CAACAACTAT CTCTGTGCGC TTGCCCTGCT TAAAGAGGGG
CGGCGGGTCG TTGCCGGTGC GGCATTTATC GACGTTACCA CCGCCGAGTT CAAAATTGCA
GAGCTCCTGC CTGAAGAGGT TGCTGATTTT GTCCGTTCGC TTCATCCCGC CGAACTGCTG
ATTGCAAGAA AGGAGAAAGA GCGGTTTGAG CCTGTCCGAA AGGAATTTCC GCCCGATATG
GTTGTTACCG AGCTCGATGA CTGGATGTTT GGCGAAGACC AGGCATCGGC AGTGCTTGCC
AGGCAGTTCA AAACCCATTC GCTCAAAGGT TTCGGCATTC ATGGCAACAG TGCCGGAAAA
GTAGCCGCGG GAGTCATTCT TCAGTACCTC GAAGAGACCC GTCAAAACCG TCTGCACTAC
ATTACCCGTA TCGGTACGCT GCAGAACACC GATTATATGA CGCTCGATCT GCAGACCCGG
CGAAACCTCG AAATCATCTC CTCCATGCAG GATGGCACGA TCAACGGCAG TCTGCTTCAG
GTGATCGATC GTACCGCCAA TCCCATGGGC GCACGCCTGA TTCGTCGCTG GCTGCAAAGC
CCGCTCAAGC GGCTTGAGGA TATCGCTTTG CGTCTTGACG CCGTTGAGGA GTTTAAGGAT
TTTTCGCCAT TGCGTCGCGA GGTACACGGT CATCTTTCTG AGATCAATGA TCTCGAACGG
GTGCTGTCGC GCATCGCCAC ATTCCGGTCT ATTCCCAGAG AGATGCGTCA GTTCGGCAGT
GCGTTATCGA AGATTCCGCT GTTGAAGGAG GCTCTGCTGC AAACCACAAC GGCAAGGCTT
CAGGCCCTCG GCAGGTCGCT GGTGGAGATG CCCGAGCTTG TCGCACTGAT TGAAAAAGCT
GTCGATCCGG AGGCCGGAGC CTCAATGCGC GACGGCGGCT ACATCCGGGC AGGGTACCAT
CAGGAGCTTG ACGAGCTGCG CACCATTGCC TCGACAGCCA AGGATCGGCT GCTCGAAATT
CAGCAGGAAG AGCGTGCCCG AACGTCGATT TCATCCCTCA AGGTTCAGTT CAACAAGGTT
TTCGGCTACT ATATCGAAAT CAGCAAAAGC AATCTCGACA AGGTGCCCGA CTACTATGAA
AAAAAGCAGA CACTTGTCAA TGCCGAACGT TTCACGATTC CGGCATTGAA AGAGTATGAA
GCGAAAATTC TCAATGCCGA AGAGAAGAGC ATTGTTCTTG AGCAGCGGCT GTTTCATGAT
CTCAGCCTTC TTATTGCGGA GCAGGCAGCT CTTGTTCAGA CTAACGCCGC GGTTATCGCC
GAGATTGACT GCCTCGCATC CTTTGCCGCC GTTGCCGAAG AGTACGGCTA CTGCAAGCCC
GAGGTTGCCG GGCATGACCG GCTGCTTGTT ACCGGCGGAC GTCACCCTGT TCTTGAACGG
ATGATGAGCA CGGACGACCC CTATGTTTCA AACGATCTGC TTTTTGACCG GAAGCAGAGA
TTACTGATCA TTACCGGACC GAACATGGCT GGTAAAAGTT CCTATCTGCG TCAGGCAGGG
CTGATTGTGC TGCTTGCCCA GGCAGGCTCT TTTGTTCCGG CGCAAAAGGC TGAAATCGGC
CTTGTCGACC GTATTTTCAC CAGGGTTGGC GCTTCGGACA ACCTTGCTTC GGGAGAGAGC
ACCTTTCTGG TGGAGATGAA CGAGGCAGCC AGCATTCTTA ACAACGCCAC ATCGAAAAGC
CTTCTCCTGC TCGATGAAAT AGGGAGGGGA ACCAGCACCA GCGACGGCAT GTCGATTGCC
TGGTCGATGA GCGAATTCAT CCACGACAGC ATCGGGGCGC GAACGCTCTT TGCCACGCAC
TACCATGAGC TCGCCGAGCT TGAAACGCGC CTTCAGGGTG TTGTCAACTA CAACGCCACC
GTGATTGAGA CGGCTGAAAA GGTTATCTTT CTGCGCAAAA TTGTCAGAGG CGCTTCCGAT
AACAGCTACG GCATCGAAGT TGCCAGAATG GCCGGCATGC CTCAGGAGGT CATCGTGCGG
GCAAAGGAGA TTCTGGCGGG AATGGAAAAA CGGGAGATCG ACGTATCAGG AATAAAGCAA
CCATCGATAG AAAGCATGCA GATAAGCCTG TTTGAAGAGG CCGATTCGCG GCTTCGAACC
GCGATTGAAA ATCTCGACCT TGACCGGCTG ACTCCGCTCG ACGCCCTGAT TGAACTGAAA
AAGTTGCAGG ATCTGGCACT CAAAGGATGC GGACGCTGA
 
Protein sequence
MSSPPREHSP MMRQYLDVKE RYPDYLLLFR VGDFYETFFD DAKEVSSALN IVLTRRSNGS 
SSEVPMAGFP HHASEGYIAR LVKKGYKVAV CDQVEDPSEA KGIVRREITD IVTPGITYSD
KILDDRHNNY LCALALLKEG RRVVAGAAFI DVTTAEFKIA ELLPEEVADF VRSLHPAELL
IARKEKERFE PVRKEFPPDM VVTELDDWMF GEDQASAVLA RQFKTHSLKG FGIHGNSAGK
VAAGVILQYL EETRQNRLHY ITRIGTLQNT DYMTLDLQTR RNLEIISSMQ DGTINGSLLQ
VIDRTANPMG ARLIRRWLQS PLKRLEDIAL RLDAVEEFKD FSPLRREVHG HLSEINDLER
VLSRIATFRS IPREMRQFGS ALSKIPLLKE ALLQTTTARL QALGRSLVEM PELVALIEKA
VDPEAGASMR DGGYIRAGYH QELDELRTIA STAKDRLLEI QQEERARTSI SSLKVQFNKV
FGYYIEISKS NLDKVPDYYE KKQTLVNAER FTIPALKEYE AKILNAEEKS IVLEQRLFHD
LSLLIAEQAA LVQTNAAVIA EIDCLASFAA VAEEYGYCKP EVAGHDRLLV TGGRHPVLER
MMSTDDPYVS NDLLFDRKQR LLIITGPNMA GKSSYLRQAG LIVLLAQAGS FVPAQKAEIG
LVDRIFTRVG ASDNLASGES TFLVEMNEAA SILNNATSKS LLLLDEIGRG TSTSDGMSIA
WSMSEFIHDS IGARTLFATH YHELAELETR LQGVVNYNAT VIETAEKVIF LRKIVRGASD
NSYGIEVARM AGMPQEVIVR AKEILAGMEK REIDVSGIKQ PSIESMQISL FEEADSRLRT
AIENLDLDRL TPLDALIELK KLQDLALKGC GR