Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1896 |
Symbol | |
ID | 4570855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2202604 |
End bp | 2205222 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639766478 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_912336 |
Protein GI | 119357692 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0824451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTC CCCCGAGAGA ACACTCTCCG ATGATGCGTC AGTATCTCGA TGTCAAGGAG CGGTATCCCG ATTACCTGCT GCTCTTCAGG GTGGGCGATT TTTACGAAAC CTTTTTCGAT GACGCAAAAG AGGTTTCCTC AGCACTGAAC ATCGTGCTCA CAAGGCGTTC GAACGGCTCA TCTTCAGAGG TTCCCATGGC GGGGTTTCCG CACCATGCAA GCGAAGGCTA TATTGCCAGG CTGGTTAAAA AAGGGTACAA GGTAGCCGTT TGCGATCAGG TTGAAGATCC TTCTGAGGCA AAAGGGATCG TCAGGCGTGA AATCACCGAT ATTGTAACGC CGGGAATTAC CTACAGCGAC AAAATTCTCG ATGACCGGCA CAACAACTAT CTCTGTGCGC TTGCCCTGCT TAAAGAGGGG CGGCGGGTCG TTGCCGGTGC GGCATTTATC GACGTTACCA CCGCCGAGTT CAAAATTGCA GAGCTCCTGC CTGAAGAGGT TGCTGATTTT GTCCGTTCGC TTCATCCCGC CGAACTGCTG ATTGCAAGAA AGGAGAAAGA GCGGTTTGAG CCTGTCCGAA AGGAATTTCC GCCCGATATG GTTGTTACCG AGCTCGATGA CTGGATGTTT GGCGAAGACC AGGCATCGGC AGTGCTTGCC AGGCAGTTCA AAACCCATTC GCTCAAAGGT TTCGGCATTC ATGGCAACAG TGCCGGAAAA GTAGCCGCGG GAGTCATTCT TCAGTACCTC GAAGAGACCC GTCAAAACCG TCTGCACTAC ATTACCCGTA TCGGTACGCT GCAGAACACC GATTATATGA CGCTCGATCT GCAGACCCGG CGAAACCTCG AAATCATCTC CTCCATGCAG GATGGCACGA TCAACGGCAG TCTGCTTCAG GTGATCGATC GTACCGCCAA TCCCATGGGC GCACGCCTGA TTCGTCGCTG GCTGCAAAGC CCGCTCAAGC GGCTTGAGGA TATCGCTTTG CGTCTTGACG CCGTTGAGGA GTTTAAGGAT TTTTCGCCAT TGCGTCGCGA GGTACACGGT CATCTTTCTG AGATCAATGA TCTCGAACGG GTGCTGTCGC GCATCGCCAC ATTCCGGTCT ATTCCCAGAG AGATGCGTCA GTTCGGCAGT GCGTTATCGA AGATTCCGCT GTTGAAGGAG GCTCTGCTGC AAACCACAAC GGCAAGGCTT CAGGCCCTCG GCAGGTCGCT GGTGGAGATG CCCGAGCTTG TCGCACTGAT TGAAAAAGCT GTCGATCCGG AGGCCGGAGC CTCAATGCGC GACGGCGGCT ACATCCGGGC AGGGTACCAT CAGGAGCTTG ACGAGCTGCG CACCATTGCC TCGACAGCCA AGGATCGGCT GCTCGAAATT CAGCAGGAAG AGCGTGCCCG AACGTCGATT TCATCCCTCA AGGTTCAGTT CAACAAGGTT TTCGGCTACT ATATCGAAAT CAGCAAAAGC AATCTCGACA AGGTGCCCGA CTACTATGAA AAAAAGCAGA CACTTGTCAA TGCCGAACGT TTCACGATTC CGGCATTGAA AGAGTATGAA GCGAAAATTC TCAATGCCGA AGAGAAGAGC ATTGTTCTTG AGCAGCGGCT GTTTCATGAT CTCAGCCTTC TTATTGCGGA GCAGGCAGCT CTTGTTCAGA CTAACGCCGC GGTTATCGCC GAGATTGACT GCCTCGCATC CTTTGCCGCC GTTGCCGAAG AGTACGGCTA CTGCAAGCCC GAGGTTGCCG GGCATGACCG GCTGCTTGTT ACCGGCGGAC GTCACCCTGT TCTTGAACGG ATGATGAGCA CGGACGACCC CTATGTTTCA AACGATCTGC TTTTTGACCG GAAGCAGAGA TTACTGATCA TTACCGGACC GAACATGGCT GGTAAAAGTT CCTATCTGCG TCAGGCAGGG CTGATTGTGC TGCTTGCCCA GGCAGGCTCT TTTGTTCCGG CGCAAAAGGC TGAAATCGGC CTTGTCGACC GTATTTTCAC CAGGGTTGGC GCTTCGGACA ACCTTGCTTC GGGAGAGAGC ACCTTTCTGG TGGAGATGAA CGAGGCAGCC AGCATTCTTA ACAACGCCAC ATCGAAAAGC CTTCTCCTGC TCGATGAAAT AGGGAGGGGA ACCAGCACCA GCGACGGCAT GTCGATTGCC TGGTCGATGA GCGAATTCAT CCACGACAGC ATCGGGGCGC GAACGCTCTT TGCCACGCAC TACCATGAGC TCGCCGAGCT TGAAACGCGC CTTCAGGGTG TTGTCAACTA CAACGCCACC GTGATTGAGA CGGCTGAAAA GGTTATCTTT CTGCGCAAAA TTGTCAGAGG CGCTTCCGAT AACAGCTACG GCATCGAAGT TGCCAGAATG GCCGGCATGC CTCAGGAGGT CATCGTGCGG GCAAAGGAGA TTCTGGCGGG AATGGAAAAA CGGGAGATCG ACGTATCAGG AATAAAGCAA CCATCGATAG AAAGCATGCA GATAAGCCTG TTTGAAGAGG CCGATTCGCG GCTTCGAACC GCGATTGAAA ATCTCGACCT TGACCGGCTG ACTCCGCTCG ACGCCCTGAT TGAACTGAAA AAGTTGCAGG ATCTGGCACT CAAAGGATGC GGACGCTGA
|
Protein sequence | MSSPPREHSP MMRQYLDVKE RYPDYLLLFR VGDFYETFFD DAKEVSSALN IVLTRRSNGS SSEVPMAGFP HHASEGYIAR LVKKGYKVAV CDQVEDPSEA KGIVRREITD IVTPGITYSD KILDDRHNNY LCALALLKEG RRVVAGAAFI DVTTAEFKIA ELLPEEVADF VRSLHPAELL IARKEKERFE PVRKEFPPDM VVTELDDWMF GEDQASAVLA RQFKTHSLKG FGIHGNSAGK VAAGVILQYL EETRQNRLHY ITRIGTLQNT DYMTLDLQTR RNLEIISSMQ DGTINGSLLQ VIDRTANPMG ARLIRRWLQS PLKRLEDIAL RLDAVEEFKD FSPLRREVHG HLSEINDLER VLSRIATFRS IPREMRQFGS ALSKIPLLKE ALLQTTTARL QALGRSLVEM PELVALIEKA VDPEAGASMR DGGYIRAGYH QELDELRTIA STAKDRLLEI QQEERARTSI SSLKVQFNKV FGYYIEISKS NLDKVPDYYE KKQTLVNAER FTIPALKEYE AKILNAEEKS IVLEQRLFHD LSLLIAEQAA LVQTNAAVIA EIDCLASFAA VAEEYGYCKP EVAGHDRLLV TGGRHPVLER MMSTDDPYVS NDLLFDRKQR LLIITGPNMA GKSSYLRQAG LIVLLAQAGS FVPAQKAEIG LVDRIFTRVG ASDNLASGES TFLVEMNEAA SILNNATSKS LLLLDEIGRG TSTSDGMSIA WSMSEFIHDS IGARTLFATH YHELAELETR LQGVVNYNAT VIETAEKVIF LRKIVRGASD NSYGIEVARM AGMPQEVIVR AKEILAGMEK REIDVSGIKQ PSIESMQISL FEEADSRLRT AIENLDLDRL TPLDALIELK KLQDLALKGC GR
|
| |