Gene Information |
Locus tag | Cphamn1_0302 |
Symbol | |
ID | 6373957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 296357 |
End bp | 298231 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642682816 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_001958752 |
Protein GI | 189499282 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
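
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record is reported, not something the table states.

```python
# Values copied from the Gene Information table above.
start_bp = 296357
end_bp = 298231

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1875 0 624
```

Under those assumptions the result agrees with the reported Gene Length (1875 bp) and Protein Length (624 aa).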
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000565974 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00985332 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGAGAA TTGCCAGGTT GCCCGATATT GTCGCGAACA AGATTTCAGC AGGGGAGGTT GTCCAGCGCC CTGCTTCAGT GGTGAAAGAA CTGCTCGAAA ATGCCATTGA TGCAGGGGCG ACACGTATCA CTGTGGCGAT AAAAGATGCG GGAAAAGAGC TTGTTCAGGT TATCGACAAT GGTTCGGGCA TGGATGAAGA GGATGCGCTC CGTTGTGTCG AACGGTTTGC TACCAGCAAA ATTTCTGATG CGGAGGAGCT CGATGCCCTC ACGACTCTGG GGTTCAGGGG AGAGGCGCTG GCAAGTATCT CCACGGTCTC GCATTTCGAA CTCAGGACGC GGCGTGAGAA CGACAACGTG GGGATTCAGC TGCGTTACGA GGGAGGTGTG CTGTCGGAGA GGGGCAAGGC GGCGTCTGAG CCGGGTACGG CGGTAAGCGT ACGAAACCTT TTTTACAATG TTCCGGCCCG CAGGAAGTTT CTTAAATCAA ATGCTACGGA ATTCAAGCAT ATTTTCGAGA GCGTGAAGGC CCAGGTGCTT GCCTATCCCG AGATACAGTG GCAGATGATC AATGATGATG AAACGCTTTT CGATTTCAGA AGCTCGGACA TGCACGAGCG CCTGAATTTT TTTTTCGGTG ACGACTTTGC CGGGAGTCTG ATAGAAGTTC ACGATGATAA CGATTTTCTT TCTCTGCACG GTTATGTCGG CAAGCCTTCC ATGCAGAAAC GCCAGAAGAA TGAACAGTTC ATCTATCTGA ACAGGCGGGT GATCCAGAAC AGGATGCTCT CACAGGCTTT GCAGCAGGCC TACGGGGAAC TTCTTGTAGA GCGCCATTCT CCCTTCGTGC TTCTCTTTCT TGGAATCGAC CCTCAGCAGA CTGATGTTAA CGTTCATCCC GCAAAGCTTG AAGTGAAGTT CGAGGATGAA CGCAGTGTGC GAACCATGTT TTATACTATT ATAAAACGAT CTGTCAGGAT GCAGGACTTC TCACCTGATG TCGGCGGTGA AGGGTTCCAT GAGACGAGTG ATTCTTTTTC TTCCCGGAGT TCTCAGCATA GCGATGCCAG GCTTGGCTTT CAGGCGGTTC CTTCCAGAGC GTCATCAACC GATGATCTCT ACAGGGAGTT TCAGGAGAGT ACGCCGAAGC GTCCGATGCC GGACAGAACG CGTGTCAGTG AACAGGAAGA GATGTTCAGT CACAGCGCCG ATATTTTCTG TGAACCGGAC AGGGAGTTTC GCAGCAGTGA TTTCGGACAG GTTTCAGAGG AGTTTGTTGA CGGAGTGCGC CTGGAACCGG AAGAGAAAGA TCCCAAAATC TGGCAACTGC ATAACAAGTA TATCATCTGT CAGATCAAGA CAGGATTGAT GCTTATCGAT CAGCATGTCG CTCATGAACG GGTTCTCTAT GAGCGTGCGG TAGATATTAT GGACAACAAC GTCCCGAATG CCCAGCAACT TCTTTTTCCT CAGAAAGTCG AGCTCAAGCC TTGGGAATTC GAGATCTATC TGGAGATTTG CGATGACCTC GACAGGCTTG GTTTCAATCT CGGCACACTG GGAACGAGGA CCGTTATGAT AGAGGGTGTT CCACAGGATG TTCGCAGCGG TTCGGAGGCC TATATCCTTC AGGACATGAT TCAGGAGTAT CAGCAGAATG CGTCAAAACT GAAGCTCGAG AAACGTGAAA ATCTTGCTAA ATCCTACTCC TGCCGGAACG CGATAATGAG CGGTCAGGCA TTGAGCCTTG AAGATATGCG CTCCCTTATT GACAGGCTGT TTGCGACGAA AATGCCGTAT 
GTCTGTCCAC ATGGGCGTCC GGTAATTATA CGGATCTCTC TTGACCAGCT GGACAGAATG TTCGGGCGGA AGTAG
|
Protein sequence | MPRIARLPDI VANKISAGEV VQRPASVVKE LLENAIDAGA TRITVAIKDA GKELVQVIDN GSGMDEEDAL RCVERFATSK ISDAEELDAL TTLGFRGEAL ASISTVSHFE LRTRRENDNV GIQLRYEGGV LSERGKAASE PGTAVSVRNL FYNVPARRKF LKSNATEFKH IFESVKAQVL AYPEIQWQMI NDDETLFDFR SSDMHERLNF FFGDDFAGSL IEVHDDNDFL SLHGYVGKPS MQKRQKNEQF IYLNRRVIQN RMLSQALQQA YGELLVERHS PFVLLFLGID PQQTDVNVHP AKLEVKFEDE RSVRTMFYTI IKRSVRMQDF SPDVGGEGFH ETSDSFSSRS SQHSDARLGF QAVPSRASST DDLYREFQES TPKRPMPDRT RVSEQEEMFS HSADIFCEPD REFRSSDFGQ VSEEFVDGVR LEPEEKDPKI WQLHNKYIIC QIKTGLMLID QHVAHERVLY ERAVDIMDNN VPNAQQLLFP QKVELKPWEF EIYLEICDDL DRLGFNLGTL GTRTVMIEGV PQDVRSGSEA YILQDMIQEY QQNASKLKLE KRENLAKSYS CRNAIMSGQA LSLEDMRSLI DRLFATKMPY VCPHGRPVII RISLDQLDRM FGRK
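
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch in plain Python. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is written out in the NCBI "TCAG" ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it permits, which does not matter here because this gene begins with ATG.

```python
# Codon table in NCBI "TCAG" order; amino-acid assignments are shared by
# translation tables 1 and 11 (the tables differ only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last bases of the gene sequence, copied from the record above.
first_codons = "ATGCCGAGAA TTGCCAGG"
last_codons = "TTCGGGCGGA AGTAG"

print(translate(first_codons))  # MPRIAR
print(translate(last_codons))   # FGRK
```

The two fragments translate to the first six and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.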