Gene Information |
Locus tag | Cpha266_0172 |
Symbol | |
ID | 4568469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 194046 |
End bp | 195920 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639764772 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_910663 |
Protein GI | 119356019 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
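The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 194046
END_BP = 195920


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1875
print(protein_length(length_bp))  # 624
```

Under these assumptions the computed values agree with the Gene Length (1875 bp) and Protein Length (624 aa) fields in the record.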
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA TCGCGCGATT ACCGGACAAT GTCGCCAACA AAATTTCGGC TGGCGAGGTA GTGCAACGTC CGGCATCAGT CATCAAGGAA CTTCTCGAAA ACGCCATTGA TGCCTGCGCG TCAAAAATAA CCGTGACCAT CAAGGATGCC GGCAAAGAAC TGGTGCAGAT AGTTGACAAC GGAATCGGTA TGAGTCGCCA GGATGCGCTG CTCTCGGTTG AACGCTTTGC AACAAGCAAG ATTTCAGGCG TTGAAGACCT CGACTCGCTC ATGAGCCTTG GATTCAGAGG CGAAGCGCTG CCAAGCATCG CTTCGGTTTC ACAGTTCGAA CTGAAAACCA AGCCGGAAGG CGCCCTGCTC GGCTTCAGGT TTCGCTGCGA CGGAGGAGAA CCGGTTGAAG AATCCGAAGT CAATGCCGAA AAGGGAACAA CCATCACAGT AAGAAATCTC TTTTACAACG TTCCGGCACG CAGAAAATTC CTCAAATCAA ACGCAACAGA GTTCCGTCAT ATTTTCGAGT CAGTCAAGTC GCTGGCACTT GCCTATCCGG AAATCGAATG GAAAATGGTC AGCGATGACG AAGAACTCTT CCACTTCAGA ACTCCCGACA TTTACGAACG CCTCGATGCT TTTTATGGTG AAAATTTCTC CCTGAGCCTC ATACCTGTTT CTGAAGAGAA CGATTACCTG TCGATAAGCG GCTTTCTGGG AAAACCGGGC ATGCAGAAAC GGCAGAAACT CGATCAGTAT ATCTATGTTA ACCGGAGAAT TATTCAAAAC AGGATGCTCT CACAAGCCTT GCAGCAAGCC TATGGCGAAC TGCTCGTCGA GCGTCAGGCT CCGTTTGCCC TGCTCTTTCT CGGCATTGAC CCCTCACGTA TTGATGTTAA CGTACACCCG GCAAAACTTG AAGTCAAATT CGAAGATGAG CGAAGTGTAC GGACCATGTT TTATCCTGTT ATCAAGCGGA CCATCCAGCT TCACGACTTT TCACCTGATG CCGCTGAAAA AGAACCCTGT TCGATCAAGG AAGGCACTCT TGATTGTTCA TCAAGAAAAC TCGGGTTTCA GGACATCGCG GAACCTGCAT CGACAACCAG CACACTCTAT GCAAACTATC GGCAGGGGGC TTTCGGCGAT ACACCCTTCG AACGACCTGC CTACGCGGAA AAAGAGCCCC GCCCGTCATC CATCAATACA GGCTTTGAGC GTTTTGAACC AGATCTGCGC GAAGGAGGCG ACCTGTTTTC GACAACACTC CAGGCAAGAC CTTACGAGGA CGACAACACT CCTGATCCGG GAGAAAACGA CCCCAAAATC TGGCAACTGC ACAACAAATA CATTATCTGC CAGATCAAGA CAGGAATGAT GATTATCGAC CAGCACGTAG CCCATGAGCG AGTGCTCTAC GAACGAGCCG TTGATGTCAT GAACCAGAAC GTACCAAACT CTCAGCAACT GCTCTTCCCC CAGAAAATCG AACTCCGTGC CTGGGAATAT GAAGTGTTCG AAGAAATTCG GGACGACCTC TATCGGCTTG GATTCAACCT CCGCTCATTC GGCGCAAAAA CAGTGATGAT CGAAGGAATT CCTCAGGATG TCAGACCCGG AACCGAAGTC ACCATCCTGC AGGACATGAT TACCGAGTTT CAGGAAAACA GCTCAAAGCT GAAACTCGAA AGAAGAGAAA ACCTTGCAAG ATCCTACTCC TGCCGCAATG CCATTATGGC CGGTCAGAAA CTATCGCTTG AAGAGATGCG CTCGTTGATT GACAACCTCT TCGCCACACG GGTACCCTAT 
ACCTGTCCGC ACGGCAGACC TGTTATCATA AAGCTCTCGC TCGACCAGCT CGACAGGATG TTCGGGCGAA AATAA
|
Protein sequence | MAKIARLPDN VANKISAGEV VQRPASVIKE LLENAIDACA SKITVTIKDA GKELVQIVDN GIGMSRQDAL LSVERFATSK ISGVEDLDSL MSLGFRGEAL PSIASVSQFE LKTKPEGALL GFRFRCDGGE PVEESEVNAE KGTTITVRNL FYNVPARRKF LKSNATEFRH IFESVKSLAL AYPEIEWKMV SDDEELFHFR TPDIYERLDA FYGENFSLSL IPVSEENDYL SISGFLGKPG MQKRQKLDQY IYVNRRIIQN RMLSQALQQA YGELLVERQA PFALLFLGID PSRIDVNVHP AKLEVKFEDE RSVRTMFYPV IKRTIQLHDF SPDAAEKEPC SIKEGTLDCS SRKLGFQDIA EPASTTSTLY ANYRQGAFGD TPFERPAYAE KEPRPSSINT GFERFEPDLR EGGDLFSTTL QARPYEDDNT PDPGENDPKI WQLHNKYIIC QIKTGMMIID QHVAHERVLY ERAVDVMNQN VPNSQQLLFP QKIELRAWEY EVFEEIRDDL YRLGFNLRSF GAKTVMIEGI PQDVRPGTEV TILQDMITEF QENSSKLKLE RRENLARSYS CRNAIMAGQK LSLEEMRSLI DNLFATRVPY TCPHGRPVII KLSLDQLDRM FGRK
|
| |
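As a spot check that the gene and protein sequences above correspond, the following sketch translates the first 30 and the last 15 nucleotides of the gene sequence. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the record gives the strand as "-"), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for sense and stop codons. The helper name `translate` is hypothetical.

```python
# Standard codon assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Fragments copied from the Gene sequence field above.
first_30 = "ATGGCAAAAA TCGCGCGATT ACCGGACAAT"
last_15 = "TTCGGGCGAA AATAA"

print(translate(first_30))  # MAKIARLPDN
print(translate(last_15))   # FGRK*
```

The two results match the beginning (MAKIARLPDN) and the end (FGRK, followed by a stop codon) of the protein sequence in the record.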