Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3219 |
Symbol | |
ID | 4070431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3810624 |
End bp | 3812567 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637985240 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_592294 |
Protein GI | 94970246 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.428383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.251683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCGCA TCCACGTTCT CTCCGAACAT GTCGCCAACA AAATCGCCGC CGGCGAGGTG GTCGAACGCC CCGCGTCGGT AGTGAAGGAG TTGATCGAGA ACTCGCTCGA TGCCGGCGCC AAGCGCATTC GCGTGCACGT AGAAGCTGGT GGCAAGAAAC TGATCCACAT TGTGGACGAC GGCATCGGCA TGTTTCGCGA CGACGCTATG CTTGCGTTCG AACGTCACGC GACTTCGAAA CTGAAAAATC CCGAGGACCT GCTCAGCATC TCGACACTCG GCTTCCGCGG TGAAGCACTC CCGTCCATCG CATCGGTTGC GCGCGTCCGC CTTGAAACTC GCGCCAACGA AGAACCAAGC GGCACCGTCC TCGAGATCGC CGGCGGCAAG ATTCTGAAGA TCGAAGAAGC CGGCCTACCC CTCGGTACCT CCATCGCAAT AAAAGACCTC TTTTTCAATA CACCCGCCCG CAAGAAATTT CTGAAAAGCG AATCCACCGA GCTCTCCCAC ATCGCGTCGC TGGTCACCCA TTACGCGCTC GCGCATCCCG AGATGCATTG GGAGCTGCAC TCCGCGACCA ACGCACTGCT CATTGCTCCG CCAGTCGCCA CGCAGAGCGA ACGCATCTAC CAGGTCTTCG GCAACGAGAC GCTCGACCAA CTCATCCCGC TCGCCGCGCA AATCAAACTC GAACGCATCG GGCTGCCTAA GCCACCGCCG TGGCTACGCA AAAACGAAGA CGACGAGGAA GAACAAACAG TCGAGCCCGG TGAAGTTCGA CTCCACGGAT TCATCTCGAA GCCCGAAATC CAGAAGCTCA ATCGCAACTC GATCTTCGTC TTCGTCAACG GCCGTCTCAT TCGCGACCGC CTCGTCCAGC ACGCACTCAC CGAGGCCTAT CGCAACATCC TGCCACCCAC GCTCTTCCCT GTCGTTCTCC TCTTTCTCGA AATGCCCTAC ACGGAAGTCG ACGTCAACGT GCATCCGTCG AAGACCGAAG TCCGCTTCCG CCAGCAGTCG CTCGTCCATG ACTTCGTGCG CGACTCGGTC CGCGCGGCAT TGTCGAAAGC GCGCCCAATC CCACAGTTCA TTTCCGAGAT TCACGCCCAA CCGAAGGCTT CGCCGTCACT TACACCCGGC GCACAGACCG CGCCTGCCTT CGCGTTGGAA GCGCAGGAAG AACCCGTGCT GCCCGAGCGC CTTCAATTCG GTGGCGACGC CATCTCCGTC GAGGCCAACG CAGCTGTGCC CGTCGCCCGC TTCGGTGCGC AGACCTTCGG TTCTCACGTC GCGCAGCAAC AAACGGTTCC CGAGACCAGT GGCTGCGACT ACGAACTCCC CGATCTACCG GCCGCTGACG CTCCGCTCGC ATCGCTCAGG CCCCTCGGCC AGATTCGCGA ATCCTTCATT CTTGCTACGA GCAACGAAGG CCTCTGGATC ATCGACCAGC ACGTCGCCCA CGAGCGCGTA CTCTTCGAGA AGGTCCTAAA GCAACGAGCA GCCGCCTCGG TTGAAACCCA GCAATTACTC ATGCCTCTGA TCGTCGAACT CACTCCCGGC CAGCAAGCCG TCTTCACCGA GATCGCCGAA GAGCTTCACC AGAACGGCTT CGAAGTCGAA CCCTTTGGCT CAAGGACTTT CGCCGTCAAA GCAGCACCCG CAGGAATCCG CGCCGAAGAT ATCGAAAAGA CCCTGAGCGA AGTCCTCGAT TCCTTCGAAC GCGAGCAGCA GGCCTTCAAC CTTGAGCACG CGCAAAGCCG CATCGCCGCC ACCATTGCCT GCCACGCAGC CATCAAGGTC AACATGCCGC TCACGCAGGA TAAAATGGAG TGGCTCCTCG CCGAGCTCGC CAAAACCGAG CACCCCATGA CTTGTCCGCA CGGACGCCCC ATCGTTCTTC GCTATTCTGT CAAGGACATT CAGAAGGCCT TTAAGCGCAT CTGA
|
Protein sequence | MGRIHVLSEH VANKIAAGEV VERPASVVKE LIENSLDAGA KRIRVHVEAG GKKLIHIVDD GIGMFRDDAM LAFERHATSK LKNPEDLLSI STLGFRGEAL PSIASVARVR LETRANEEPS GTVLEIAGGK ILKIEEAGLP LGTSIAIKDL FFNTPARKKF LKSESTELSH IASLVTHYAL AHPEMHWELH SATNALLIAP PVATQSERIY QVFGNETLDQ LIPLAAQIKL ERIGLPKPPP WLRKNEDDEE EQTVEPGEVR LHGFISKPEI QKLNRNSIFV FVNGRLIRDR LVQHALTEAY RNILPPTLFP VVLLFLEMPY TEVDVNVHPS KTEVRFRQQS LVHDFVRDSV RAALSKARPI PQFISEIHAQ PKASPSLTPG AQTAPAFALE AQEEPVLPER LQFGGDAISV EANAAVPVAR FGAQTFGSHV AQQQTVPETS GCDYELPDLP AADAPLASLR PLGQIRESFI LATSNEGLWI IDQHVAHERV LFEKVLKQRA AASVETQQLL MPLIVELTPG QQAVFTEIAE ELHQNGFEVE PFGSRTFAVK AAPAGIRAED IEKTLSEVLD SFEREQQAFN LEHAQSRIAA TIACHAAIKV NMPLTQDKME WLLAELAKTE HPMTCPHGRP IVLRYSVKDI QKAFKRI
|
| |