Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2756 |
Symbol | mutL |
ID | 5137458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2909158 |
End bp | 2911119 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640534202 |
Product | DNA mismatch repair protein |
Protein accession | YP_001218614 |
Protein GI | 147675065 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.25704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATTC GAATCCTACC CGCCCGTTTA GCCAACCAAA TTGCAGCGGG TGAAGTTGTA GAAAGACCGG CTTCAGTAGT CAAGGAGTTG GTGGAAAACA GTTTAGATGC TGGAGCCACT CGAATTGATA TTGATCTCGA GAAGGGGGGC GCTAAGCTCA TTCGTATTCG CGATAATGGT TCGGGTATTG ATAAGGACGA ACTAGGGCTT GCGCTGAGTC GTCACGCGAC CTCTAAAATT CACACCCTCG ATGATCTGGA AGCGATCATG AGCCTTGGTT TTCGTGGCGA GGCCTTAGCC AGTATTAGCT CAGTCTCACG CTTAACCCTT ACTTCGCGCA CGGTAGCTCA AGAAGAAGCG TGGTCTGCCT ACAGTGAAGG CCGTGATATG GCGGTCAAGT TACAGCCCGC GGCTCACCCC GTGGGTACGA CTGTGGAAGT GCTGGATCTC TTTTTCAATA CCCCGGCACG GCGTAAATTT CTGCGCACCG AAAAAACCGA ATTTACCCAT ATTGATGAGC TGCTCAAACG AATTGCTCTG AGCCGTTTTG ATGTCAGCTT TACTCTGCGC CATAACGGAA AAATCGTGCG TCAGTATCGC GCTGCAACGA CTTTACCGCA GCAGGAAAAA CGTTTAGCTG CGGTGTGTGG CAACCCGTTT GTGCAACATA TGTTACGCAT TGAGTTAGAG CACCAAGGGC TTAAACTGCA TGGCTGGATC ACCACACCCG AAGGAGCGCG CCAGCAGAGC GATCTGCAAT ACTGTTACGT AAACGGCCGG ATGATGCGTG ATAAGCTGAT CAATCATGCG ATTCGCCAAA GCTACGAAAC CAGCTTACGT GTCGATCAAT TTGCTACCTA TGTGCTGTTT ATTGAACTCG ATCCTCATCA AGTGGATGTC AACGTGCATC CAGCCAAGCA TGAAGTGCGC TTCCATCAAG CTCGCCTCGT TCATGATTTC ATCTATCAAG CCTTAAGCAG TGCGCTAGTA CAAGGTGCGC AGGTGATGGC TCCGACCATT AATGAAGGGG CGTTTCATTT ACCTCACTGC GCGGAAGAGG TGAATCCTCC GGTTGTGCCG ATGATCGATA CCACGCAACA AGAAAGAGTG TGGCAAGCGG TGCAAAATAC GCCAGATTAT CCGCGCAAAG CTCCACGTGA TAACGATAGG GATGAGAGCG ATAATCCGCA GGTTAGAGAG CGAGCGGTCA GTAATCCTTG GGTGGCTTCA CCCAAGACTG CAAGTACTGG CAAAGAAAGA TATGGTTCGG CATCCGTCAG CAAAAAAGAG GCCGCGGTTT ATCAAACCTT GATGCAAACT CCCGATTTAT CGGATGAAGA ACCGAGTACT GCCAGTACCA TCGTCTCATC AATTGAAGCC GTCAAGGCTA ATATTGCTAT CGAAAAACTG GGTAAAGCGA TCCAAGTGGT CGCGGGACAG TACCTGTTGA TGAGTAGCCC ACAAGGCTGT GTGCTTATCT CGCTATACCA AGCTCAGCAA CTCAAACTGC GTGGATTACT CAATGCTCAG CATGGTGCGC TGAAAGCGCA GCCACTGCTG GTGCCACTCG CGCTAAAACT GAATGAATCT GAATGGCAAG TGGCGCAGCG TCATTCATCG GCCCTGTTGC AACTGGGTAT TGAGCTGAAA TCGCGCACCA ATCACAGCAT TATGGTGATG GCGGTTCCGC AGCCGCTGCG CCAGCAAAAT TTACAGCAAT TGCTGCCGGA TCTGTTATCT TACGCGGCGA GTTGTTCGGA AAGCCAAGCC TTGAGCCATC AAGCGTTGGC GGATTGGTTA ACTCAACGCA TCGTTGTAGA AAAAAGAGAC TACACTTTAG CCGAGGCGAT CGGCTTGATC GCAGAGCTGG AGCAGCTCTG GCAAGGCAAC TTGCCTTTGC AAGACCCGCA TTTTATTACT TTGGTGGATT TTTCCGCCTC AATTACAGCA TTACACTCAT GA
|
Protein sequence | MTIRILPARL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDLEKGG AKLIRIRDNG SGIDKDELGL ALSRHATSKI HTLDDLEAIM SLGFRGEALA SISSVSRLTL TSRTVAQEEA WSAYSEGRDM AVKLQPAAHP VGTTVEVLDL FFNTPARRKF LRTEKTEFTH IDELLKRIAL SRFDVSFTLR HNGKIVRQYR AATTLPQQEK RLAAVCGNPF VQHMLRIELE HQGLKLHGWI TTPEGARQQS DLQYCYVNGR MMRDKLINHA IRQSYETSLR VDQFATYVLF IELDPHQVDV NVHPAKHEVR FHQARLVHDF IYQALSSALV QGAQVMAPTI NEGAFHLPHC AEEVNPPVVP MIDTTQQERV WQAVQNTPDY PRKAPRDNDR DESDNPQVRE RAVSNPWVAS PKTASTGKER YGSASVSKKE AAVYQTLMQT PDLSDEEPST ASTIVSSIEA VKANIAIEKL GKAIQVVAGQ YLLMSSPQGC VLISLYQAQQ LKLRGLLNAQ HGALKAQPLL VPLALKLNES EWQVAQRHSS ALLQLGIELK SRTNHSIMVM AVPQPLRQQN LQQLLPDLLS YAASCSESQA LSHQALADWL TQRIVVEKRD YTLAEAIGLI AELEQLWQGN LPLQDPHFIT LVDFSASITA LHS
|
| |