Gene VC0395_A2756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2756 
SymbolmutL 
ID5137458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2909158 
End bp2911119 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content50% 
IMG OID640534202 
ProductDNA mismatch repair protein 
Protein accessionYP_001218614 
Protein GI147675065 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.25704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTC GAATCCTACC CGCCCGTTTA GCCAACCAAA TTGCAGCGGG TGAAGTTGTA 
GAAAGACCGG CTTCAGTAGT CAAGGAGTTG GTGGAAAACA GTTTAGATGC TGGAGCCACT
CGAATTGATA TTGATCTCGA GAAGGGGGGC GCTAAGCTCA TTCGTATTCG CGATAATGGT
TCGGGTATTG ATAAGGACGA ACTAGGGCTT GCGCTGAGTC GTCACGCGAC CTCTAAAATT
CACACCCTCG ATGATCTGGA AGCGATCATG AGCCTTGGTT TTCGTGGCGA GGCCTTAGCC
AGTATTAGCT CAGTCTCACG CTTAACCCTT ACTTCGCGCA CGGTAGCTCA AGAAGAAGCG
TGGTCTGCCT ACAGTGAAGG CCGTGATATG GCGGTCAAGT TACAGCCCGC GGCTCACCCC
GTGGGTACGA CTGTGGAAGT GCTGGATCTC TTTTTCAATA CCCCGGCACG GCGTAAATTT
CTGCGCACCG AAAAAACCGA ATTTACCCAT ATTGATGAGC TGCTCAAACG AATTGCTCTG
AGCCGTTTTG ATGTCAGCTT TACTCTGCGC CATAACGGAA AAATCGTGCG TCAGTATCGC
GCTGCAACGA CTTTACCGCA GCAGGAAAAA CGTTTAGCTG CGGTGTGTGG CAACCCGTTT
GTGCAACATA TGTTACGCAT TGAGTTAGAG CACCAAGGGC TTAAACTGCA TGGCTGGATC
ACCACACCCG AAGGAGCGCG CCAGCAGAGC GATCTGCAAT ACTGTTACGT AAACGGCCGG
ATGATGCGTG ATAAGCTGAT CAATCATGCG ATTCGCCAAA GCTACGAAAC CAGCTTACGT
GTCGATCAAT TTGCTACCTA TGTGCTGTTT ATTGAACTCG ATCCTCATCA AGTGGATGTC
AACGTGCATC CAGCCAAGCA TGAAGTGCGC TTCCATCAAG CTCGCCTCGT TCATGATTTC
ATCTATCAAG CCTTAAGCAG TGCGCTAGTA CAAGGTGCGC AGGTGATGGC TCCGACCATT
AATGAAGGGG CGTTTCATTT ACCTCACTGC GCGGAAGAGG TGAATCCTCC GGTTGTGCCG
ATGATCGATA CCACGCAACA AGAAAGAGTG TGGCAAGCGG TGCAAAATAC GCCAGATTAT
CCGCGCAAAG CTCCACGTGA TAACGATAGG GATGAGAGCG ATAATCCGCA GGTTAGAGAG
CGAGCGGTCA GTAATCCTTG GGTGGCTTCA CCCAAGACTG CAAGTACTGG CAAAGAAAGA
TATGGTTCGG CATCCGTCAG CAAAAAAGAG GCCGCGGTTT ATCAAACCTT GATGCAAACT
CCCGATTTAT CGGATGAAGA ACCGAGTACT GCCAGTACCA TCGTCTCATC AATTGAAGCC
GTCAAGGCTA ATATTGCTAT CGAAAAACTG GGTAAAGCGA TCCAAGTGGT CGCGGGACAG
TACCTGTTGA TGAGTAGCCC ACAAGGCTGT GTGCTTATCT CGCTATACCA AGCTCAGCAA
CTCAAACTGC GTGGATTACT CAATGCTCAG CATGGTGCGC TGAAAGCGCA GCCACTGCTG
GTGCCACTCG CGCTAAAACT GAATGAATCT GAATGGCAAG TGGCGCAGCG TCATTCATCG
GCCCTGTTGC AACTGGGTAT TGAGCTGAAA TCGCGCACCA ATCACAGCAT TATGGTGATG
GCGGTTCCGC AGCCGCTGCG CCAGCAAAAT TTACAGCAAT TGCTGCCGGA TCTGTTATCT
TACGCGGCGA GTTGTTCGGA AAGCCAAGCC TTGAGCCATC AAGCGTTGGC GGATTGGTTA
ACTCAACGCA TCGTTGTAGA AAAAAGAGAC TACACTTTAG CCGAGGCGAT CGGCTTGATC
GCAGAGCTGG AGCAGCTCTG GCAAGGCAAC TTGCCTTTGC AAGACCCGCA TTTTATTACT
TTGGTGGATT TTTCCGCCTC AATTACAGCA TTACACTCAT GA
 
Protein sequence
MTIRILPARL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDLEKGG AKLIRIRDNG 
SGIDKDELGL ALSRHATSKI HTLDDLEAIM SLGFRGEALA SISSVSRLTL TSRTVAQEEA
WSAYSEGRDM AVKLQPAAHP VGTTVEVLDL FFNTPARRKF LRTEKTEFTH IDELLKRIAL
SRFDVSFTLR HNGKIVRQYR AATTLPQQEK RLAAVCGNPF VQHMLRIELE HQGLKLHGWI
TTPEGARQQS DLQYCYVNGR MMRDKLINHA IRQSYETSLR VDQFATYVLF IELDPHQVDV
NVHPAKHEVR FHQARLVHDF IYQALSSALV QGAQVMAPTI NEGAFHLPHC AEEVNPPVVP
MIDTTQQERV WQAVQNTPDY PRKAPRDNDR DESDNPQVRE RAVSNPWVAS PKTASTGKER
YGSASVSKKE AAVYQTLMQT PDLSDEEPST ASTIVSSIEA VKANIAIEKL GKAIQVVAGQ
YLLMSSPQGC VLISLYQAQQ LKLRGLLNAQ HGALKAQPLL VPLALKLNES EWQVAQRHSS
ALLQLGIELK SRTNHSIMVM AVPQPLRQQN LQQLLPDLLS YAASCSESQA LSHQALADWL
TQRIVVEKRD YTLAEAIGLI AELEQLWQGN LPLQDPHFIT LVDFSASITA LHS