Gene VC0395_A0063 details

Gene Information

Locus tag: VC0395_A0063
Symbol: mutS
ID: 5135925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 56617
End bp: 59205
Gene Length: 2589 bp
Protein Length: 862 aa
Translation table: 11
GC content: 52%
IMG OID: 640531523
Product: DNA mismatch repair protein MutS
Protein accession: YP_001216036
Protein GI: 147675773
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
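
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(56617, 59205)
print(length)                  # 2589
print(protein_length(length))  # 862
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.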


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAAT CGAACGCCTC ACCGAGCGAG TCTCTTTCGC ACCACACCCC GATGATGCAA 
CAGTATTTAA GACTCAAGGC GGAAAACCCA GACATTCTGC TGTTCTATCG CATGGGTGAT
TTTTACGAAC TGTTTTATGA CGATGCCAAA CGCGCCTCTG AACTGCTAGA TATTTCCTTA
ACCAAACGCG GCGCATCGGC GGGTGAACCC ATTCCTATGG CGGGTGTGCC TTTCCATGCC
GTGGAAGGTT ATTTGGCGAA ACTCGTCCAG ATGGGGGAAT CGGTCGCGAT CTGCGAACAG
ATTGGCGATC CAGCCACCAG CAAAGGCCCC GTTGAGCGCA AAGTAGTGCG GATTGTGACA
CCGGGAACGG TGACCGATGA AGCCCTGCTC TCAGAGCGAG TCGATAACCT GATCGCGGCG
ATTTATCACC ATAACGGCCG CTTTGGTTAT GCAACCATGG ATATCACCTC CGGTCGTTTT
CAGCTCAGTG AGCCGCAAAC CGAAGAAGAG ATGGCCGCCG AGCTGCAACG CACTTCGCCT
CGTGAACTCT TGTTTCCTGA AGATTTTTCG CCAGTGCATC TAATGGCAAG CCGCCAAGGC
AACCGCCGTC GCCCGATTTG GGAATTTGAA CTCGATACCG CGAAACAGCA GCTCAACCAG
CAATTTGGCA CGCGCGATCT GGTTGGCTTT GGGGTTGAGC AAGCAAAACT TGGCTTGTGC
GCAGCAGGCT GCTTGATCCA ATACGTCAAA GATACTCAGC GTACCGCTTT GCCACACATT
CGTTCACTGA CTTGGGATCG CCAAGATCAA TCGGTGATTT TGGATGCCGC GACGCGGCGC
AACCTTGAAC TCACTCATAA CTTGGCGGGT GGAACAGATA ACACGCTTGC AGAAGTACTC
GACCATTGTG CTACCCCCAT GGGCAGCCGC ATGCTCAAAC GTTGGATCCA CCAACCGATG
CGCGATAACG CCACTCTCAA TCAGCGTTTA GATGCGATCA CTGAGCTCAA AGAAACCGCT
TTGTATGGGG AACTGCATCC CGTGCTCAAA CAGATTGGCG ATATTGAACG GATCCTCGCC
CGTCTAGCGC TGCGTTCAGC GCGGCCGCGC GATTTAGCTC GCCTACGCCA TGCCATGCAG
CAGTTACCTG AATTGCACTC GGTCATGAGC GAGCTTAAGC AGCCTCACCT TACCGAGCTA
CGCACCCATG CCGAGCCAAT GGATGAATTG TGCGATTTAC TCGAACGTGC GATCAAAGAA
AACCCACCCG TGGTGATCCG CGATGGTGGC GTGATCGCCG ATGGTTACAG TGCCGAACTC
GATGAATGGC GCGATTTAGC CAATGGCGCA ACCGAATTTC TGGAACGCTT GGAAGCCGAA
GAGCGCGATC GTCACGGCAT TGATACCCTG AAAGTGGGTT ATAACAATGT GCACGGTTTC
TACATTCAAG TGAGCCGGGG TCAGAGCCAT TTAGTGCCTC CCCACTATGT ACGCCGTCAA
ACCTTAAAAA ATGCTGAGCG TTACATCATC GAAGAACTTA AACAGCATGA AGATAAAGTG
CTCAATTCTA AGTCTCGTGC TTTAGCGTTA GAAAAACAGC TGTGGGAAGA GTTGTTCGAT
TTGCTGATGC CGCATCTTGA GCAGCTGCAA CAACTGGCGG CTTCCGTTGC TCAATTGGAT
GTGCTGCAAA ACCTCGCAGA GCGCGCAGAA AACTTGGAAT ATTGTCGCCC AACGCTGGTT
CAAGAAGCGG GCATTCACAT CCAAGGTGGT CGCCACCCTG TGGTAGAGCG AGTAATGAAT
GAGCCGTTTA TCGCCAACCC GATCGAACTT AATCCACAGC GACGCATGCT GATCATTACT
GGTCCGAATA TGGGCGGTAA ATCGACTTAC ATGCGTCAAA CCGCCTTGAT TGCACTGATG
GCGCATATCG GTAGTTATGT GCCGGCTGAA AGTGCGTCAA TCGGCCCACT AGATCGTATT
TTTACCCGTA TCGGCGCGTC CGATGATCTT GCGTCCGGCC GCTCAACCTT TATGGTGGAG
ATGACGGAAA CCGCCAATAT TTTGCACAAT GCGACCCGTA ACAGTTTGGT GTTGATGGAT
GAGATTGGCC GCGGTACCAG TACTTATGAC GGACTCTCGC TCGCTTGGGC GAGTGCCGAG
TGGCTTGCCA AAGAGATTGG TGCCATGACG CTGTTTGCTA CTCACTATTT TGAGTTAACC
GAACTGCCGA ATGTGTTACC TCATCTAGCG AATGTTCATT TAGATGCGGT TGAACATGGT
GATGGCATCG CCTTTATGCA CGCAGTGCAA GAGGGTGCCG CGAGTAAATC GTATGGTTTA
GCCGTAGCAG GACTGGCTGG CGTACCCAAG CCCGTGATTA AAAATGCGCG CGCCAAATTA
CAACAACTTG AGCTATTAAG CTCACAACCT GCCGAGACTC GTAAGCCAAG CCGCGTCGAT
ATTGCCAACC AGCTGAGCTT AATTCCTGAG CCGAGTGCTG TTGAGCAAGC CTTGGCAGGC
GTAGATCCTG ACCAGCTCAC TCCTCGCCAA GCTCTAGATA TGCTCTATCA ATTGAAAAAG
CTGCTCTAG
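
The gene sequence is printed in blocks of ten bases, so it has to be joined before any calculation such as GC content. The sketch below shows one way to do that in Python. The helper names are hypothetical, and only the first line of the sequence is used as input, so the result will not necessarily match the GC content field reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence: drop whitespace and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only, for brevity.
excerpt = "ATGATGAAAT CGAACGCCTC ACCGAGCGAG TCTCTTTCGC ACCACACCCC GATGATGCAA"
seq = clean_sequence(excerpt)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Running the same two functions over the full sequence text would give the figure comparable to the GC content field.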
 
Protein sequence
MMKSNASPSE SLSHHTPMMQ QYLRLKAENP DILLFYRMGD FYELFYDDAK RASELLDISL 
TKRGASAGEP IPMAGVPFHA VEGYLAKLVQ MGESVAICEQ IGDPATSKGP VERKVVRIVT
PGTVTDEALL SERVDNLIAA IYHHNGRFGY ATMDITSGRF QLSEPQTEEE MAAELQRTSP
RELLFPEDFS PVHLMASRQG NRRRPIWEFE LDTAKQQLNQ QFGTRDLVGF GVEQAKLGLC
AAGCLIQYVK DTQRTALPHI RSLTWDRQDQ SVILDAATRR NLELTHNLAG GTDNTLAEVL
DHCATPMGSR MLKRWIHQPM RDNATLNQRL DAITELKETA LYGELHPVLK QIGDIERILA
RLALRSARPR DLARLRHAMQ QLPELHSVMS ELKQPHLTEL RTHAEPMDEL CDLLERAIKE
NPPVVIRDGG VIADGYSAEL DEWRDLANGA TEFLERLEAE ERDRHGIDTL KVGYNNVHGF
YIQVSRGQSH LVPPHYVRRQ TLKNAERYII EELKQHEDKV LNSKSRALAL EKQLWEELFD
LLMPHLEQLQ QLAASVAQLD VLQNLAERAE NLEYCRPTLV QEAGIHIQGG RHPVVERVMN
EPFIANPIEL NPQRRMLIIT GPNMGGKSTY MRQTALIALM AHIGSYVPAE SASIGPLDRI
FTRIGASDDL ASGRSTFMVE MTETANILHN ATRNSLVLMD EIGRGTSTYD GLSLAWASAE
WLAKEIGAMT LFATHYFELT ELPNVLPHLA NVHLDAVEHG DGIAFMHAVQ EGAASKSYGL
AVAGLAGVPK PVIKNARAKL QQLELLSSQP AETRKPSRVD IANQLSLIPE PSAVEQALAG
VDPDQLTPRQ ALDMLYQLKK LL
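
The protein sequence follows from the gene sequence by codon-wise translation with translation table 11. The Python sketch below is a simplified illustration, not the database's own method: it uses the codon assignments that table 11 shares with the standard genetic code and ignores the table's alternative start codons, which does not matter here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same assignments as the standard code; it differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGATGAAAT CGAACGCCTC ACCGAGCGAG"))  # MMKSNASPSE
print(translate("CTGCTCTAG"))                         # LL
```

The two outputs match the first ten and last two residues of the protein sequence listed above.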