Gene Acid345_2697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2697 
Symbol 
ID4071599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3187411 
End bp3190059 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content59% 
IMG OID637984714 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_591772 
Protein GI94969724 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.60859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.57313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGACC CGACAACGCC GTTGATGAAG CAGTGGGCCC AGGTCAAACG CGACCATCCC 
AATGCCCTGC TCTTCTTCCG ACTCGGGGAT TTCTACGAGC TGTTCTTTGA CGACGCCGTG
ATCGCCGCGC GCGAATTGCA GATCACGCTG ACGGCGCGCA ACAAGGAAAA GGGCAACAGC
GTTCCGATGT GCGGCGTGCC CTATCACGCC GCCGAGAACT ACATTTCCAA ACTGATTCGT
CGCGGCTTCA AGGTTGCGGT CTGCGACCAG GTGGAAGACC CGAAACTCGC GAAGAAGTTG
GTGAAACGCG AAGTGACGCG GGTGATGACA CCCGGCACAA CGGCCGATTC GCAGCTTGGT
TCGGAAGAAA ACAACTTCCT TGCGGCGGTC GCGAGCCACG GAGATTTCGT CGGTTTTGCA
GCACTTGATT TGTCAACCGG CGAATTTCGC GCGACCGAGT TCAAGGGTAG CGATGCCCGG
CGGCGGATCC AGGAAGAGTT ACTGACGCTG CGTCCGCGAG AAATACTCTA CGGCTCTTCC
CTGCCGTTGT TCGACGCCGC ACGACCGGCC AACGCTGTGA GTGGCGCGAT GCCGAAATTG
GCAACGGTTG AGGGTGCGAG TTGGGCGGAG ACGCCGCTGG AAGACTGGGT GTTTGCTCCG
GACTACGCGA TCCCGCTTGT TGAGAATCAT TTCGGCGTGT TGTCGCTAGA GGGATTCGGG
CTGGCGAACA AGGCGTCGGC CGCGAGCGCA GCGGGAGCGA TCCTGCACTA CGTTCGCCAG
ACGCAGCGCG GCTCGCTGCA TCACGTGGAC CGTATCGGCT TCTACGAACG GCAGAACTGC
CTGGTGCTCG ATGCGGTGAC GGTGCGCAAC CTGGAGTTGA TTGAGCCGCT GTTCACAAAT
ACTGGCGAGG GTGTAACTCT CTTTCGCGCA CTGGACGCGA CGATGACGCC GATGGGCAAG
CGCCTGCTGC GGGCGTGGAT GCTGCGGCCT TCGATTGATA CGTCGGAGAT CAATGCGCGG
CTGGATGCGA TCGAGATGCA GGTAGTGGAT ACACTCGGCC GCGAAGAACT GCGCCGGGCG
ATGGACGGAA TTCTCGACAT CGAACGCTTG CTCAGCCGGG TGACGCTGGA GACCGCAAAT
CCACGTGATC TGCTGGCGCT GGCCCAATGT TTTGGACGAC TGCCAAAGGT GCGGGCGGCC
ATGCAGCGGT TCACGTCGGC ACGATTCCTG GTGCTCCATG GACTACTTGA TGACCTTGCT
GACCTTCGCG ATCGCATCTT CACCACGCTG GTAGATGAGC CGCCTATCAC ACTGAACGAT
GGCGGGGTGG TGCGCGAGGG ACTTGATGCT GCGCTCGATG AACTTCGCAA CCTGAGCCAC
AACAGCAAGC AATTCATCGC GCAGATCGAA GAGCGCGAAC GCAAGCGCAC CGGGATCGGC
TCGCTGAAAA TCAAGTTCAA CAATGTCTTC GGGTACTACC TCGAAATCTC AAATGCGAAT
AAGCACCTTG CGCCGGCGGA CTACGAGCGC AAGCAGACGC TTGTAAACGC CGAGCGCTTC
ACGACGCCAG AATTAAAGGA ATACGAGGCG AAGGTGCTGG ATGCGCAGGA GAAGATCGTC
GAGATTGAAC GGCGGATCTT CGGCGAGCTG CGCACAGCGA TTGCGGCCGA GGCGCGGCGG
GTGCGACAAA CGGGCTTGGC CCTAGCCGAA GTGGATGTGC TGGCGAATTT CGCACACCTG
GCTGCAACGA GGAATTATTG TCGTCCGAAG TTCGATCAGA GCGGTGAGTT TGAGTTGATA
GAGGCGCGGC ATCCGGTGAT TGAGTTGCCG GAATTGACGG GTAGCGCGGA CCGCTTCGTG
CCGAACGATC TGTATTTGAA TGCGACAACC CATACGGTGA TTGTGCTTAC CGGACCGAAC
ATGGGTGGCA AGTCTACCTA TCTTCGCCAA GCAGCGTTGG TTGCCGTGAT GGCGCAGATG
GGCAGCTTCG TTCCAGCCCG CTCCGCGCGT CTGAGCGTGG TGGACCGGGT GTTCACACGC
ATCGGCGCCG CCGACAATCT CGCGCGCGGA CGATCGACGT TCATGGTCGA GATGACCGAG
ACGGCCGCGA TCCTGAATAC GGCGACGGAC CGTTCGCTGA TTCTGCTCGA CGAAGTAGGC
CGGGGCACTT CCACCTATGA CGGGCTGGCG ATTGCGTGGG CGTGTATCGA GTTCCTGCAT
GCACGAACGC GAGCGAAGGC TCTCTTTGCT ACGCATTACC ACGAGTTGAC CGTGCTCGCC
GATGAGTTGA GCGGTGTGAA GAACTATCAC GTGTCGGTGA AAGAGAGCGG CGGGAATGTC
GTGTTCCTAC GCAGGGTGGA ACCGGGCGCT GCGGACAAGA GCTACGGCAT CGAGGTCGCG
AAGCTTGCGG GATTACCTGC AGAAGTCATC GAGCGTGCGC GGGCGGTGCT GAAGGAGCAT
GAATCGGTCG AGCGGCAGGC GACCTCGCAT CTGTCGAAAG ACGAACGTGG ATCCGACTCT
ATGCAATTGA CGATCTTCAC TCCTCTGTCG CAAAAGATTG TGGACCAACT GAAGGAGACG
GACTTGAACC GCCTGACGCC GATCGAAGCG CTGAACCTAC TGCATGAGTT AAAGAAGCAG
TTGGACTAA
 
Protein sequence
MNDPTTPLMK QWAQVKRDHP NALLFFRLGD FYELFFDDAV IAARELQITL TARNKEKGNS 
VPMCGVPYHA AENYISKLIR RGFKVAVCDQ VEDPKLAKKL VKREVTRVMT PGTTADSQLG
SEENNFLAAV ASHGDFVGFA ALDLSTGEFR ATEFKGSDAR RRIQEELLTL RPREILYGSS
LPLFDAARPA NAVSGAMPKL ATVEGASWAE TPLEDWVFAP DYAIPLVENH FGVLSLEGFG
LANKASAASA AGAILHYVRQ TQRGSLHHVD RIGFYERQNC LVLDAVTVRN LELIEPLFTN
TGEGVTLFRA LDATMTPMGK RLLRAWMLRP SIDTSEINAR LDAIEMQVVD TLGREELRRA
MDGILDIERL LSRVTLETAN PRDLLALAQC FGRLPKVRAA MQRFTSARFL VLHGLLDDLA
DLRDRIFTTL VDEPPITLND GGVVREGLDA ALDELRNLSH NSKQFIAQIE ERERKRTGIG
SLKIKFNNVF GYYLEISNAN KHLAPADYER KQTLVNAERF TTPELKEYEA KVLDAQEKIV
EIERRIFGEL RTAIAAEARR VRQTGLALAE VDVLANFAHL AATRNYCRPK FDQSGEFELI
EARHPVIELP ELTGSADRFV PNDLYLNATT HTVIVLTGPN MGGKSTYLRQ AALVAVMAQM
GSFVPARSAR LSVVDRVFTR IGAADNLARG RSTFMVEMTE TAAILNTATD RSLILLDEVG
RGTSTYDGLA IAWACIEFLH ARTRAKALFA THYHELTVLA DELSGVKNYH VSVKESGGNV
VFLRRVEPGA ADKSYGIEVA KLAGLPAEVI ERARAVLKEH ESVERQATSH LSKDERGSDS
MQLTIFTPLS QKIVDQLKET DLNRLTPIEA LNLLHELKKQ LD