Gene Avi_0423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_0423 
SymbolmutS 
ID7388438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp359072 
End bp361723 
Gene Length2652 bp 
Protein Length883 aa 
Translation table11 
GC content61% 
IMG OID643650082 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002548297 
Protein GI222147340 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGAAC AGTATATTGA AATCAAGGCG AACAATCCCG GTTCGCTGCT GTTTTACCGG 
ATGGGCGATT TCTATGAGCT GTTTTTCGAG GACGCCGTCG AGGCGTCCCG GGCGCTCGGC
ATCACCCTGA CCAAGCGCGG GCAGCATCTG GGTCAAGACA TTCCGATGTG CGGTGTCCCT
GTACATGCCT CTGACGATTA TCTGCAAAAG CTGATCACGC TCGGGTTCCG TGTCGCCGTT
TGCGAACAGG TGGAAGACCC GGCTGAGGCA AAGAAGCGCG GCTCCAAATC CGTGGTCAAG
CGTGACGTGG TACGTCTGGT CACGCCCGGA ACGCTGACGG AAGAAAAGCT GCTGTCGCCG
TCGGAGACCA ATTATCTGAT GGCGCTGGCA CGGGTGCGCG GCAGCGGCGA TGAACTGGCG
CTGGCCTGGA TCGATATTTC CACCGGCGTT TTCCGTCTGG CAGAAACCAA TCCCACCCGT
CTGCTGGCGG ATATTTTCCG CATCGATCCG CGTGAAGTGA TCGTCGCCGA AACCCTGATG
CAGGACCCGG ACCTCAAGCC GGCCTTCGAT GTGCTTGGCC GCGTGGTCGT GCCGCAGCCA
TCGGTGCTGT TCGATAGCGC TTCGGCGGAG GGGCGCATCA CGCGCTATTT CAACGTCAAG
ACGCTCGATG GTTTCGGCGG GTTTTCACGG CCGGAAATGG CCGCGGCTTC GGCGGCGATT
GCCTATGTGG AAAAAACCCA GATGTCCGAG CGTCCGCCGC TTGGGTTGCC TGAGCGGCAA
TCCTCGTCCT CGACGCTGTT TATTGACGCG GCCACCCGCG CCAATCTGGA ATTGGTGAAG
ACCCTTTCCG GCCAGAAACA GGGTTCCCTG CTCAACACCA TTGACCGAAC CGTAACCGGG
GGCGGCGCGC GGCTGATGGC CGAACGGCTG ATGTCGCCTT TGACGGAGGT TGTGGCAATT
GCGCAGCGGC AGGATGCGGT TGCCTATCTG TTGACGGACG GTTTTCTGTG TGAACGGCTG
CGCGACCTTT TGAAACGTGC CCCGGATATG CCGCGTGCCC TGTCGCGGCT GGCGCTGGAC
CGGGGCGGGC CAAGAGATCT GGCGGCGATC CGGTATGGGT TGTCCACATC AGGCGATGTG
GCTGGGTTGC TGCGGGGGGC TGTCCTGCCG GATGAGCTTG CCTCGGCGCT CACTGATCTC
GAAATGCTGT CGCCGTCGCT CGAAAATCTG CTGGCCTCGC AACTGGCGGA AGATCTGCCA
TTGTTGAAAC GCGATGGCGG GTTTTTGAGA GAGGGCGCCG ATGAGGGGCT GGACGAGGTG
CGGGCGCTGC GTGACCAGTC GCGCCGGGTG ATCGCCGGGC TGCAATTGCA ATATGCCGAG
GAAACCGGTG TCAAGTCGCT GAAGATCAAG CATAACAATG TGCTGGGCTA TTTCATCGAA
GTCACCGCCG GCAATGCCGG GCCGCTGATC GAGGGTGAGG CGAAGGCCCG CTTTATCCAT
CGCCAGTCGA TGGCCAATGC CATGCGCTTT ACCACGACCG AACTGGCTGA TCTTGAAAGC
CGGATTGCCA ATGCGGCGGG ACAGGCGCTG GAAATCGAGC TTGCGGCGTT TGAGCGAATG
CGCCAGGCGG TGGTGACCGA GGCCGAGGCC ATCAAGAAAG CGGCGCGGGC GCTGGCGGTG
ATCGATGTGG CCGCCGGTCT TGCCGTGCTG GCCGAGGAGC AGGGCTATTG CCGGCCCCTG
GTCGATGACA GCAGGATGTT TTCCATTGTC GCCGGGCGTC ATCCCGTGGT CGAGCAGGCC
TTGCGCAAAC AGGCGGCCAG CCCTTTCATT GCCAATAATT GCGATCTTTC ACCGGTAGGC
GATCAGAAGC ACGGGGCGAT CTGGATGCTG ACCGGTCCGA ACATGGGCGG TAAATCCACC
TTTCTGCGCC AGAATGCGCT GATCGCCATT CTCGCCCAGA TGGGGTCGTT CGTGCCCGCC
GGGTCGGCGC ATATCGGCAT TGTCGATCGG CTGTTTTCCC GCGTTGGCGC CTCCGACGAT
CTGGCGCGGG GCCGTTCGAC CTTCATGGTC GAGATGGTAG AGACGGCGGC CATTCTCAAT
CAGGCGGGCG AGCGGTCTCT GGTCATTCTC GATGAGATTG GCCGTGGTAC GGCGACGTTC
GACGGCCTGT CGATTGCCTG GGCGACTGTC GAGCATCTGC ATGAGGTCAA TCGCTGCCGG
TCGTTGTTTG CCACGCATTT TCACGAGTTG ACGGCGCTGT CTGAAAAGCT TGTCCGGCTG
TCGAACGTCA CTATGAAAGT CAAGGAATGG CACGGCGACG TGATTTTCCT GCATGAGGTT
GGGGCCGGTG CCGCTGACCG TTCCTACGGC ATCCAGGTGG CGCGACTGGC CGGGCTGCCG
GGCATGGTGG TGGAGCGCGC CCGCGCCGTG TTGTCGCAGC TCGAAGATGC CGATCGCAAA
AATCCGGCCA GCCAGTTGAT TGACGACCTG CCACTGTTCC AGGTCAGTCA GAGGCGGGAG
AGCCGGACGG GGACAGGGGC GCAGGTGTCG GCGGTGGAAG AGGCATTGCG CAGCCTCAAT
CTCGATGACT TGACGCCAAG GCAAGCGATG GATGCGCTCT ACGATCTGAA AACCACTCTG
GCGAAATCCT GA
 
Protein sequence
MMEQYIEIKA NNPGSLLFYR MGDFYELFFE DAVEASRALG ITLTKRGQHL GQDIPMCGVP 
VHASDDYLQK LITLGFRVAV CEQVEDPAEA KKRGSKSVVK RDVVRLVTPG TLTEEKLLSP
SETNYLMALA RVRGSGDELA LAWIDISTGV FRLAETNPTR LLADIFRIDP REVIVAETLM
QDPDLKPAFD VLGRVVVPQP SVLFDSASAE GRITRYFNVK TLDGFGGFSR PEMAAASAAI
AYVEKTQMSE RPPLGLPERQ SSSSTLFIDA ATRANLELVK TLSGQKQGSL LNTIDRTVTG
GGARLMAERL MSPLTEVVAI AQRQDAVAYL LTDGFLCERL RDLLKRAPDM PRALSRLALD
RGGPRDLAAI RYGLSTSGDV AGLLRGAVLP DELASALTDL EMLSPSLENL LASQLAEDLP
LLKRDGGFLR EGADEGLDEV RALRDQSRRV IAGLQLQYAE ETGVKSLKIK HNNVLGYFIE
VTAGNAGPLI EGEAKARFIH RQSMANAMRF TTTELADLES RIANAAGQAL EIELAAFERM
RQAVVTEAEA IKKAARALAV IDVAAGLAVL AEEQGYCRPL VDDSRMFSIV AGRHPVVEQA
LRKQAASPFI ANNCDLSPVG DQKHGAIWML TGPNMGGKST FLRQNALIAI LAQMGSFVPA
GSAHIGIVDR LFSRVGASDD LARGRSTFMV EMVETAAILN QAGERSLVIL DEIGRGTATF
DGLSIAWATV EHLHEVNRCR SLFATHFHEL TALSEKLVRL SNVTMKVKEW HGDVIFLHEV
GAGAADRSYG IQVARLAGLP GMVVERARAV LSQLEDADRK NPASQLIDDL PLFQVSQRRE
SRTGTGAQVS AVEEALRSLN LDDLTPRQAM DALYDLKTTL AKS