Gene Mmar10_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_3021 
Symbol 
ID4285588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp3304603 
End bp3307296 
Gene Length2694 bp 
Protein Length897 aa 
Translation table11 
GC content65% 
IMG OID638142517 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_758240 
Protein GI114571560 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCA TGTCGACCCT CCCCGTGCCG CGCGATCGCA CCTTTCCCTC GAGTGAGGGT 
GCAACCCCGA TGATGCAGCA ATTCCTCGAG CTGCGCGCCC AGGCGCCGGC CGATGCCTTG
CTGTTCTACC GGATGGGCGA CTTCTACGAG TTGTTCTTCG ATGATGCCGT GCGCGCTTCG
GCGGCCCTTG ATATCGCCCT GACCAAACGC GGGGAGCACC AGGGCGAGCC GATCTCGATG
TGCGGCGTCC CGGCGGCGAC AGCGGAGGCC TATCTCGCGC GCCTGATCAA GGCCGGTTTC
AAGGTTGCCG TCGGCGAGCA GATGGAAGAC CCGAAGACGG CCAAGGCCCG CGGCGGTTCG
AAAGCCGTTG TCCGACGGGC CATTACCCGT GTTGTCACGC CGGGCACGCT GACCGAGGAC
AGCCTGCTGG ACCCGCGCGT CTCCAACCGG ATTGCGGCAC TGGCCCAATT GGTGACCGGT
GAGGCAGCGC TGGCCTGGGC CGATGTCTCG ACCGGAGACT TCCGGGTTTC TCCGGTTGCC
ACCGAGAACC TGGCTGCCGA GATCGCCGCC ATGAGCCCGG CCGAATTGCT GGTCGAGGAG
CGGGGATTTG CCGAGGCCGC TATGTTGGCG CCGCGATCGA CCCTGACCCC CCTGCCAAAA
GCCAAATTTG ACCCGAGCTC GGCCGAGCGC CAGTTGAAGG ACCAGTTCAG GGTCCAGGAA
CTGACCGCTT TCGGAAACTT CACCAAGGCC GAATGCGCCG CGCTGGGCGC CTTGCTGGAT
TATCTGTCAC TGTCACAGGC CGGGGCACCG GCGAAGCTGG CGCCACCGCG CCAGGTCGCC
GCCGGAGCCT GTCTGGCCAT CGATCCGGCA ACTCGCGCCA GCCTCGAGAT CGAACGCACT
CTGTCAGGGT CACGTCAGGG GTCCCTGCTG GATGCCATCG ACCGGACGGT AACGGCACCG
GGCGCGCGCA AGCTGGCTGA ACGTCTGGCC AGACCATTGA CGAATGTCGC TGAAATCGAG
GCCCGTCTGG ACGCGATTGC CTGGTTCGAG CGCGCCCGGC CCGAACGTCG GGATCTGCGC
GATCGTCTGC GCCAGGCTGG AGATGCCGAG CGCGCCCTGT CGCGGTTATT GCTGGGTCGG
GGCGGACCGC GCGATCTCAA ATCACTCGCG GCGGCCTTGC AGGAGGGTGA GATCATCGCT
TCCCGTCTGC TCGACCGGAC GTTGGACACC CCGCCGACCC TGATCTCGGA AGGCCTTGAG
GCCCTGGTTC TTGGCGACAA GCCGGAACTG GCAGAGCTGA TCGCCGAGTT GGAACGCGCC
ATTGTCGACG AGCCACCGCT TCTCGCGCGC GATGGCGGTT TCATCGCCGA AGGCTGGCAG
GTTGAACTGG ACGAGTTGCG GCAATTGCGT GATGCGTCCC GGCGCGTGGT CGCGGGTCTG
CAGCAGACTT ATGCCGAGGC GGTGGGTGTC AGCGCCTTGA AGATCAAGCA CAATAATGTG
CTGGGTTACT TCGTCGAGGT CACCGCCAAA CATGGCGATG CCCTGATGGG GGACGACCGT
TTTATCCATC GTCAGACCAT GGCCAATGCG ATCCGCTTTT CGACCACCGA GCTGGCTGAG
CTGGAGGCCA AGATCGCATC GGCCGGTGAT CGCGCCCTGG CGATGGAGAT CGATGCCTTT
GCCGGATTGC GGGACCGGGT TGAGGCTCAG GCCGACCTGA TCCGTGGCGC GGCCCGGGCT
CTGGCCGAGT TCGACGTCGC GGCGAGCCTT GCAGAATGGG CGGAAGACAG CGAGGCCGCG
CGGCCGGTGA TGTCTCAGGA CAGTGTCTTC CACATTGAAG GTGGTCGACA TCCGGTCGTT
GAGCGGGCGC TCGCGAAGGC GGGGGACGGG CGTTTCACGC CGAATGACTG TCATCTGGAT
GGAGCCGGCG AAGAAGCCAA ACGGCTGACA TTCGTGACCG GCCCCAACAT GGCCGGTAAA
TCGACATTTC TGCGCCAGAA CGCCCTGATC CTGATGTTGG CGCAGGCTGG TTGCTATGTA
CCGGCGCGGG CGGCACGTAT CGGTGTTGCC GACCGCCTTT ACTCGCGCGT GGGCGCTGCT
GATGATCTGG CCCGCGGTCG CTCGACCTTC ATGGCGGAGA TGATCGAGAC GGCTGCCATT
CTCAATCAGG CAACAGCGCG CAGCTTTGTC ATTCTTGACG AAATCGGACG CGGCACCGCG
ACCTTTGATG GCCTGTCGAT CGCCTGGGCG GCTGCCGAGC ATCTGCATGC GGTGAATGGC
TGCAGGGCCC TCTTCGCAAC CCATTACCAT GAGCTGACAC GGCTCGCCGA CGATCTCGAC
GCGGCTGGCA ATGTCTCGCT CAAGGCCCGG GAATGGAAAG GTGAGCTGGT TTTTCTTCAC
GAAGTCGGAA GCGGAGCGGC GGACCGGTCT TACGGGATCG AGGTCGCGCG CCGGGCCGGG
TTGCCGCGGG TGTCAGTCAA GCGAGCCCAA GCGATCCTGG CACGGCTTGA GGCAGATGGA
GCCCCGGCTG CGGCCCTCAC CGACTTGCCG TTGTTTGCCC TGTCCGAACC GGAACCGGAG
CCCGTCATTT CAGAGGTCGA GACCCGTCTT GGTGAAATCG ATCCGGATGC GCTGAGTCCG
CGCGAGGCGC TGGAAGTCCT GTATGCGCTC AAGGCGATGA CGAAGAAGAC GTGA
 
Protein sequence
MNAMSTLPVP RDRTFPSSEG ATPMMQQFLE LRAQAPADAL LFYRMGDFYE LFFDDAVRAS 
AALDIALTKR GEHQGEPISM CGVPAATAEA YLARLIKAGF KVAVGEQMED PKTAKARGGS
KAVVRRAITR VVTPGTLTED SLLDPRVSNR IAALAQLVTG EAALAWADVS TGDFRVSPVA
TENLAAEIAA MSPAELLVEE RGFAEAAMLA PRSTLTPLPK AKFDPSSAER QLKDQFRVQE
LTAFGNFTKA ECAALGALLD YLSLSQAGAP AKLAPPRQVA AGACLAIDPA TRASLEIERT
LSGSRQGSLL DAIDRTVTAP GARKLAERLA RPLTNVAEIE ARLDAIAWFE RARPERRDLR
DRLRQAGDAE RALSRLLLGR GGPRDLKSLA AALQEGEIIA SRLLDRTLDT PPTLISEGLE
ALVLGDKPEL AELIAELERA IVDEPPLLAR DGGFIAEGWQ VELDELRQLR DASRRVVAGL
QQTYAEAVGV SALKIKHNNV LGYFVEVTAK HGDALMGDDR FIHRQTMANA IRFSTTELAE
LEAKIASAGD RALAMEIDAF AGLRDRVEAQ ADLIRGAARA LAEFDVAASL AEWAEDSEAA
RPVMSQDSVF HIEGGRHPVV ERALAKAGDG RFTPNDCHLD GAGEEAKRLT FVTGPNMAGK
STFLRQNALI LMLAQAGCYV PARAARIGVA DRLYSRVGAA DDLARGRSTF MAEMIETAAI
LNQATARSFV ILDEIGRGTA TFDGLSIAWA AAEHLHAVNG CRALFATHYH ELTRLADDLD
AAGNVSLKAR EWKGELVFLH EVGSGAADRS YGIEVARRAG LPRVSVKRAQ AILARLEADG
APAAALTDLP LFALSEPEPE PVISEVETRL GEIDPDALSP REALEVLYAL KAMTKKT