Gene Daro_2347 details


Gene Information

Locus tag: Daro_2347
Symbol:
ID: 3566067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2527689
End bp: 2530271
Gene Length: 2583 bp
Protein Length: 860 aa
Translation table: 11
GC content: 64%
IMG OID: 637680814
Product: DNA mismatch repair protein MutS
Protein accession: YP_285553
Protein GI: 71907966
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
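
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2527689, 2530271)
print(length, protein_length_aa(length))  # prints "2583 860"
```

Under these assumptions the computed values agree with the Gene Length (2583 bp) and Protein Length (860 aa) reported in this record.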


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.836402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.482784
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAAAG ACAAAGCCCC CGACCTGTCC AAGCACACAC CGATGATGCG CCAATATCTG 
GCGCTCAAGG CCAATCATCC GAACACCCTG CTGTTTTACC GGATGGGCGA TTTTTATGAG
CTGTTCCATG AGGATGCCGA AAAGGCTGCA CGCCTGCTCG ACATCACACT AACGACGCGC
GGCCAGTCAG CCGGCGTGCC GATCAAGATG TGCGGCATTC CCTTCCACTC GCTGGAGCCT
TATCTGGCCC GTCTGGTCAA GCTGGGCGAA TCGGCGGTCA TCTGCGAGCA GATCGGCGAT
CCGGCGACCA GCAAGGGGCC GGTCGAGCGC GCCGTGGCGC GCATCGTGAC GCCCGGAACG
CTGACCGATG CGGCATTGAT CGACGACAAG CAGGATCTTT GGCTGCTGGC AGTCACGACG
CACCGCAACA CGGCCGGTAT CGCCCGCCTG AATCTGGCCA GTGGCGAATT TATCCTGATC
GAGGTACCCA CCGAGCAAAT ACCGGCGACG CTGGAGCGCA TCCGGCCGGC TGAAATTCTT
TATCCCGAAA GCTGGACGCC AAATTTCGGC GTCGACGTTG CCCGCACTCG CCAGCCGGAC
TGGTATTTCG AATTCGACTC GGCGCGCCGC CTGCTTTGCG ACCAGTTCGA GGTCGCTTCG
CTCGCCGGTT TTGGTGCCGA GGGCCTGAAG CCGGCCATTG CCGCTGCCGG GGCCCTGCTG
CAATACGCCC AGGCGACGCA GTCCGGCAAG TTGCCGCATC TGCGCGGCTT GACCGTGGAA
ATCGAGGGCG CCTATCTTGG CCTCGATCTC GCCACCCGGC GTAACCTGGA GCTGACCGAG
ACCCTGCGCG GCCAGCCGTC GCCGACCCTG TTCTCTCTGC TCGACAACTG CGTGACCAGC
ATGGGATCTC GTCTCTTGCG CCACACGCTG CACCACCCTC TGCGCGCGCG CGATATTCCC
GCTGCCCGTC ATGGCGCGGT CGAAGCCTTG CTAGAAGACT ACGGCCGACT GGGCAACGAG
GTCAGAAAAG CCCTGCGCGG CATCGCTGAC ATCGAGCGCA TTGCCGGCCG TGTCGCGCTG
CGCAACGCCC GTCCGCGCGA CCTGGCCAGC CTGCGCGAAT CGGTGGCGCG CCTTGAGGGC
CTGCGGGCGC CATTGTCTGA CAGCCCCGCA CCGCTGATTG CCCAGCTCTT TACCGAACTC
GAAACGCCCT ACCCGGCCCT CGAACTTCTG GTTCGTGCCA TCGCGGCCGA ACCCGGCGCC
CAGGTGCGCG ACGGCGGCGT CATTGCCCCG GGCTACGACC CTGATCTCGA CGAACTGCGC
TCGCTGAACG ACAACTGTGG CGCCTTCCTG GTTGATATGG AAGCCCGAGA GCGCGAGCGC
AGTGGCATTG CCAGCCTCAA GGTCGAATAC AACAAGGTGC ATGGCTTCTA CATTGAGGTC
ACCCATGCCA ATGTCGACAA GATTCCCGAC GACTACCGCC GCCGTCAGAC GCTGAAGAAT
GCGGAGCGCT ACATCACGCC GGAACTGAAA GCCTTCGAGG ACAAGGCGCT GTCGGCCCAG
GAACGCTCGC TGGCCCGTGA GAAGCTGCTC TATGAGGCCA TTCTCGATGC GCTGCTACCA
GTTGTCCCGA CCCTGCAGAC AATCGCTCGC GCCATCGCCC AACTCGACCT GCTGGCCGGC
TTCGCCGAGT CGGCGCTGAA GCGCAACTGG TGCAAACCGG AATTCGCCGC CGAAACCCAG
TTAAGCATCA CCGGCGGCCG TCATCCGGTG GTCGAGGGCG AATTGACCAA CCAGGCTGAA
ACCTTCATCG CCAACGACTG CCTGCTCGCC GAAAACCGCC GCCTGCTGCT GATCACCGGC
CCGAACATGG GCGGTAAATC GACCTACATG CGCCAGGTGG CGCTGATCGC CCTGCTTGCC
CACATTGGCT GCTACGTGCC GGCCGACCGC TGCGTACTCG GCCCGCTCGA CCGCATCTTC
ACTCGCATCG GCGCTTCCGA CGATCTGGCT TCAGGACGTT CGACCTTCAT GGTCGAGATG
ACCGAGGCGG CCGCCATCCT GCACCACGCG ACGAATCAGA GCCTGGTGCT GATGGACGAA
ATCGGACGCG GCACATCCAC GTTTGACGGC ATGGCGCTGG CTTTCGCCAT CCTGCGCCAC
CTGATCGAGA AAAACCAAAG CCTGACGCTA TTCGCCACGC ACTATTTCGA ATTGACGCGG
CTGTCGCACG AGTACTCCGA ACTGGCCAAC GTCCATCTCG GCGCCGTCGA GCACAACGAC
CGCATCGTCT TCATGCATGC CGTCGAAGAA GGCCCGGCCA ACCAGAGTTA CGGTATCCAG
GTCGCCGCAC TGGCCGGCAT CCCAACCGCC GTCGTCCGTG CTGCACGCAA GCAACTGCGC
GAATTCGAAC AGCGGGCGGC CGTCGATCCG CTGCAGCCCG ACCTCTTCGC TCAGGGCGCA
CCCGAGCCGG CCGAACCGGA GCCGCATCCA GTCGTAGAGC AACTCGCCGC CATCGATCCG
GACAGCCTGA CGCCGCGCGA AGCGCTCGAT GCCCTGTATG CCTTGAAAGG CCTTTTGCGG
TGA
 
Protein sequence
MVKDKAPDLS KHTPMMRQYL ALKANHPNTL LFYRMGDFYE LFHEDAEKAA RLLDITLTTR 
GQSAGVPIKM CGIPFHSLEP YLARLVKLGE SAVICEQIGD PATSKGPVER AVARIVTPGT
LTDAALIDDK QDLWLLAVTT HRNTAGIARL NLASGEFILI EVPTEQIPAT LERIRPAEIL
YPESWTPNFG VDVARTRQPD WYFEFDSARR LLCDQFEVAS LAGFGAEGLK PAIAAAGALL
QYAQATQSGK LPHLRGLTVE IEGAYLGLDL ATRRNLELTE TLRGQPSPTL FSLLDNCVTS
MGSRLLRHTL HHPLRARDIP AARHGAVEAL LEDYGRLGNE VRKALRGIAD IERIAGRVAL
RNARPRDLAS LRESVARLEG LRAPLSDSPA PLIAQLFTEL ETPYPALELL VRAIAAEPGA
QVRDGGVIAP GYDPDLDELR SLNDNCGAFL VDMEARERER SGIASLKVEY NKVHGFYIEV
THANVDKIPD DYRRRQTLKN AERYITPELK AFEDKALSAQ ERSLAREKLL YEAILDALLP
VVPTLQTIAR AIAQLDLLAG FAESALKRNW CKPEFAAETQ LSITGGRHPV VEGELTNQAE
TFIANDCLLA ENRRLLLITG PNMGGKSTYM RQVALIALLA HIGCYVPADR CVLGPLDRIF
TRIGASDDLA SGRSTFMVEM TEAAAILHHA TNQSLVLMDE IGRGTSTFDG MALAFAILRH
LIEKNQSLTL FATHYFELTR LSHEYSELAN VHLGAVEHND RIVFMHAVEE GPANQSYGIQ
VAALAGIPTA VVRAARKQLR EFEQRAAVDP LQPDLFAQGA PEPAEPEPHP VVEQLAAIDP
DSLTPREALD ALYALKGLLR
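
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus two basic sanity checks; the helper names are hypothetical, and the stop-codon set assumes translation table 11 as listed in this record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage with the first two blocks and the final codon of the gene sequence.
fragment = join_blocks("ATGGTTAAAG ACAAAGCCCC\nTGA")
print(fragment)
```

Pasting the full gene sequence into `join_blocks` and passing the result to `gc_fraction` and `looks_like_complete_cds` gives a quick consistency check against the GC content and length fields of the record.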