Gene Saro_3293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3293 
Symbol 
ID3915940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3512607 
End bp3515183 
Gene Length2577 bp 
Protein Length858 aa 
Translation table11 
GC content69% 
IMG OID640446078 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_498562 
Protein GI87201305 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0310827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCGC AATATCTTGC GCTAAAGGAC CAGGCTGGCG ACTGCCTGCT GTTCTATCGC 
ATGGGCGACT TCTTCGAGCT TTTCTTCGAC GACGCGAAGG TCGCCGCACA GGTTCTCGAC
ATAGCCCTCA CCAGCCGGGG CGAGCATGGC GGCGCGCCCA TTCCGATGTG CGGCGTGCCG
GTCCATTCGG CCGAGGGATA CCTTGCCCGG CTGATCAAGG CGGGGTGCCG CGTCGCCATC
GCGGAACAGG TGGAAACCCC CGAGGAAGCG AAGAAACGCG GGGGTTCCAA GGCTCTCGTC
GCGCGCGACA TCGTCCGCTT CGTCACGGCC GGGACGCTGA CCGAGGAGGC GCTGCTCGAA
CCGCGCCGGG CCAACGTGCT TGCGGCGGTG TGCGAAGTAC GCGGCCTCAT CGGCATCGCC
GCCTGCGACA TCTCGACCGG CCGGATGGAG CTGGAGGAAT GCGCCGCCGA CCAGATTGGC
GCGGCTCTTG CCCGGCTTGG TGCGAGCGAG ATCGTCGCAC CCGATTCCTG GGATGACCGG
CCCTTCGATT GCGTTCCGCG TCCGAATCGG ACCTTCGCCA GCGAAGAGGG CGAAGCGCGA
CTCAAGGCCG TGCACGGCGT GTCGACCCTC GACGGCTTCG GCCAGTTCAC CCGGGCGATG
CTCTCGGCGG CCGGCGGGCT GGTCACCTAC CTCGACCATG TCGGGCGCGG CGCGCTGCCG
CTCCTGCTGC CCCCGGTGGC GCGGGAAGCG GGCACCCACA TGGCAATGGA CGAGGCCACG
CGGGCAAGCC TCGAGATCCT GAACAGTTCC ACCGGGACGC GGCGGGGCAG CCTTGTCGAG
GCGATCGACC GCTGCGTCAC GGGCGCCGGC GCGCGCCTCC TGGCGGAAGA CCTCTCCGCG
CCGCTGACCG ACGCACGCGC GATCAACCGC CGCCTCGAAA TGGTCAGCTG GCTCCACGAC
GATCCGCTGC TGAGGGGCGA CATCCGCGCC ATCCTGCGCT CGCTGCCCGA CGTCGGGCGT
GCGCTTGGCC GCGTCGTCGC GGGGCGTGGA AGCCCTCGCG ATCTCGGGCA ATTGCGCGAC
GGCCTGTCCG AAGCGCGGCG GCTGCACGGT CTGCTGCACG CTCGCGCTGA CCGGCCCGAA
CCGGTCCACG CGCTGCTCCC CTCGCTCGCC GGCCACGGCG CACTTTGCGA CCTCTATGCC
CGTGCGCTGG TCCCTGCCCC TCCGACCGAA AGGTCACAGG GGGGCTACAT CGCCGAGGGC
TACGACGCGG CGCTCGATGA ACTGCGCCGC ATATCGGGCA ACGCCCGCCG CGCGATTGCC
GCGCTGGAGG CCAAGTACCG CGACGACACC GGGATCACTG CCCTCAAGAT CCGCCACAAC
GGTGTGCTCG GCTATTTCAT CGAGGTTCCC GCAAAGCACG CCGACCGGTT GATGGCGCCC
GATTCCGGTT TCACCCATCG CCAGACCATG GCTGGAGCCG TGCGTTTCAA CGCACTGGCG
CTGCATGAGG AGGCGAGCCG CATCGCCGAG AGCGGCGGAC ACGCGCTGGC AGCGGAAGAA
GCACACTTCG AGGACCTCGT CGGCCACGCG GTGCGCGCGA AGGAGGCGAT CGCGGCCACC
GCCGCCGCGC TTGCGCGCAT CGACGTCGCC GCCGGTCAGG CCGAACGCGC TGCCGAAGGC
GGCTGGGCCC TGCCGCGCGT GGTAGACGAG CCTTGTCTCG AAATAAGTGG CGGGCGCCAT
CCGGTCGTGG AAGCGGCGCT TGCCGCCAAG GGCGAGCGCT TTGTCGCCAA CGACTGCGCG
CTCGGGCCGC AGGACCGGCT GTGGCTGGTC GGAGGGCCTA ACATGGGCGG CAAGTCCACG
TTCCTGAGGC AGAACGCGCT GATCGTACTG CTCGCCCAGG CAGGCGGCTT CGTTCCGGCA
CGGTCGGCGA CAGTGGGCCT CGTCGACCGC CTGTTCAGCC GCGTCGGCGC ATCGGACAAT
CTCGCGCGCG GCCGCTCGAC CTTCATGGTC GAGATGGTCG AGACGGCAGC GATCCTCAGC
CAGGCAACGG ACCGCAGCTT CGTCATTCTC GACGAAGTCG GGCGCGGCAC TTCGACCTAC
GACGGACTCG CGCTCGCCTG GGCGGTAGCC GAGGCGGTCC ACACCATCAA CCGCTGCCGC
TGCCTTTTCG CCACGCACTA CCACGAACTC GCCCGCCTCG CCGAAAGCTG CGACGCCCTC
TCGCTCCATC ACGTCCGCGC GCGCGAGTGG AAGGGCGACC TCGTCCTGCT GCACGAACTG
GCCGATGGTC CGGCCGACAA GTCCTACGGC CTTGCCGTGG CCCGCCTCGC CGGCGTTCCC
GCGCCCGTGA TCAAGCGCGC CAAGTCGGTG CTGGAGAAGC TGGAGAAAGG CCGCGCCGCC
ACCGGCGGGC TGGCGGCCGG GCTCGACGAC CTGCCCCTCT TCGCCGCCGC CATCGAGGCC
GCCGAGGAAA AGGTCGATGC CCTTCGCGAA CGCCTCAACG GCCTCGACAT CGACGCACTG
TCCCCTCGCG AGGCTCTGGA CCTGCTCTAC GAACTGAAAG CCCAGGCCAA TGGTTGA
 
Protein sequence
MMAQYLALKD QAGDCLLFYR MGDFFELFFD DAKVAAQVLD IALTSRGEHG GAPIPMCGVP 
VHSAEGYLAR LIKAGCRVAI AEQVETPEEA KKRGGSKALV ARDIVRFVTA GTLTEEALLE
PRRANVLAAV CEVRGLIGIA ACDISTGRME LEECAADQIG AALARLGASE IVAPDSWDDR
PFDCVPRPNR TFASEEGEAR LKAVHGVSTL DGFGQFTRAM LSAAGGLVTY LDHVGRGALP
LLLPPVAREA GTHMAMDEAT RASLEILNSS TGTRRGSLVE AIDRCVTGAG ARLLAEDLSA
PLTDARAINR RLEMVSWLHD DPLLRGDIRA ILRSLPDVGR ALGRVVAGRG SPRDLGQLRD
GLSEARRLHG LLHARADRPE PVHALLPSLA GHGALCDLYA RALVPAPPTE RSQGGYIAEG
YDAALDELRR ISGNARRAIA ALEAKYRDDT GITALKIRHN GVLGYFIEVP AKHADRLMAP
DSGFTHRQTM AGAVRFNALA LHEEASRIAE SGGHALAAEE AHFEDLVGHA VRAKEAIAAT
AAALARIDVA AGQAERAAEG GWALPRVVDE PCLEISGGRH PVVEAALAAK GERFVANDCA
LGPQDRLWLV GGPNMGGKST FLRQNALIVL LAQAGGFVPA RSATVGLVDR LFSRVGASDN
LARGRSTFMV EMVETAAILS QATDRSFVIL DEVGRGTSTY DGLALAWAVA EAVHTINRCR
CLFATHYHEL ARLAESCDAL SLHHVRAREW KGDLVLLHEL ADGPADKSYG LAVARLAGVP
APVIKRAKSV LEKLEKGRAA TGGLAAGLDD LPLFAAAIEA AEEKVDALRE RLNGLDIDAL
SPREALDLLY ELKAQANG