Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_0730 |
Symbol | |
ID | 4460796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 888744 |
End bp | 891413 |
Gene Length | 2670 bp |
Protein Length | 889 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639701492 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_844863 |
Protein GI | 116748176 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.687248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAA TCACCCCCAT GATGCAGCAA TATCTCGAAA TCAAGGAGAA GTATCCGGAC GCGCTGCTGC TGTACCGGAT GGGGGATTTC TACGAGATGT TCATGGACGA TGCGGTGACG GCATCTGGGC TCCTCGAGAT CGCGCTCACC TCCCGCGACC GGCAGTCGGA AGTCAGGATT CCCATGTGCG GCGTTCCCTA TCACGCCGCC GAGGGGTACA TCGCCAGGCT CGTCTCGGCC GGGAAAAAGG TGGCCATCTG CGACCAGGTG GAGGATCCCA GGAAGGCGAA GGGGCTGGTG CGCAGGGAAG TCACGCGGGT GATCACTCCC GGGCTGGTCC TGGACGCGCA GAACCTCGCC GCCAAGCAGC CCAATTACCT TGCCGCGGTC TCGAATTCCA CGGCGGGCGA ACGTTTCGGG CTCGCCTTCC TGGACGTCTC CACGGCCGAA TTCAAGATGG TCGAGATTGA ATCCCGCGAA GCCCTCCTGG AGGAGCTGAT CCGGGTCTCG CCCCGCGAAC TGCTCCTCTC CGATGACGAC GAACATCCAT GGGCCGAGGA GCTCCCGAAG CTTTACGGAA TCGCCCTCAC CCCCCTGGGT GCGGACAGAT TCGACGGCAA GCGCGCCGAG GAAGCCCTGG TCGGCCACTT TCGGGTCCAT TCCCTCGAGG GATTCGGCAT TTCGGGGATG GACCTCGGAA TCCGGGCGGC GGGGGCCATC CTTGCTTACA TGCAGGCGAA TCTCCTCGGG TCGTGCGATC ACATCACGCG GCTTCTCCCA TACAGCCGCG GCGACTTCAT GATCGTGGAC GAAGCCGGCG TCCGCAACCT CGAGATATTC CACTCCCAGA GCTTCCAGGG CCGCAAGGGC TCGCTCATTG ACATCCTGGA CGAAACCAAA ACCGCCATGG GCGGGCGCAA GCTTCAGCAG TGGCTGCGGT ATCCCCTGCT CGATCTCGCG CGCATCAACA ACCGCCGGGA GGCGATCGCC GAACTGGCGG CGAACGCGCC CATGCGCGGC GAAACCCTGG GCCTGCTCAG CCGGATAAGC GACGTGGAAC GCCTCAACGG CCGCAACAGC ACGGGGACTT CGACGCCGCG GGACCTGGTG GCCCTCAAGA AGTCCCTGCA GAACCTCCCC GCCCTCGGCG CGGCGCTTGC CGAGCTCACC TCCCCAAGGC TCTCCGAGCT TCGAGCCCGC TGGGACGATC TCGCGGACGT CGCCGACATC ATCGAGCGGA CCCTCCTCGA TCCCCCTCCA CCGGGACTTG CCGCCGGGGG CGTCATTTCC GCCGGCGTCA GCGAAGAGCT CGACCATTTC GTGCGGCTGA GCCGCGACGC CAAGGGGTGG ATGGCGGACT ACGAGGTCCA GCAGCGCCGG GATACGGGCA TTTCCTCGCT CAAGGTCCGC TACAACAAGG TGTTCGGTTA TTACATCGAG ATTTCGAACG CGAACCTCAA CTCCGTCCCT GAACACTATT TTCGCAAGCA AACGCTGGTG AATGCGGAAC GATTCATCAC CGAGGAGCTC AAGACCTTCG AAACTCAGGT GCTCCAAGCC GAGGAGAAGC GGCTCGAGCT CGAGCAGCAG ATCTTCGCGG ATCTGCGCGC CCGGATCGCG CGGGAGGCCG GGCGCATCCA GGCCGCGGCC GACCGGATCG CCGACCTCGA TTGCGTGTCC GCCCTGGCCG AGGTTGCGTG CCGCTACGAC TACTGCCGCC CGGTGATGGA CGAGTCCGAC GCGATCCGCA TTCGCGACGG CCGTCACCCG GTGATCGAGC ATTACCTCAA AGACGGGACC TTCGTTCCCA ACGACCTGGA CATGGATCAG CGGGATCAGC AGGTGCTCGT CATAACCGGG CCCAACATGG CCGGAAAATC GACCATTCTC CGGCAGGCCG CCCTTATCGT CCTCATGGGA CACATCGGGA GTTTCGTCCC GGCATCGGAA GCGCACATCG GCCTGGTGGA CCGCATATTC ACCCGTGTGG GCGCATCCGA CGACCTGGCC CGCGGGCGTT CCACCTTCAT GGTGGAGATG CAGGAAACCG CCAACATTCT CCATCACGCC ACACCGCGCA GCCTTATCAT TCTCGACGAG ATCGGCCGCG GCACGAGCAC CTACGACGGG CTGAGCATCG CCTGGGCGGT TGCCGAACAT TTGCACGACT TCCAGGAAAA GGGTATAAAG ACGCTTTTTG CAACGCATTA TCATGAACTG ACGGAGTTGG CCCGCAGTCG TCCGAGGGTC AGGAATTTCA ACGTGGCCAT CCGGGAATGG CAGCAGGAGA TCCTGTTCTT TCACAAGCTG GTCCAGGGTG GAGCGAGCCG CAGCTACGGA ATCCAGGTGG CGCGACTGGC GGGCTTGCCG GAGGAAGTGA CCGGGCGCGC AAGGGAAATC CTCCAGCAGC TCGAATCCGG TCATGCGCCG TTTGCGGCCG CGCCGTCCGG CGCGGCCAGG CGCGGCCGCC CGGCGCGCGA GAAGGAACCC GGCATCCAGA TGAGCCTCTT TCAGCGTTCG CCGGAATGGC TCCGGGATCG CATTTTGGCC CTCGACCTGG ACAACATGAC ACCGATTGCC GCGCTGCAGA ACCTTCATGC ACTGAAAGAG CAGATACGGG GCTCCGCCGG GGAAGAACCG GCAAGCTGCG CTTCCAGGGG AAAGCGATGA
|
Protein sequence | MNKITPMMQQ YLEIKEKYPD ALLLYRMGDF YEMFMDDAVT ASGLLEIALT SRDRQSEVRI PMCGVPYHAA EGYIARLVSA GKKVAICDQV EDPRKAKGLV RREVTRVITP GLVLDAQNLA AKQPNYLAAV SNSTAGERFG LAFLDVSTAE FKMVEIESRE ALLEELIRVS PRELLLSDDD EHPWAEELPK LYGIALTPLG ADRFDGKRAE EALVGHFRVH SLEGFGISGM DLGIRAAGAI LAYMQANLLG SCDHITRLLP YSRGDFMIVD EAGVRNLEIF HSQSFQGRKG SLIDILDETK TAMGGRKLQQ WLRYPLLDLA RINNRREAIA ELAANAPMRG ETLGLLSRIS DVERLNGRNS TGTSTPRDLV ALKKSLQNLP ALGAALAELT SPRLSELRAR WDDLADVADI IERTLLDPPP PGLAAGGVIS AGVSEELDHF VRLSRDAKGW MADYEVQQRR DTGISSLKVR YNKVFGYYIE ISNANLNSVP EHYFRKQTLV NAERFITEEL KTFETQVLQA EEKRLELEQQ IFADLRARIA REAGRIQAAA DRIADLDCVS ALAEVACRYD YCRPVMDESD AIRIRDGRHP VIEHYLKDGT FVPNDLDMDQ RDQQVLVITG PNMAGKSTIL RQAALIVLMG HIGSFVPASE AHIGLVDRIF TRVGASDDLA RGRSTFMVEM QETANILHHA TPRSLIILDE IGRGTSTYDG LSIAWAVAEH LHDFQEKGIK TLFATHYHEL TELARSRPRV RNFNVAIREW QQEILFFHKL VQGGASRSYG IQVARLAGLP EEVTGRAREI LQQLESGHAP FAAAPSGAAR RGRPAREKEP GIQMSLFQRS PEWLRDRILA LDLDNMTPIA ALQNLHALKE QIRGSAGEEP ASCASRGKR
|
| |