Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbd_1159 |
Symbol | |
ID | 3673231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 1212400 |
End bp | 1214952 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637709843 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_314917 |
Protein GI | 74317177 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0179722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCG CGCCCGCGGC CCACACGCCG ATGATGCAGC AGTATCTCGG CATCAAGGCA CAGCATCCCG ACATGCTGCT GTTCTATCGC ATGGGCGATT TCTACGAGCT GTTCTTCGAG GACGCGGAGA AGGCCGCGCG GCTGCTCAAC ATCACGCTCA CGACGCGCGG CGCTTCGGCC GGCAGCCCGA TCAAGATGGC CGGCGTGCCC TACCACAGCG CCGAGCAATA CCTGGCGCGC CTGCTCAAGC TCGGTGAATC GGTCGTGATC GCGGAACAGG TCGGCGACCC GGCGGCATCG AAAGGCCCGG TCGAGCGCCG CGTCAGCCGC GTCGTCACCC CCGGCACGCT GACCGACGCG GGGCTTCTCG ACGAGACGCG CGACGCGCTG ATCATGGCGA TCGCCGTCGC CGGCGACGTG CTGGGCGTGG CCTGGCTCAA CCTCGCTGCC GGCCGTTTCC AGGTCACCGA ACTCGACCGC ACTGCGCTGC CCGCCCTGCT CGCGCGCGTG CGGCCGGCCG AGATCCTTGC GCACGAGCAC CTCGACCTCG CTGCCGACTG TCCCGTGCGC CGGCTCGATG CTTGGCAGTT CGATGCGGAC GGGGCAAGCA AGCGCCTCGC GCGACAGTTC GGCAGCTGCG ATCTCCAAGG CTTCGGCGTG GCCGAAATGC ACTGCGCCAT TGCCGCCGCC GGCGCGCTGC TGGGCTACAT CGAGACGACG CAGCGCACCG CCCTGCCCCA CCTTCTGTCG ATCCGCGCCG AGCGCGACAG CGACTTTGTG CTGCTCGACG CGGCGACGCG GCGCAACCTC GAACTGACCG AGACGTTGCG CGGCGACGCG GCGCCGACCT TGCGCTCGGT GCTCGATACC ACCTGCAGCG GAATGGGAAC GCGCCTGCTT CGTCACTGGC TGCACCACCC CTTGCGCGAC CGCGCCGCCG TCGCCGCCCG TCGTGACGCG ATCGGCGTGC TCGCCGCCGC GCCGGACAGC GCCGCCCGCC TCGCCGATCT GCTCAAACGC TGCGCCGACG TCGAACGCAT CGGCGGCCGT ATCGCGCTGA AGAACGCACG GCCGCGCGAT CTCTCCGGCC TGCGCGACAC GCTCGCCCTG CTCCCCGAAC TCGCCGCGGC GTTGCCCACC GATGGTGCAC GCCTCGCCGC CTTGCGCGAC GCGGTCGCCG CCACACCCGA CGTGCACGCG CTGCTGGTCC GCGCGATCCA GCCCGAGCCC GCCAGCGTGC TGCGCGAGGG TGGCGTGATC GCCGACGGCT ACGACGCCGA ACTGGACGAA CTGCGGGCCT TGACGCGCGA TGCCGGCGCC TTCCTGCTCG AACTCGAGAC CCGCGAACGT GCGCGCTCGG GCATCGCGAC ACTCAAGGTC GAGTACAACA AGGTGCACGG CTTCTACATC GAAGTCGGGC GGGCACAGGC CGAGCGGGTG CCGGACGACT ATCGCCGCCG GCAGACGCTC AAGAACGTCG AACGCTATCT GACGCCCGAG CTCAAGGCCT TCGAGGACAA GGCCTTGTCG GCGCAGGAGC GCGCGTTGGC GCGGGAGAAG GCCTTGTTCG AGGCCTTGCT CGACACGCTG ATCCCGCACG TCCCCAACCT GCTGTCGATC GCGTCCGCCC TCGCCGAAAT CGACGTGCTG GCCAGCCAGG CGGAACGCGC GAGTACGCTC AGGCTCTGCG CGCCCGAGTT CAGCGACGAC CCGTGCATCG TGATCCGCGG CGGGCGGCAT CCGGTCGTTG AAGCACAGGT CGAGCATTTC ATTCCCAACG ACGTCGTCCT CAATCGCACG CGCCAGATGC TCCTGATCAC GGGGCCCAAC ATGGGCGGCA AGTCGACCTA CATGCGTCAG GTCGCGCTGA TCACGCTCAT GGCGTGCTGC GGCTTGTGGG TACCGGCAGC TTCGGCCAGG ATCGGTGCCA TCGACCAGAT CTTCACCCGC ATCGGCGCGT CGGACGACCT GGCGGGCGGG CGTTCGACCT TCATGGTCGA GATGACCGAG ACGGCGAACA TTCTGCACAG CGCGACCGCC GACAGCCTCG TGCTGCTCGA CGAAATCGGC CGCGGCACCT CGACCTTCGA CGGCCTCGCC CTCGCGTGGG CCGTCGCGCG TCATCTCGTC AGTGCGACCC GCGCGTTCAC GCTGTTCGCG ACGCATTATT TCGAATTGAC CCAGCTCGCG CAGGAATACC GCCAACTCGC CAATGTTCAT CTCGACGCCA AGGAACACGG CGCGGACCTG GTTTTCCTCC ACGCCGTCGA GGACGGCCCG GCCTCGCAGA GTTACGGCAT CCAGGTCGCA CGGCTTGCCG GCGTGCCGGG CCCCGTCATC CACGCGGCGC GGCGCCGTTT GCGCGAACTC GAAGACGCGC AATTGCAGCC GGGCCCGCAA GGCGACCTGT TCGCCGCTCA CCTGCCCAGG GACGAAGCCC CGCCGCACCC GGCGCTCGAC CAGCTGCGCG AACTCGACCC CGACACGCTC ACGCCCAAGG CCGCACTCGA CATGCTTTAC GCATTGAAAG CCTTGACCGA CAGCGAACCG TGA
|
Protein sequence | MSIAPAAHTP MMQQYLGIKA QHPDMLLFYR MGDFYELFFE DAEKAARLLN ITLTTRGASA GSPIKMAGVP YHSAEQYLAR LLKLGESVVI AEQVGDPAAS KGPVERRVSR VVTPGTLTDA GLLDETRDAL IMAIAVAGDV LGVAWLNLAA GRFQVTELDR TALPALLARV RPAEILAHEH LDLAADCPVR RLDAWQFDAD GASKRLARQF GSCDLQGFGV AEMHCAIAAA GALLGYIETT QRTALPHLLS IRAERDSDFV LLDAATRRNL ELTETLRGDA APTLRSVLDT TCSGMGTRLL RHWLHHPLRD RAAVAARRDA IGVLAAAPDS AARLADLLKR CADVERIGGR IALKNARPRD LSGLRDTLAL LPELAAALPT DGARLAALRD AVAATPDVHA LLVRAIQPEP ASVLREGGVI ADGYDAELDE LRALTRDAGA FLLELETRER ARSGIATLKV EYNKVHGFYI EVGRAQAERV PDDYRRRQTL KNVERYLTPE LKAFEDKALS AQERALAREK ALFEALLDTL IPHVPNLLSI ASALAEIDVL ASQAERASTL RLCAPEFSDD PCIVIRGGRH PVVEAQVEHF IPNDVVLNRT RQMLLITGPN MGGKSTYMRQ VALITLMACC GLWVPAASAR IGAIDQIFTR IGASDDLAGG RSTFMVEMTE TANILHSATA DSLVLLDEIG RGTSTFDGLA LAWAVARHLV SATRAFTLFA THYFELTQLA QEYRQLANVH LDAKEHGADL VFLHAVEDGP ASQSYGIQVA RLAGVPGPVI HAARRRLREL EDAQLQPGPQ GDLFAAHLPR DEAPPHPALD QLRELDPDTL TPKAALDMLY ALKALTDSEP
|
| |