Gene Information |
Locus tag | Moth_1711 |
Symbol | |
ID | 3833161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1749221 |
End bp | 1751626 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829636 |
Product | MutS2 family protein |
Protein accession | YP_430556 |
Protein GI | 83590547 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
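The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1749221
END_BP = 1751626
GENE_LENGTH_BP = 2406
PROTEIN_LENGTH_AA = 801


def span_length(start_bp, end_bp):
    """Length of a closed interval [start_bp, end_bp] (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus the terminal stop codon (assumed to be included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1751626 - 1749221 + 1 gives 2406 bp, and 2406 / 3 - 1 gives 801 aa, matching the listed values.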
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000187694 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.426303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAAATGG GGGCCAAAGG TTTGAATAAG ACGTACATTA AACTGGAACT GGATAAAATT CTGGCCCGGC TGGAGGGCTA TTGCCTATCG CCCCTGGGAG CGGAGAGGGT GCGTGCTCTG GAACCGGTCC GGGACCTGGA ACTGGTGCGC CGGCGCCAGG AAGCAACGGG AGAAGGGGAA CGGGTATTGC TGCGCTATCC GGATTTACCC CTGGGTAACT TCCAGGACAT CCGGCCGGAG ATCAACCGGG CGGCAGCCGG AGCCATCCTG GAGGGCCAGG AACTCCTCCG GGTGGCCACC ACCCTGGCTG CCGGCCGGCG CTTGCGCCAG TTCCTCCTGG GGCTGGAGGG CGAGTGGCCG GTTCTTAAAG GGCTGGCCGC CGGGATTGGC GACTTTCGCC AGCTGGAAAG GGCGATTACG GCCGCCATCG GCCCCGACGG CCTGGTCCTG GACCAGGCCA CGCCGCGCCT GGCGAGCCTG CGGCAGCAGG TTCGCCAGGC CCAGGAGAGG ATCAAAGAGC GCCTGGATAG CTACCTGCGT TCAACCGAGA TGCAGAAATA CCTCCAGGAT AACCTGTTCA CCATTCGCAA TGATCGCTAT GTCTTACCCG TCAAGCAGGA GTACCGCCAC CAGGTACCGG GGCTGGTTCA CGATCAATCG GCCAGCGGGG CCACCCTCTT CATCGAACCC ATGGCCCTGG TGGAGTTGAA TAACGAACTA CGCCGTCTCC AGACAGCCGA AGCCAGGGAA GTGGAGGCTA TTCTCCTGCA CTTGAGTAAA CTCGTAGGCG GGCAGAAAGA AGAAATCCTG GCCAGCCTGG CCGCCCTGGG GGAACTGGAT TTTACCCTGG CCAAAGGACG CCTCAGCCAG GCCATGGCCG CGGTGCCGCC GCGTCTCAAT GCCGGGGGGC GCTGGCGTAT CCGCCAGGGT CGTCATCCCC TCCTGGGCGG CCGGGTTGTA CCCGTTAGCC TGACCCTGGG CGAGGATTTT GATACCCTGG TCATTACCGG ACCCAACACC GGCGGCAAGA CGGTTACTTT GAAGACCATG GGCCTGTTTA CCCTGATGGC CCAGTGCGGC CTGCACCTGC CGGCGGCTGA CGGTACCGAA GTCGATGTCA CGGCAGCCGT CTATGCCGAC ATCGGTGACG AACAGAGTAT CGAGCAATCC CTGAGCACCT TCTCGGCCCA CATGCGTCAG ATTGTGGCCA TTGTCAGGGA AGTGGAAGCG GGGAGCCTGG TCCTCCTGGA CGAACTGGGA GCCGGTACCG ACCCTACTGA AGGAGCGGCC CTGGCCATGG CCATCCTGGA CTACCTGACC GGGGTGGGGG CCCGGACGGT AGCTACCACC CACTTCAGCG AACTCAAGGC CTACGCCTAT GCCACGCCCC GGGTAGAAAA CGCCGCCGTG GAGTTTGACA GCGAGACCCT GCAGCCGACT TATAAGCTCC TCATCGGCAC CCCCGGGGAG AGCAACGCCT TCGCCGTTGC CGGTCGTCTG GGACTGCCGC CGGCCCTCAT CGAACAGGCC CGTGGCTTCC TGAGCGAGGA GAACCGGCGG GTCAGCCGTC TGATCGAGGG ACTGACGGCC GACAGGCGGG CCAGCGCCCG GGAACGGGCC GAGGCCGAAT CCTTACGCCG CGAGGCGGAA GCCGCCCGGG AAGCCATGGA AAAGGAACGT AGGGAGTGGC AGCAGCAGGC TGCCAGGCAA TTGGAGAAGG CCCGGGAGGA AGCCCGGGCC ATCCTCCGCC GGGCCCGTTA TGAAGTCAGG GAGCTCATGG CCAGGGTGGA GAAGGCCCTG 
GCGGAGGAAA GCCTGCGCAG CCAGCAGCAG GTCCTCTCTA GAGCCCGCCA GCGGTTAAAG GAACTGGAGG ATGAGGTAGA AACCGGGATG GAACGCTACC AGCCGGTAGC AGGCGGCCAG CCGCCGGAGC ACCTCCGGGC GGGGGACAGG GTCTTCCTGG CTTCCTGGGG CCAGGTCGGA GAAGTTATCA GCCCTCCTAA CGAGCAGGGA GAGGTCCTGG TCCAGGTTGG CGCTCTCAAA GTGAATGTGC CGGTAAAGGA GCTTCGCCTG GTTAATAACG ACCATCACGA AAATAGAACT AAAACTAGAA GGAACGTCGC CGGTGCCGGC TGGACTGTCC AAGCGGCCGT CAATGACGAC ATCCGGCCGG AAATCGACCT GCGGGGGCTA ACCGTAGCCG AAGCCTGCCA TCAAGTAGAC GAATACCTGG ACGACGCCGT TCTGGCAGGT CTGAACCGGG TAAGTTTAAT CCATGGCAAG GGTACCGGCG CCCTACGAGT GGCCCTGCAG GACTACCTGC GCCAGCACCC CCTGGTCAAG GGCTTTCGCC TGGGCGGGGC CGGGGAAGGT GGCAGCGGGG TGACCATCGT CGATATCGGC CGTTGA
|
Protein sequence | MEMGAKGLNK TYIKLELDKI LARLEGYCLS PLGAERVRAL EPVRDLELVR RRQEATGEGE RVLLRYPDLP LGNFQDIRPE INRAAAGAIL EGQELLRVAT TLAAGRRLRQ FLLGLEGEWP VLKGLAAGIG DFRQLERAIT AAIGPDGLVL DQATPRLASL RQQVRQAQER IKERLDSYLR STEMQKYLQD NLFTIRNDRY VLPVKQEYRH QVPGLVHDQS ASGATLFIEP MALVELNNEL RRLQTAEARE VEAILLHLSK LVGGQKEEIL ASLAALGELD FTLAKGRLSQ AMAAVPPRLN AGGRWRIRQG RHPLLGGRVV PVSLTLGEDF DTLVITGPNT GGKTVTLKTM GLFTLMAQCG LHLPAADGTE VDVTAAVYAD IGDEQSIEQS LSTFSAHMRQ IVAIVREVEA GSLVLLDELG AGTDPTEGAA LAMAILDYLT GVGARTVATT HFSELKAYAY ATPRVENAAV EFDSETLQPT YKLLIGTPGE SNAFAVAGRL GLPPALIEQA RGFLSEENRR VSRLIEGLTA DRRASARERA EAESLRREAE AAREAMEKER REWQQQAARQ LEKAREEARA ILRRARYEVR ELMARVEKAL AEESLRSQQQ VLSRARQRLK ELEDEVETGM ERYQPVAGGQ PPEHLRAGDR VFLASWGQVG EVISPPNEQG EVLVQVGALK VNVPVKELRL VNNDHHENRT KTRRNVAGAG WTVQAAVNDD IRPEIDLRGL TVAEACHQVD EYLDDAVLAG LNRVSLIHGK GTGALRVALQ DYLRQHPLVK GFRLGGAGEG GSGVTIVDIG R
|
| |
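The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (it begins with TTG and ends with TGA, even though the gene lies on the minus strand), that the first codon is a genuine start codon and is therefore read as methionine, and that translation table 11 assigns the same amino acids to codons as the standard code. The helper name `translate_cds` is hypothetical.

```python
# Codon table built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence; spaces and line breaks in the input are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        # Assumption: the first codon is a valid initiator (here TTG),
        # so it is read as methionine rather than its usual amino acid.
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence listed above.
fragment = "TTGGAAATGG GGGCCAAAGG TTTGAATAAG"
print(translate_cds(fragment))  # MEMGAKGLNK
```

The output matches the first block of the listed protein sequence. Applying the same function to the full gene sequence should yield an 801-residue protein if the assumptions hold.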