Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1500 |
Symbol | |
ID | 7408159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1583184 |
End bp | 1585775 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643715863 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002573371 |
Protein GI | 222529489 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAGT TAACTCCTAT GATGCAGCAA TATATGGAAA TAAAACAGAA GGTGAAAGAT TGTATTTTAT TTTTTAGGCT TGGCGATTTT TATGAGATGT TTTTTGAAGA TGCGATTGTG GCGTCTAAAG AGCTTGAGAT AGCTTTAACC AGCAGAGATT GTGGAAACAA TGAAAAAGCT CCTATGTGCG GTGTGCCGTA CCATTCTGCG ACCAGTTACA TTGCAAAGCT TATCGAAAAA GGTTACAAGG TTGCTATCTG CGAACAGGTG GAAGACCCAA AGCTTGCAAA GGGAATTGTT AAAAGGGAAA TTACAAGAAT AATAACTCCA GGCACATTTA TTGACGATAA TATTTCGACA GCCAATAATT TCATATGTTG TATATCAAAA GACAGATCTG AATTTGCCCT GACATTTGTA GATGTTTCAA CTGGTGAGAT GTACTCTTGC CTTCTTGAAG AGGACCTTCA GAAACTGGTA AATGAAATTG GCAAATACAG TCCCAGTGAA ATTTTAATTT CAAATATAGA AGATGAGCTT TATGAATTTT TGAAGAAAAA CTGCACTTCT TTTGTGCAGA TGATAGAGTT TGTGGATTTA CAAAAGTGCC ATGAGATTAT TGAAAATCAG ATAAATGTTG GCAAAATAGA TGAGAAGCTG ATTTTGAGCG TAGGAAATTT GCTAAAATAC TTAACAGAGA CCCAAAAAAT TTCCTTTGAT TATATAAGGA GGTTTGAATT TTACAGAGTC CAAAACTATC TTCAAATTGA CATAAACACA AAACGAAATT TGGAGCTCAC AGAGAGTATT ATTCAGCGCT CCAGAAAGAA TAGCCTTCTT GGTATTTTGG ATCAAACGAA GACTTCAATG GGTTCAAGGC TATTGAAAAA ATGGATTGAA AGACCTCTTA TTGACATTAT TGAGATTAAT AAAAGGCTTG ATAGTGTTGA GGAGCTCAAA TCAAACTATT CCACTTTAGT GCAGGTAGAA GAACTTTTGA GCAGAATGTA TGATATAGAA AGACTTTCCT CAAAGTTTGC ATATAAGAAT GTGAATGCTA AAGATTTGCT GAGTCTAAAA AAGTCTATTG AAGTGTTGCC AACTTTAAAA CAATTTCTTT CTTCATTTGA TTCAGAATTA TTAAAAGAGA TTTATGAAGG TCTTGATACA TTAGAAGATA TATATGCGCT TATTGACAGT TCTATAAATG AGGATGCACC TGTGTCCCTA AAAGAGGGTG GAATAATTAA AGAGGGTTTT AACGAAGAAG TAGATAGGCT GAGAAATATA TCTAAAAATA GCAAAGAGCT TTTAGTTGAG TATGAAGAAA AAGAGAGGAA CCTCACAGGT ATAAAAAATC TCAGAATTGG TTATAACAAG GTTTTTGGAT ACTATATTGA GGTGACCAAG TCAAACTACT CTCTTGTTCC TGACAGATAC ATTCGGAAAC AAACTCTTGC AAATGCAGAG AGGTATATAA CAGAGGAACT CAAAAAATTG GAAGATGAAA TATTGGGTGC TGATCAAAAA CTTATCGAAC TTGAATACCA GCTTTTTTGC GAAATAAGGG ATAGGATTGA GGCTCAGATT GAAAGAATTC AAAAGACAGC AAGCAATATT GCCAACTTGG ATGTTCTGTG TTCATTTGCC CGTATTGCAA TTGACAATGA GTATGTCAGG CCAAATGTTT ACTTAGGGGA TAGAATATAT ATTAAGAACG GTAGACACCC AGTTGTTGAA AAGATGATAG GCAGGGGCAA TTTCATTCCA AATGACACTG AACTTGATCA GGCAGAAAAC AGAGTTTTGA TTATAACTGG TCCAAATATG GCTGGTAAGT CTACATACAT GAGACAGGTA GCCTTAATTG TCATAATGGC ACAGATGGGA TGTTTTGTAC CTGCTGATGA GGCACACATT GGTGTAGTTG ATAAAATCTT TTCACGGATA GGGGCATCTG ATGATATTTC ATCTGGGCAG AGTACCTTCA TGGTGGAGAT GTCAGAAGTT GCGAACATAT TGAAAAATGC AACGCCAAAA AGCCTTATAA TTTTTGACGA GGTTGGAAGA GGAACAAGCA CATATGATGG ACTTTCCATA GCATGGGCAG TTTTAGAGTA TGTTGCTGAT AAATCTAAAA TTGGTGCAAA AACCCTTTTT GCAACTCATT ACCATGAGTT AACAGAGCTT GAAGAGAGGA TTCCAGGTGT AAAAAACTAT AGGGTTGATG TCAAGGAGGA GGGCAAAAAC ATTATATTTT TAAGGAAAAT TGTTAGAGGT GGATGTGACT CAAGTTATGG GATTCATGTT GCACGGCTTG CTGGAATTCC AGAAGAGGTA TTAAAAAGGG CTGAGGAAAT TCTAAAACAG CTTGAAGAAG CTGATATAAA TAGGAAAAAT ATCAGAAAAC TCAGAAGAGA AATCAAAAAG GAGTTTACTG AGCAGATAGA TTTTTTTTCC TATAAAAAAG AAGAGATAAT AGACAAAATT GAGAAACTTG ATATTTTAAA TATAACTCCT ATCCAGGCTT TAAACATTTT AAGTGAGCTC AAACATGAAA TAATTAAAGC CAAAGAGAGG CAATTGATAT GA
|
Protein sequence | MQELTPMMQQ YMEIKQKVKD CILFFRLGDF YEMFFEDAIV ASKELEIALT SRDCGNNEKA PMCGVPYHSA TSYIAKLIEK GYKVAICEQV EDPKLAKGIV KREITRIITP GTFIDDNIST ANNFICCISK DRSEFALTFV DVSTGEMYSC LLEEDLQKLV NEIGKYSPSE ILISNIEDEL YEFLKKNCTS FVQMIEFVDL QKCHEIIENQ INVGKIDEKL ILSVGNLLKY LTETQKISFD YIRRFEFYRV QNYLQIDINT KRNLELTESI IQRSRKNSLL GILDQTKTSM GSRLLKKWIE RPLIDIIEIN KRLDSVEELK SNYSTLVQVE ELLSRMYDIE RLSSKFAYKN VNAKDLLSLK KSIEVLPTLK QFLSSFDSEL LKEIYEGLDT LEDIYALIDS SINEDAPVSL KEGGIIKEGF NEEVDRLRNI SKNSKELLVE YEEKERNLTG IKNLRIGYNK VFGYYIEVTK SNYSLVPDRY IRKQTLANAE RYITEELKKL EDEILGADQK LIELEYQLFC EIRDRIEAQI ERIQKTASNI ANLDVLCSFA RIAIDNEYVR PNVYLGDRIY IKNGRHPVVE KMIGRGNFIP NDTELDQAEN RVLIITGPNM AGKSTYMRQV ALIVIMAQMG CFVPADEAHI GVVDKIFSRI GASDDISSGQ STFMVEMSEV ANILKNATPK SLIIFDEVGR GTSTYDGLSI AWAVLEYVAD KSKIGAKTLF ATHYHELTEL EERIPGVKNY RVDVKEEGKN IIFLRKIVRG GCDSSYGIHV ARLAGIPEEV LKRAEEILKQ LEEADINRKN IRKLRREIKK EFTEQIDFFS YKKEEIIDKI EKLDILNITP IQALNILSEL KHEIIKAKER QLI
|
| |