Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1642 |
Symbol | |
ID | 7409472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1739173 |
End bp | 1741536 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716011 |
Product | MutS2 family protein |
Protein accession | YP_002573509 |
Protein GI | 222529627 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000590377 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACCAAA AGACACTCAA AGCGTTGGAG TATGACAAGA TAGTTGAGAT TTTAAAAAAC ATGGCAAAGT CAACTCCAGC AAAGGAATAT TTTGAAAACC TTATTCCATC AACCAATGTT GCAGATATAG AAAATGAACT TAATAAGGTC GATGAAAGCT ACAGGTACGT TCTAAAGTAT GGAAATCTAC CAACTTTGGA GTTTGAAAAT ATACTGCCAA GCCTTAAGAA ATCAAAGCTT GGGGCAACTT TAAATCCACA TGAGATTTTG CAAATTGGAA AAGTGTTGAA ACTTTCTTAT GAGATGCGAA CTTATCTTTC TTTCACACAA GACTTTAGTT TTCTTGAAAG TATGAAAAAA AGACTTGTAA ATTTAAAAGA AGTAATCTCA AGAATTGACC AGACATTTTT AACACCAGAT GAAATCTTAG ATACAGCATC GTCAAAGCTC AAGGAGATAA GAGATAAAAT CAGAAAGCTT GAAAATAAAA TAAGAGACGA GTTAAATAGC ATGATTCGCG ACCCAAAAAT CCAAAGGTTT TTACAAGAAC CAATTATAAC AATCAGAGGT GAAAAGCTCT TACTTCCTGT CAAGGCAGAG TTTAGAAATG AAGTAAAAGG AATTGTCCAT GACCAGTCAG CAACAGGTGC AACTTTATTT GTTGAGCCTT TTGTTTGTGT TGAGATATCA AATCAAATCA AAATTTTAAA AAATCAAGAA AAAGAGGAAA TAGAGAGAAT TTTGCAAGAG ATATCCTCTT TGATAGCAAG CTATTGTGAG GTAATTGAAA CATCTTTTTA TGCACTTGTT GAGCTTGACA TAGTATTTAC GAAAGCTATT TGGGCAAAAG AAATGAACGC AAGCAAACCA ATTATTAACG CAAGTGGCAT TATAAATCTC AAAAAAGCTC GGCATCCATT AATACAAAAA GACAAGGTTG TTCCTATTGA TATTCATTTA GGCAAAGACT TCGACGTTCT TATAATAACA GGTCCAAACA CAGGTGGAAA GACTGTAACA CTAAAGACGG TTGGACTTTT TTGTCTTCTT TGCCAGAGCG GAATATTCAT TCCAGCAGAT GAGGGTTCTG AGCTTTGCAT ATTTCAAAAG ATTTTTGCTG ATATTGGAGA TGATCAAAGT ATAGTCCAGA GTCTTTCCAC ATTTTCAGCA CATATGAAAA ACATCATTGA AATAACAAAA AATGCAGATG ATAAAACTCT TGTGCTATTA GATGAGATTG GAGCAGGTAC AGACCCTGAA GAAGGTGCAG CTTTGGCGAA GGCAATCTTG AAATATCTTT CTGAGAAGGG CAGCAAGGTG ATAGCTACAA CACACTATGG TGAGCTAAAA ATATTTGCTC AGCAAGAAGA TCGGTTTGAA AACGCTTCTT GTGAGTTTGA TGTAAAAACC CTAAAGCCTA CGTACAGGCT TTTGATAGGA ATTCCAGGAA GGAGCAATGC ACTTGTAATT TCATCCAATC TTGGGCTTGA CAAAGGTATT GTTGAGATGG CAAGAGGGTA TCTGTCTCAA AAGACAATTG ATCTTGACAG AATAATAAAC GAAATGGAAC AAAAGAGAAA AGAAGCTGAA GAAAACCTTG AACTTGCTCA AAAATTGAAG CATGAAGCAC AAGCTTTAAA AGCGGCGTAT GAAGAGGAGA AGAAAAGGTT TGAGACTGAA AGAGAGAGAA TTCGCAAAAA GGCCATAAAT GAGGCAAAAG AGATTGTTGA AAGCTCACAG TATGAAATAG AAAATCTTTT TAAAGACCTT CGAAAACTTG CTGAAAACTT AAAGGAAAAA GAAGTTTTAA AGGAGTTAGA AGAGAAAAAA AGAGAATATG AAAGGCTGAT TCAAAGTATT TCACAGCAGG TAAAACAAGA AGCTGAGTCC AAAACCAAAA AAACAATACA GAATCTTCGC TTAGGTCAAA AGGTATATGT CAGAAGCTTT GATGCTGAGG GGTTTGTCGA AAGCCTGCCA GACTCAAAGG GAAATCTTAC TGTCCAGATA GGTATTATGA AGATAAATGT CAATCTTTCT GATATTGAAG AGGTGGAAGG ACAAGATAGC AAAATATATC AAATAGCATC TAGAAATGTA ATAATCAAAG AAAAGAACAT TGACATGTCC ATTGATGTAA GGGGCAAAAC AAGCGATGAT GCTATTTTAG AGGTTGACAA GTACTTAGAT GACGCATACA CAGCGGGACT AAAACAGGTT ACAATAATCC ATGGGAAAGG CACGGGGGTT TTGCGCCAGG CGATAAGGAA TTTTTTGAGA CGGCATCCGC ATGTAAAATC ATTCAGAGAT GGAACATATG GTGAAGGTGA ACAGGGTGTT ACGGTGGTTG AGCTGAAAGA CTAA
|
Protein sequence | MNQKTLKALE YDKIVEILKN MAKSTPAKEY FENLIPSTNV ADIENELNKV DESYRYVLKY GNLPTLEFEN ILPSLKKSKL GATLNPHEIL QIGKVLKLSY EMRTYLSFTQ DFSFLESMKK RLVNLKEVIS RIDQTFLTPD EILDTASSKL KEIRDKIRKL ENKIRDELNS MIRDPKIQRF LQEPIITIRG EKLLLPVKAE FRNEVKGIVH DQSATGATLF VEPFVCVEIS NQIKILKNQE KEEIERILQE ISSLIASYCE VIETSFYALV ELDIVFTKAI WAKEMNASKP IINASGIINL KKARHPLIQK DKVVPIDIHL GKDFDVLIIT GPNTGGKTVT LKTVGLFCLL CQSGIFIPAD EGSELCIFQK IFADIGDDQS IVQSLSTFSA HMKNIIEITK NADDKTLVLL DEIGAGTDPE EGAALAKAIL KYLSEKGSKV IATTHYGELK IFAQQEDRFE NASCEFDVKT LKPTYRLLIG IPGRSNALVI SSNLGLDKGI VEMARGYLSQ KTIDLDRIIN EMEQKRKEAE ENLELAQKLK HEAQALKAAY EEEKKRFETE RERIRKKAIN EAKEIVESSQ YEIENLFKDL RKLAENLKEK EVLKELEEKK REYERLIQSI SQQVKQEAES KTKKTIQNLR LGQKVYVRSF DAEGFVESLP DSKGNLTVQI GIMKINVNLS DIEEVEGQDS KIYQIASRNV IIKEKNIDMS IDVRGKTSDD AILEVDKYLD DAYTAGLKQV TIIHGKGTGV LRQAIRNFLR RHPHVKSFRD GTYGEGEQGV TVVELKD
|
| |