Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2141 |
Symbol | |
ID | 6314800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2263527 |
End bp | 2265401 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642644528 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_001918295 |
Protein GI | 188586750 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCTA ATTACAGCAG TGTCAATAAC AGATCCGAAG CCAATAGTAT TAAAAGTAGA TACAAAGCAC GTAAAGAACA ATTCGCAAGT GAATCAAAAG CACTTTCTAA CAAAATGAAT CTAATATCTA ATTTAAGGTT ACTAACCGTT GTTTTAGGTA TTGGGTTAAC TGGCTATTTG TTTTATATTG GGAATTATGT AGTCAGTTTT TTCTCTCTAA TCATTTTTAC AACAATTTTC ATATTTCTGG TCATCAGATA TCGTGTATTA ACAAATAAGC GAAATTATGC CTGGGCCCTT GCCGATATTA ATGAGCGTTC TCTTATCAGA TTAAAGGGGA GTTGGAATAG TTTTGAGGAT ACTGGTAATG AATTTTTAGA CCATGATCAT CCCTATGCAG AAGATCTAGA TATCTTCGGA CAAAATTCCC TTTTCCAGTG GATAAATACC ACTACTACCT TTTCGGGAAG GCATAAATTA AAGACCATGT TAACACAACC ATGTAATAAG ATAGAAGAGA TACAAGCAAG GCAAAGGTCC ATTCAGGAAT TAGCCGAGAA ACTCCATTGG CGTCAACATT TAGAAGCTTT AGGGAAACAA CCCGAATACG AACACCGCAA TAACAAAGAA CTCCAAGTTG ATCCCCTTAA TTTAATATCA TGGGCAAAAA CACAAAACCC CTTTTATTTA AAAACTTGGC TTAAGGTGAT CGTGAATCTA CTACCCTTAA TAACTCTAAG TTTTATAATC GCAGCAATTA TGACTGGAGT CACTTATATT TTTCCTATAA TTATGATTGC ACTACACATA TTGATCCTTA CTTATGATTA CTCTAATCGC ATTAGCGAGT TCAGCTTGAT CTCAGGCTTT AATGAAAAAT TGTCGGCTTA CAAAGATATT TTAACAGCCA TAGAAACAGA ACAATTTCAT GGAAGTATGT TAACAAAACT ACAGGATACA ATATTAAATT CCAAAACGGG GACAAATGCT TCGGATCGTC TAAAGCTCCT TGACGAGATA ATGGAATTCA TTTCGCATCG CTCAGGCCAG TTTTATATTA TATTTAATAT CTTATTTCTA TTGGATTATC GATGGCAAAT TTCTTTAGAA CATTGGAAAC ATCAATCAGG AGATGAGCTT GAACAATGGT TTGATATTTT GGGAGATTTT GAAGCCCTGA GTAGTTTAGC GATAATTCCA TGTGATCATC CTGATTGGGC TCAACCTGAG ATCACAGCAG AACCTGGTCT ATTTCAGGCC GAACAAATAG GCCACCCTTT ATTAACAGAA CATCGAGTTT GCAACGATAT TGATATGGGT TCAAGTACAA ATAGTTTACT AATTACGGGT TCCAATATGT CTGGTAAAAG TACTTTACTC AGAACAGCTG GAATCAATCT GGTACTAGCA TATTTGGGTG CTCCCGTTTG TGCAAATACT ATGCAGGCGT CATTAATGAA AATTTATACA TGTATGAGAG TTAGCGATAA TTTAGAAAAG AATTTATCTT CTTTTTATGC CGAACTACTT AGAATTAAAC ATATAGTTAA AAGTGCTGAA CAGATACCAG TTTTTTATTT ATTAGACGAG ATTTTTAAAG GGACTAATTC TCGTGACCGA CATACTGGTG CTAGAGCTGT AATTAAAAAA TTACAATCAG AAGGTGCCCT GGGCCTTGTT TCAACACACG ATCTAGAATT AGGTGCCCTG GAGAGTCAAA ACACAAGTAT AAAAAATTAC CATTTCAGAG AGTACTATCA AAATGGTGAA ATTTACTTTG ATTATATTTT GAGACCTGGG TTGGCTCCTA CTACTAATGC TATTTATCTA ATGAAAATGG CAGGCATCGA TCCAGATGAA GAAGATTTGG GTTAA
|
Protein sequence | MSANYSSVNN RSEANSIKSR YKARKEQFAS ESKALSNKMN LISNLRLLTV VLGIGLTGYL FYIGNYVVSF FSLIIFTTIF IFLVIRYRVL TNKRNYAWAL ADINERSLIR LKGSWNSFED TGNEFLDHDH PYAEDLDIFG QNSLFQWINT TTTFSGRHKL KTMLTQPCNK IEEIQARQRS IQELAEKLHW RQHLEALGKQ PEYEHRNNKE LQVDPLNLIS WAKTQNPFYL KTWLKVIVNL LPLITLSFII AAIMTGVTYI FPIIMIALHI LILTYDYSNR ISEFSLISGF NEKLSAYKDI LTAIETEQFH GSMLTKLQDT ILNSKTGTNA SDRLKLLDEI MEFISHRSGQ FYIIFNILFL LDYRWQISLE HWKHQSGDEL EQWFDILGDF EALSSLAIIP CDHPDWAQPE ITAEPGLFQA EQIGHPLLTE HRVCNDIDMG SSTNSLLITG SNMSGKSTLL RTAGINLVLA YLGAPVCANT MQASLMKIYT CMRVSDNLEK NLSSFYAELL RIKHIVKSAE QIPVFYLLDE IFKGTNSRDR HTGARAVIKK LQSEGALGLV STHDLELGAL ESQNTSIKNY HFREYYQNGE IYFDYILRPG LAPTTNAIYL MKMAGIDPDE EDLG
|
| |