Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0767 |
Symbol | |
ID | 6376823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 977964 |
End bp | 980555 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642681913 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001957876 |
Protein GI | 189502159 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAGG CTACCACTCC ATTAATGAAG CAATACAATG AAATTAAGGC AAAATATCCT GGTTCATTAT TGCTTTTTAG GGTAGGCGAC TTTTACGAAA CTTTTGGAGA AGATGCTGTA AAAACCAGCA AGTTGCTAGA TATTGTTTTA ACTAAACGTG CTAATGGTGC AGCAGCGGCT GTAGAATTAG CTGGTTTTCC ACACCATGCG CTCGATACTT ATTTGCCTAA GTTAGTTAAA GCAGGGCATC GAGTTGCTAT TTGTGACCAA CTAGAAGATC CTAAAGCAGT TAAAGGTATT GTAAAGAGAG GTGTTACTGA ACTAGTAACC CCAGGTCTTT CTTTTCATGA TGCTGTCTTA GAACGCAGGC ACAATAACTA TCTAGCATCT TTATACTTTG AGAAAGAGTT AGTAGGTATT GCCTTTCTAG ATGTCTCTAC CGGTGAATTT CTTACGGCAC AAGGAAAGGC TACTTATATA GATAAATTGA TGCAGGGCTT CCAACCTGCA GAAGTAATTA TAAGCAAAAA GCAAAGAGCC ACTTTTCAGG CATTTTCTAA AGAAAATTAC CCTAGCTATG CGCTCGAAGA TTGGGTATAT CAGCCTGATT ATGCACAAGA AAAACTCAAC GAACATTTTG GTACTGCTTC CATTAAGGGA TTTGGAATAG ACAACCTGCC ACTAGGGGTT ATAGCTAGTG GGGCTATCCT CCGATACCTA GAAGAGACAG AACACAAAGA AAAAAAGCAT ATTACTTCAA TTGCCCGTAT TGAAGAAGAC AAGTATGTAT GGCTAGATAA ATTTACCATT AGAAATTTAG AAATACTACA GCCACAACAA GAAGGGGGCG TGTCGTTGAT TGAAGTGCTT GATAAAACAG TGACTCCCAT GGGCGCTCGC TTAATGAAAA AATGGCTGGT GTTGCCCTTA AAAGATATAC AGGCCATACA GAGAAGGCTA GATATTGTTG ATCTGTTTTA CCAGGATACT AATTTATGGG GAAGTATTTT ACAAGAGCTT AAACAGATTA GTGATTTGGA AAGGCTTATA TCTAAAGTTT CTGTTGGTAG AGCTACTCCA CGAGATCTAT TAGCCTTACA GAAAGCATTG CAACATACGC TTCCTATACA AAACTACTTA CAAACGAGCG AACATGATTT GCTTATAAAG TTGAGCCAGC AGCTCCACAA CTGTGAATAC TTGGCTGATA AAATTAGAGG AACATTACAG GATAATCCTC CGTTGTCTCT TACTCAAGGA GATCTGATCC GAGAAGGTAT TGATAGTGAG TTGGATGAGT TAAGAAAAAT TGCTTACCAA GGAAAAGATT ATTTACTCCA GCTACAACAA AAAGAAATTA AAAATACAGG CATCAATTCC TTAAAGATTG CTTATAATAA GGTGTTTGGC TATTACTTAG AAGTTACTAA TGTACATAAG TCTAAAGTTC CAGCTTCGTG GATACGTAAA CAGACACTGG TAAATGCAGA GCGCTATGTT ACAGAGGAGC TAAAGACATA TGAAGAAAAG ATTTTACAAG CAGAAAGTAA GATGCTGGAA ATAGAACAAA GGTTATATCA GCAATTACTA GATAGCGCTT TAGAATTCGT TCCACAAATT TTACAGAATG CAAAAATTTT AGCTCAAATA GATTGTTATC TTACATTTGC ACAAGAGGCT AGAAAGCATC AGTACACCAA ACCTATTCTT GCCAATCATA AAAAAATAAT TATAAAAAAT GGTAGACATC CTGTTATCGA ACAACAATTG TCTGTTGATG TAAGCTATGT GCCTAACGAT ATTTACCTAG ATAATGAAAC ACAGCAGGTT ATTGTTATTA CAGGTCCTAA TATGGCTGGT AAATCTGCAT TATTAAGGCA AGTGGCACTC ATTGTACTGA TGGCACAGAT AGGCTCTTTT GTACCAGCTA GTCAGGCAGA GATTGGCTTG GTAGACAAAA TATTTACTCG GGTAGGAGCC TCAGACAATT TAGCGCTAGG AGAGTCTACT TTTATGGTAG AGATGACGGA AACTGCCAGC ATTATGCATA ATTTAAGTGA CCGTAGTCTG ATTGTCATGG ATGAAATTGG ACGTGGGACT AGCACTTATG ATGGCATATC AATTGCCTGG TCTATTATTG AATATTTACA TAACCATCCT AAATACAAAG CAAAAACTTT ATTTGCTACG CATTATCATG AGCTTAATCA ATTATCAGAT CAGTTAGAGC GTGTTAAGAA CTTTAATGTG GCGGTTAAGG AAGTAGCAGG TAAAATTATA TTTTTACGTA AACTTAGGGA GGGTGGGAGT GAACATAGCT TTGGGATTCA TGTAGCTCAG CTAGCAGGAA TGCCAACTCA AGTAGTAGAA AGAGCCAGTG AAATTTTAGG GCATTTAGAA CATGATAAAA AACATATTGA AAATAAAGCT AAAATTAAAA CTATACCTGC AAAAACTTAT CAGCTCGCAC TTTTTGAAGC AGATCCTGAT ATAGAAAAAG CAAAGTCCTT ACTTAAGCAA TTAGATATAG ATACGCTTGC ACCTATAGAA GCTTTGCTAA AATTAAAAGA GCTTCAGGAG AGCTTAAAGT AA
|
Protein sequence | MNQATTPLMK QYNEIKAKYP GSLLLFRVGD FYETFGEDAV KTSKLLDIVL TKRANGAAAA VELAGFPHHA LDTYLPKLVK AGHRVAICDQ LEDPKAVKGI VKRGVTELVT PGLSFHDAVL ERRHNNYLAS LYFEKELVGI AFLDVSTGEF LTAQGKATYI DKLMQGFQPA EVIISKKQRA TFQAFSKENY PSYALEDWVY QPDYAQEKLN EHFGTASIKG FGIDNLPLGV IASGAILRYL EETEHKEKKH ITSIARIEED KYVWLDKFTI RNLEILQPQQ EGGVSLIEVL DKTVTPMGAR LMKKWLVLPL KDIQAIQRRL DIVDLFYQDT NLWGSILQEL KQISDLERLI SKVSVGRATP RDLLALQKAL QHTLPIQNYL QTSEHDLLIK LSQQLHNCEY LADKIRGTLQ DNPPLSLTQG DLIREGIDSE LDELRKIAYQ GKDYLLQLQQ KEIKNTGINS LKIAYNKVFG YYLEVTNVHK SKVPASWIRK QTLVNAERYV TEELKTYEEK ILQAESKMLE IEQRLYQQLL DSALEFVPQI LQNAKILAQI DCYLTFAQEA RKHQYTKPIL ANHKKIIIKN GRHPVIEQQL SVDVSYVPND IYLDNETQQV IVITGPNMAG KSALLRQVAL IVLMAQIGSF VPASQAEIGL VDKIFTRVGA SDNLALGEST FMVEMTETAS IMHNLSDRSL IVMDEIGRGT STYDGISIAW SIIEYLHNHP KYKAKTLFAT HYHELNQLSD QLERVKNFNV AVKEVAGKII FLRKLREGGS EHSFGIHVAQ LAGMPTQVVE RASEILGHLE HDKKHIENKA KIKTIPAKTY QLALFEADPD IEKAKSLLKQ LDIDTLAPIE ALLKLKELQE SLK
|
| |