Gene Aasi_0767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0767 
Symbol 
ID6376823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp977964 
End bp980555 
Gene Length2592 bp 
Protein Length863 aa 
Translation table11 
GC content37% 
IMG OID642681913 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001957876 
Protein GI189502159 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGG CTACCACTCC ATTAATGAAG CAATACAATG AAATTAAGGC AAAATATCCT 
GGTTCATTAT TGCTTTTTAG GGTAGGCGAC TTTTACGAAA CTTTTGGAGA AGATGCTGTA
AAAACCAGCA AGTTGCTAGA TATTGTTTTA ACTAAACGTG CTAATGGTGC AGCAGCGGCT
GTAGAATTAG CTGGTTTTCC ACACCATGCG CTCGATACTT ATTTGCCTAA GTTAGTTAAA
GCAGGGCATC GAGTTGCTAT TTGTGACCAA CTAGAAGATC CTAAAGCAGT TAAAGGTATT
GTAAAGAGAG GTGTTACTGA ACTAGTAACC CCAGGTCTTT CTTTTCATGA TGCTGTCTTA
GAACGCAGGC ACAATAACTA TCTAGCATCT TTATACTTTG AGAAAGAGTT AGTAGGTATT
GCCTTTCTAG ATGTCTCTAC CGGTGAATTT CTTACGGCAC AAGGAAAGGC TACTTATATA
GATAAATTGA TGCAGGGCTT CCAACCTGCA GAAGTAATTA TAAGCAAAAA GCAAAGAGCC
ACTTTTCAGG CATTTTCTAA AGAAAATTAC CCTAGCTATG CGCTCGAAGA TTGGGTATAT
CAGCCTGATT ATGCACAAGA AAAACTCAAC GAACATTTTG GTACTGCTTC CATTAAGGGA
TTTGGAATAG ACAACCTGCC ACTAGGGGTT ATAGCTAGTG GGGCTATCCT CCGATACCTA
GAAGAGACAG AACACAAAGA AAAAAAGCAT ATTACTTCAA TTGCCCGTAT TGAAGAAGAC
AAGTATGTAT GGCTAGATAA ATTTACCATT AGAAATTTAG AAATACTACA GCCACAACAA
GAAGGGGGCG TGTCGTTGAT TGAAGTGCTT GATAAAACAG TGACTCCCAT GGGCGCTCGC
TTAATGAAAA AATGGCTGGT GTTGCCCTTA AAAGATATAC AGGCCATACA GAGAAGGCTA
GATATTGTTG ATCTGTTTTA CCAGGATACT AATTTATGGG GAAGTATTTT ACAAGAGCTT
AAACAGATTA GTGATTTGGA AAGGCTTATA TCTAAAGTTT CTGTTGGTAG AGCTACTCCA
CGAGATCTAT TAGCCTTACA GAAAGCATTG CAACATACGC TTCCTATACA AAACTACTTA
CAAACGAGCG AACATGATTT GCTTATAAAG TTGAGCCAGC AGCTCCACAA CTGTGAATAC
TTGGCTGATA AAATTAGAGG AACATTACAG GATAATCCTC CGTTGTCTCT TACTCAAGGA
GATCTGATCC GAGAAGGTAT TGATAGTGAG TTGGATGAGT TAAGAAAAAT TGCTTACCAA
GGAAAAGATT ATTTACTCCA GCTACAACAA AAAGAAATTA AAAATACAGG CATCAATTCC
TTAAAGATTG CTTATAATAA GGTGTTTGGC TATTACTTAG AAGTTACTAA TGTACATAAG
TCTAAAGTTC CAGCTTCGTG GATACGTAAA CAGACACTGG TAAATGCAGA GCGCTATGTT
ACAGAGGAGC TAAAGACATA TGAAGAAAAG ATTTTACAAG CAGAAAGTAA GATGCTGGAA
ATAGAACAAA GGTTATATCA GCAATTACTA GATAGCGCTT TAGAATTCGT TCCACAAATT
TTACAGAATG CAAAAATTTT AGCTCAAATA GATTGTTATC TTACATTTGC ACAAGAGGCT
AGAAAGCATC AGTACACCAA ACCTATTCTT GCCAATCATA AAAAAATAAT TATAAAAAAT
GGTAGACATC CTGTTATCGA ACAACAATTG TCTGTTGATG TAAGCTATGT GCCTAACGAT
ATTTACCTAG ATAATGAAAC ACAGCAGGTT ATTGTTATTA CAGGTCCTAA TATGGCTGGT
AAATCTGCAT TATTAAGGCA AGTGGCACTC ATTGTACTGA TGGCACAGAT AGGCTCTTTT
GTACCAGCTA GTCAGGCAGA GATTGGCTTG GTAGACAAAA TATTTACTCG GGTAGGAGCC
TCAGACAATT TAGCGCTAGG AGAGTCTACT TTTATGGTAG AGATGACGGA AACTGCCAGC
ATTATGCATA ATTTAAGTGA CCGTAGTCTG ATTGTCATGG ATGAAATTGG ACGTGGGACT
AGCACTTATG ATGGCATATC AATTGCCTGG TCTATTATTG AATATTTACA TAACCATCCT
AAATACAAAG CAAAAACTTT ATTTGCTACG CATTATCATG AGCTTAATCA ATTATCAGAT
CAGTTAGAGC GTGTTAAGAA CTTTAATGTG GCGGTTAAGG AAGTAGCAGG TAAAATTATA
TTTTTACGTA AACTTAGGGA GGGTGGGAGT GAACATAGCT TTGGGATTCA TGTAGCTCAG
CTAGCAGGAA TGCCAACTCA AGTAGTAGAA AGAGCCAGTG AAATTTTAGG GCATTTAGAA
CATGATAAAA AACATATTGA AAATAAAGCT AAAATTAAAA CTATACCTGC AAAAACTTAT
CAGCTCGCAC TTTTTGAAGC AGATCCTGAT ATAGAAAAAG CAAAGTCCTT ACTTAAGCAA
TTAGATATAG ATACGCTTGC ACCTATAGAA GCTTTGCTAA AATTAAAAGA GCTTCAGGAG
AGCTTAAAGT AA
 
Protein sequence
MNQATTPLMK QYNEIKAKYP GSLLLFRVGD FYETFGEDAV KTSKLLDIVL TKRANGAAAA 
VELAGFPHHA LDTYLPKLVK AGHRVAICDQ LEDPKAVKGI VKRGVTELVT PGLSFHDAVL
ERRHNNYLAS LYFEKELVGI AFLDVSTGEF LTAQGKATYI DKLMQGFQPA EVIISKKQRA
TFQAFSKENY PSYALEDWVY QPDYAQEKLN EHFGTASIKG FGIDNLPLGV IASGAILRYL
EETEHKEKKH ITSIARIEED KYVWLDKFTI RNLEILQPQQ EGGVSLIEVL DKTVTPMGAR
LMKKWLVLPL KDIQAIQRRL DIVDLFYQDT NLWGSILQEL KQISDLERLI SKVSVGRATP
RDLLALQKAL QHTLPIQNYL QTSEHDLLIK LSQQLHNCEY LADKIRGTLQ DNPPLSLTQG
DLIREGIDSE LDELRKIAYQ GKDYLLQLQQ KEIKNTGINS LKIAYNKVFG YYLEVTNVHK
SKVPASWIRK QTLVNAERYV TEELKTYEEK ILQAESKMLE IEQRLYQQLL DSALEFVPQI
LQNAKILAQI DCYLTFAQEA RKHQYTKPIL ANHKKIIIKN GRHPVIEQQL SVDVSYVPND
IYLDNETQQV IVITGPNMAG KSALLRQVAL IVLMAQIGSF VPASQAEIGL VDKIFTRVGA
SDNLALGEST FMVEMTETAS IMHNLSDRSL IVMDEIGRGT STYDGISIAW SIIEYLHNHP
KYKAKTLFAT HYHELNQLSD QLERVKNFNV AVKEVAGKII FLRKLREGGS EHSFGIHVAQ
LAGMPTQVVE RASEILGHLE HDKKHIENKA KIKTIPAKTY QLALFEADPD IEKAKSLLKQ
LDIDTLAPIE ALLKLKELQE SLK