Gene Information |
Locus tag | Apre_1336 |
Symbol | |
ID | 8398143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1436374 |
End bp | 1438719 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644995698 |
Product | MutS2 family protein |
Protein accession | YP_003153080 |
Protein GI | 257066824 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
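The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1436374
END_BP = 1438719


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2346
print(protein_length_aa(length))  # 781
```

Both results agree with the Gene Length (2346 bp) and Protein Length (781 aa) fields.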
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAGAAA AAAGCTTAAA GGTTTTAGAA TATGACAAGA TTCTAGAAAG ACTTGCTGGA TGTGCGAGGT CAAATCTTGT AAAGGATCAG ATCCTAAAGC TAAGACCTTA TGATGATATA AATTATATAA GAGAAGAGCT TTACGAGACT TCTGCTATGG TCGATGTGAT TAGGAAAAAT GGAAACATAG ATCTATTTGG TCTCTATGAT CTTACAGAAA TTGTTGCCTA TATTAGAAAA AATGGCATAC TCGATCCAGG AGAGCTCTTA AAAGTCCTTG ACCTACTAAG GGTGAGTGAA TATTTAAAAG ATTATGGGAA AAATATCGAA GATAGGAAGA TAGGAGATAT TTTTTCTAGA ATCTCAATCA ATGATTTTCT CAAAAACGAA ATCGATAGGT CTATAATAAA CGAGGAGGAA ATAGCAGATT CTGCCTCATC TACCCTTAGA AATATCAGAC GTCAAAAGCA AAGGAAGGAA GCTGATATAA GGATTAAGCT AAATTCATAT ATCACAAATT CCAAATACGA TGACGCCCTA CAAGATAAGG TAGTATCAGT AAGAGATGGA AGATATGTAG TTCCTGTAAA GACTAATAAG CGTGCCTTGA TCGGGGGAAT CGTCCATGAC AAGTCATCTT CTGGGAACAC TCTTTTTATA GAACCAGGCG CCATAGTAGA GCTTAACAAC CAGCTTAGAG ATCTAGAGAT TAAGGAAGAA GATGAAATCA GGAGAATTCT CGATAGACTT TCTAGACTTG CCCAAGGATT CGATGTAGAG CTACTAGAAA ATCAGAAGTT GATAGCAAGG ATTGATTTCC TCCAAGCAAA ATCTAGATTT GCCATAGAAA ATGAGTATAG CCTTCCTATA ATTACAGATG AAAAGAAAGT CGACTTAAAA TCAGCAAGAC ATCCCCTCCT TCCTGGTAAG GTTGTGCCAA TCGATGTAAG AATTGGAGGA GACTATACTA CTTTAATAAT CACAGGGCCA AATACTGGAG GTAAGACTGT AAGTCTAAAG ACTGTAGGAC TAATAAGTGC AATGGCTCAG ACAGCCCTTT TCATACCAGC CTACGAGGGA AGTAAGCTTT GTGTATTCGA TGATATTTTC CTAGACATAG GAGATACCCA GTCTATAGAG ATGAGTCTAT CGACTTTTTC AGCATCTTTG ACTAATATCG TCGATATATT GAAAAACTCT ACGGAAAATT CCTTGGTTCT CTTAGATGAG ATTGGTTCAG GAACAGATCC GGTAGAAGGA GCAGCCCTTG CTATTTCTAT CCTTAATTCC TTGACCCAAA AGAAAGTCAT GACCTTTTCT ACAACCCATT ACAGCGAGTT AAAATACTAT GCTGTAGAGA CTTCTGGTGT TATGAATGCA TCTGTAGAAT TTGATGTCGA TACACTTTCT CCAACCTACA AGCTTGAAAT AGGGACTCCA GGTAAGTCCA ACGCCTTCGA GATTTCCAAA AGACTTGGTC TTCCTTACGA GATTTTAAAC AATGCCAAAA ATCTTATAGG AGATGATACC AAAAATATCA ACAAAATCCT TGCAGAAATC GAAGAAGATA AGAAGGAAAT TGAAGATAAG AATAAGGAAA TAGAAAGCTA TAAGAGAGAA ATAGCTAAAA TAAGAAATGA GCTTAAGGAA AAATCCAAAA GACTTGATCA GAAGGAAGAA GATATCTTAA GAGAGGCCGA AGATAAGGCT AACAGCATCC TAGATAAGGC AAATAAGAGA AGTCAAGACA TGCTAAAAGA AGCCAAGAAG ATGAGAAATG CCAATACTTC CGACATAGAT 
AGGTCTCTAA ACAAAATTAG GCATGAATAC AAAGAAGGAA GAATAGAAAG AAAAGGGGAA GGCCTTTACA CCAAAGAGTC TAAGAATGCA CCAGATAGTC TAAAGGTGGG GGACACTGTT CTAATAGCAG GACTTAACGA AAAGGCAGAA GTAATCGAAG CTCCTGATAA GAAGGGTAAT ATCAAGGTAC AAATGGGAAT TCTTAAGATG GATTCTAATA TCAAAAACGT AAGTAAGATT AAAGGCGACA ATCAAACAGA AAAAAACATC AGAAAAGTGT ATAATACTAA AAAAGCCATG AATATCTCGC CAACCCTAGA CCTTAGGGGA CAAAGATATG ACGAGGCTAT GAGAAATCTA GATAAATACC TTGATGATGC AATGCTTGCA GGTCTTTCTA AGGCCAAGAT CATTCATGGA AAAGGAACTG GCGCCCTAAT CAAGGGAGTC GGAGAAATTC TAGAAGGTGA TAAGAGAATT GAAGATTACC GTTTCGGCGA TGACAAAGAA GGCGGATACG GTGTTACTAT AGTGAAATTT GGATAG
Protein sequence | MQEKSLKVLE YDKILERLAG CARSNLVKDQ ILKLRPYDDI NYIREELYET SAMVDVIRKN GNIDLFGLYD LTEIVAYIRK NGILDPGELL KVLDLLRVSE YLKDYGKNIE DRKIGDIFSR ISINDFLKNE IDRSIINEEE IADSASSTLR NIRRQKQRKE ADIRIKLNSY ITNSKYDDAL QDKVVSVRDG RYVVPVKTNK RALIGGIVHD KSSSGNTLFI EPGAIVELNN QLRDLEIKEE DEIRRILDRL SRLAQGFDVE LLENQKLIAR IDFLQAKSRF AIENEYSLPI ITDEKKVDLK SARHPLLPGK VVPIDVRIGG DYTTLIITGP NTGGKTVSLK TVGLISAMAQ TALFIPAYEG SKLCVFDDIF LDIGDTQSIE MSLSTFSASL TNIVDILKNS TENSLVLLDE IGSGTDPVEG AALAISILNS LTQKKVMTFS TTHYSELKYY AVETSGVMNA SVEFDVDTLS PTYKLEIGTP GKSNAFEISK RLGLPYEILN NAKNLIGDDT KNINKILAEI EEDKKEIEDK NKEIESYKRE IAKIRNELKE KSKRLDQKEE DILREAEDKA NSILDKANKR SQDMLKEAKK MRNANTSDID RSLNKIRHEY KEGRIERKGE GLYTKESKNA PDSLKVGDTV LIAGLNEKAE VIEAPDKKGN IKVQMGILKM DSNIKNVSKI KGDNQTEKNI RKVYNTKKAM NISPTLDLRG QRYDEAMRNL DKYLDDAMLA GLSKAKIIHG KGTGALIKGV GEILEGDKRI EDYRFGDDKE GGYGVTIVKF G
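The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the listed gene sequence is already the coding strand (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard table, differing only in permitted start codons. The helper names are hypothetical. Only short fragments copied from the record are used here. The 37% GC value is not recomputed; `gc_fraction` can be applied to the full sequence to compare.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "ATGCAAGAAA AAAGCTTAAA GGTTTTAGAA"  # start of the gene sequence
last_12 = "AAATTT GGATAG"                      # end of the gene sequence

print(translate(first_30))  # MQEKSLKVLE
print(translate(last_12))   # KFG*
```

The first output matches the first ten residues of the protein sequence, and the second matches its final residues followed by the stop codon.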