Gene Information |
Locus tag | A9601_18541 |
Symbol | |
ID | 4718592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1583698 |
End bp | 1586439 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640079588 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001010244 |
Protein GI | 123969386 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
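The length fields in this table follow from the coordinates. The sketch below is a minimal illustration, not part of the database record, and rests on two assumptions: Start bp and End bp are 1-based and inclusive, and the final codon of the CDS is a stop codon that is not counted in the protein length. The function and variable names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1583698
END_BP = 1586439


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2742 913
```

Both results agree with the Gene Length (2742 bp) and Protein Length (913 aa) fields.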
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAG ACACGATAAT TCAAAAGAAT TTATTTGCGA TTGGTAATGA TATTAATGAA CAAAAAAAAA TAACAAAAAT TCCAGAAAAT TTATCTTTGG AAGATTTAAA AAAAGAATCG CAAAAAAGAC CCAGACAAAG AAAAAATTCA ACTAATTTAA TAAATAAATT CAAGACTGAT TTAATTTCAA ATAACAAAAA TGTTTGCATC AATGAAGAAT CTTATAGCTA TAAAACAGTT TCAAAACTGA AATTAACTCC TGTAATGAAG CATTATGTAA CTCTAAAAGA AGAAAATAAA GATAGGTTAT TACTTTATAG ATTAGGAGAC TTTTTTGAAT GTTTTTTTGA GGACGCTGTA TTAATATCTA ACCTTTTGGA AATAACGCTT ACCAGTAAAG ATGCTGGCAA AGAGATTGGT AAGATCCCTA TGGCAGGGGT TCCCCATCAT GCAATGGAGA GATACTGTGC TGATTTAATT AAAAAAAATT ATTCTGTGGT TATATGCGAT CAATTAGAAA AAAGTTCTGG AAATTATGGG ACTCCAATTA AAAGAGGAAT AACAAGAATA ATTACTCCTG GAACTGTAAT TGAAGAGGGG ATGTTGATAG CAAAGAAAAA TAATTGGATT AGTGCTATTT ACTTATCAGA AGAAAACTCA GATGAATCTT ATGAATGGGG CATATCAAAA GCTGATGTAA GCACAGGAGA ATTAATAACT TTAGAAGGCC AATCTCTGTC AAAACTATTT GATGAAATTA TTAAATTAGA TTCTTCAGAA ATCATTGTAG GAAGCAATAC AGCAAGAGAT TTATTAATTA AAGGAAATAG TCAAATTACA TATACTGTTT CTCAAGAGAC TAATTTTGGA ATTAATGAAG CAAATTATCT AATAAAAAAT TATTTCCAAA TTGCAAACCT AGAGGGAATA GGACTTAAAA ATTTAAACAA TGCCACTAGA TCACTTGGAG GTTTATTAAA TTATTTAGAA AAAATTAATC CTTCAAATTT AGATAAAGAT TCTTCTTTAA AAATTTCATT AGACTTTCCA CAAATCCAAT ATGGTCACAA CAAATTAATT ATTGATTATC AAACTCAAAA AAACTTAGAA ATCAAAAATA CGCAACGAGA AAACAATTAT GTAGGTTCGC TACTATGGAG TATTGATAGG ACTTATACCT GCATGGGCGC AAGGTGTTTA AGAAGGTGGA TAGATTCACC ACTATTAAAC GTTAATGAAA TTTATAAAAG GCAAAATATA ATTACAAACT TTCTAGAATC TAAAAAATTA CGTATAGATA CCCAAAATTT ACTTAGAGCA ATGGGGGATT TAGAAAGACT TGCAGGTAGA GCTTGTGCAG GTCATGCAAG TCCCAGAGAC TTAATTGCAA TAGCAGAAGG TTTAAAAAAA TTGCCTAGAC TAAAATCCAT AATTGAATTA TTTAAATATG ATCTCCCAGA TTGGACTGAT CAATTAAAAA ATATTGATGA AGGACTCTTA GAATTAGCTG ATACTATAAG TTTTAAATTA GTAGAAAATC CTCCTCTAAA TATTAGTGAA GGAGGCATGA TCCACGATGG TGTTGACAAT ATATTAGATG GTTTACGAAA TTTAATGGAT GATTACTCTG AGTGGCTAAA TAAAGAGGAA TCAAAGGAAA GAAAAATTAG CAAAATTTCA AACCTAAAAA TTCAATTTCA TAAAAATTTT GGTTATTACA TTTCTATTAA TAAGTCAAAA GTTAATTTAG CTCCACAACA TTGGATCAAA AGGCAAACAC TTACTAATGA AGAAAGGTAT 
ATCACTTCTG AAATTAAAAA TAAAGAAAAT AAGATTTTCC AAATAAAAAG TAGAGCTTCA TCAAAAGAAT ATGAAATTTT CTGCGAATTA AGAAATATAG TTGCTGAAAA AACAAAACAA ATAAGATCAA TCGCAAAATC CATAGCATCT CTTGATGCAC TACTTGGTTT ATCAATTACT TCAGTAGAAA ACAATTTTAT AAAACCTTCA TTAATACCAA TAAATGATTC AATGACAAAA AATAGTACAA AAATTATCGC AGGAAGAAAT CCAATTGTAG AGCAATTGTT AAGTGATAAA AAGTTTGTAG CAAACGATAT CTCTTTTGAG GATAATCAAA AATTAATTAT ATTAACCGGT CCCAATGCAA GCGGAAAAAG TTGCTTTATA AGACAACTTG GTTTAATACA AATTCTCACG CAAATTGGTA GCTTTGTTCC TGCTAATAAT GCTGAAATCA AGATTGCAGA TAGGATTTTC ACAAGAATTG GGGCAGTTGA TGATCAATCA TCTGGACAAT CAACATTTAT GGTAGAAATG TCTGAAACTG CATCAATTCT AAATCAGGCA ACTTCTAGCT CACTAGTTTT ACTTGATGAG ATTGGCAGAG GGACATCTAC TTTTGATGGA CTCTCAATAG CTTGGTCAGT AAGTGAATAT CTTGCAAAAA AAATTCAATG TAATACTATT TTTGCTACGC ACTATCATGA GCTTAATTAT TTAAAAAATA CAAATAAGAA TATACAAAAT TTTCAAGTTT TAGTAGAACA AAATAACGAT CAGCTAATTT TTAGCCACAG GATTGTTAAA GGGGGCTCAA ACAAAAGCTA TGGCATAGAA GCAGCTAAAT TAGCAGGAGT CCCAAAAGAA GTTATAGAAA AAGCAAAATC AGTTTTAAAT TCTTTAGAAG AAAATAACAA ATTAAATTAT GATATAAAGT AG
|
Protein sequence | MQEDTIIQKN LFAIGNDINE QKKITKIPEN LSLEDLKKES QKRPRQRKNS TNLINKFKTD LISNNKNVCI NEESYSYKTV SKLKLTPVMK HYVTLKEENK DRLLLYRLGD FFECFFEDAV LISNLLEITL TSKDAGKEIG KIPMAGVPHH AMERYCADLI KKNYSVVICD QLEKSSGNYG TPIKRGITRI ITPGTVIEEG MLIAKKNNWI SAIYLSEENS DESYEWGISK ADVSTGELIT LEGQSLSKLF DEIIKLDSSE IIVGSNTARD LLIKGNSQIT YTVSQETNFG INEANYLIKN YFQIANLEGI GLKNLNNATR SLGGLLNYLE KINPSNLDKD SSLKISLDFP QIQYGHNKLI IDYQTQKNLE IKNTQRENNY VGSLLWSIDR TYTCMGARCL RRWIDSPLLN VNEIYKRQNI ITNFLESKKL RIDTQNLLRA MGDLERLAGR ACAGHASPRD LIAIAEGLKK LPRLKSIIEL FKYDLPDWTD QLKNIDEGLL ELADTISFKL VENPPLNISE GGMIHDGVDN ILDGLRNLMD DYSEWLNKEE SKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPQHWIK RQTLTNEERY ITSEIKNKEN KIFQIKSRAS SKEYEIFCEL RNIVAEKTKQ IRSIAKSIAS LDALLGLSIT SVENNFIKPS LIPINDSMTK NSTKIIAGRN PIVEQLLSDK KFVANDISFE DNQKLIILTG PNASGKSCFI RQLGLIQILT QIGSFVPANN AEIKIADRIF TRIGAVDDQS SGQSTFMVEM SETASILNQA TSSSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIQCNTI FATHYHELNY LKNTNKNIQN FQVLVEQNND QLIFSHRIVK GGSNKSYGIE AAKLAGVPKE VIEKAKSVLN SLEENNKLNY DIK
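The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below is a minimal illustration under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in their permitted start codons), and it translates only the first three and last two 10-base blocks of the gene sequence as printed in this record. The function and variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence in this record.
first_blocks = "ATGCAAGAAG ACACGATAAT TCAAAAGAAT"
last_blocks = "GATATAAAGT AG"

print(translate(first_blocks))  # MQEDTIIQKN
print(translate(last_blocks))   # DIK*
```

The first result matches the opening block of the protein sequence, and the second matches its final three residues followed by the TAG stop codon, which is not part of the 913 aa protein.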