Gene Information |
Locus tag | P9301_18351 |
Symbol | |
ID | 4911157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1555676 |
End bp | 1558417 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640161440 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001092059 |
Protein GI | 126697173 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
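The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(1555676, 1558417)
print(length_bp, protein_length(length_bp))  # prints: 2742 913
```

Under these assumptions the result agrees with the Gene Length (2742 bp) and Protein Length (913 aa) fields.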
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAG ATTTGATAAT TCAAAAGAAT TTATTTGCGA TTGGTAATGA TAATAATAAG CAAAAGGAAA AAACAAAAAT TCCAGAAGAT TTATCTTTAG AAGATTTAAA AAAAGAATCG CAAAAAAGAC CCAGACAAAG AAAAAATTCA ACTAATTTAA TAAATAAATT CAAGACTGAT TTAATTTCAA ATAAGAAAAA TGTTTGCATC AATGAAGAAT CTTATAGCTA TAAAACAGTT TCAAAACTGA AATTAACACC TGTAATGAAG CATTATGTAA CTCTAAAAGA AGAAAATAAA GATAGGTTAT TACTTTATAG ATTAGGAGAT TTTTTTGAAT GTTTTTTTGA GGATGCTGTA TTAATATCTA ATCTTTTAGA AATAACGCTT ACCAGTAAAG ATGCTGGCAA AGAGATTGGT AAGATCCCTA TGGCAGGGGT TCCCCATCAT GCAATGGACA GATACTGTGC TGATTTAATT AAAAAAAATT ATTCTGTGGT TATATGTGAT CAATTAGAAA AAAGTTCTGG AAATTATGGG ACTCCAATTA AAAGAGGAAT AACACGAATA ATTACTCCTG GAACTGTAAT TGAAGAGGGG ATGTTGATAG CAAAGAAAAA TAATTGGATT ACTGCTATTT ACTTATCAGA AGAAAACTCA AATGAATCTT ATGAATGGGG TATATCAAAA GCTGATGTAA GCACAGGAGA ATTAATAACT TTAGAAGGCC AATCTCTATC AAAACTATTT GATGAAATTA TTAAATTAGA TTCTTCAGAA ATCATTGTAG GAAGCAATGC AGTAAGAAAT TTATTAATTA AAGGAAATAG TCAAATTACA TATACTGTTT CTCAAGAGAC TAATTTTGGA ATTAATGAAG CAAATTATCT AATAAAAAAT TATTTCCAAA TTGCAAACTT AGAGGGAATA GGACTTAAAA ATTTAAAAAA TGCAACTAGA TCACTTGGAG GTTTATTAAA TTATTTAGAA AAAATTAATC CTTCAAATTT AGATAAAGAT TCTTCTGTAA AAATCTCATT AGACTTTCCA CAAATCCAAT ATGGTCACAA CAAATTAATT ATTGATTATC AAACTCAAAA AAACTTAGAA ATCAAAAATA CACAACGAGA AAACAATTAT GTAGGTTCGC TACTATGGAG TATTGATAGA ACTTATACTT GCATGGGCGC AAGGTGTTTA AGAAGGTGGA TAGATTCACC ACTATTAAAC GTTAATGAAA TTTATAAAAG ACAAAATATA ATTACAAACT TTTTTGAATC TAAGAAATTA CGTACAGATA CCCAAAATTT ACTTAGAGCA ATGGGGGATT TAGAAAGACT TGCAGGTAGA GCTTGTGCAG GTCATGCAAG TCCAAGAGAC TTAATTGCAA TAGCTGAAGG TTTAAAAAAA TTGCCTAGAC TAAAATCCAT AATTGAATTA TTTAAATATG ATCTCCCAAA TTGGACTGAT CAACTTATAA ATATTGATGA AGGACTCTTA GAATTAGCTG ATACTATAAG TTTTAAACTC GTAGAAAATC CTCCTCTAAG TATTAGTGAA GGAGGCATGA TCCACGATGG AGTTGACAAT ATATTAGATG GTTTACGCAA TTTAATGGAT GATTACTCAG AGTGGCTAAA TAAAGAGGAA TTAAAGGAAA GGAAAATTAG CAAAATTTCA AACCTAAAAA TTCAATTTCA TAAAAATTTT GGTTATTACA TTTCTATAAA TAAGTCAAAA GTTAATTTAG CTCCACAACA TTGGATCAAA AGGCAAACAC TTACTAATGA AGAAAGGTAT 
ATCACTTCAG AAATTAAAAA TAAAGAAAAT AAGATTTTCC AAATAAAAAG TAGAGCTTCA TCAAAAGAAT ATGAAATTTT CTGCGAATTA AGAAATATAG TTGCTGAAAA AACAAAACAA ATAAGATCAA TCGCAAAATC CATAGCATCT CTTGATGCAT TGCTTGGTTT ATCAATTACT TCAATAGAAA ACAATTTTAT AAAACCTTTA TTAATACCAA TAAATGATTC AATGACAAAA AATAGTACAA AAATTATCGC AGGAAGAAAT CCAATTGTAG AGCAATTGTT AAGTGATAAA AAGTTTGTAG CAAACGATAT TTCTTTTGAG GATAATCAAA AATTAATTAT ATTAACCGGT CCCAATGCAA GCGGAAAAAG TTGCTTTATA AGACAACTTG GTTTAATACA AATTCTCGCA CAAATTGGTA GCTTTGTTCC TGCTAATAAT GCTGAAATCA AGATTGCAGA TAGGATTTTC ACAAGAATTG GGGCAGTTGA TGATCAATCA TCTGGGCAAT CAACATTTAT GGTAGAAATG TCTGAAACTG CATCAATTCT AAATCAAGCA ACTTCTAACT CACTAGTTTT ACTTGATGAG ATAGGCAGAG GGACATCTAC TTTTGATGGA CTTTCAATAG CTTGGTCAGT AAGTGAATAT CTTGCAAAAA AAATTCAATG TAATACTATT TTTGCTACGC ACTATCATGA GCTTAATTAT TTAAAAAATT CAAATAAGAA TATACAAAAT TTTCAAGTTT TAGTAGAACA AAATGACGAT CAGCTAATTT TTAGCCACAG AATTGTTAGA GGGGGCTCAA ACAAAAGCTA TGGCATAGAA GCAGCTAAAT TAGCAGGAGT TCCAAAAGAA GTTATAGAAA AAGCAAAATC AGTTTTAAAT TCTTTAGAAG AAAATAACAA GTTAAATCAT AATATTAAGT AG
|
Protein sequence | MQEDLIIQKN LFAIGNDNNK QKEKTKIPED LSLEDLKKES QKRPRQRKNS TNLINKFKTD LISNKKNVCI NEESYSYKTV SKLKLTPVMK HYVTLKEENK DRLLLYRLGD FFECFFEDAV LISNLLEITL TSKDAGKEIG KIPMAGVPHH AMDRYCADLI KKNYSVVICD QLEKSSGNYG TPIKRGITRI ITPGTVIEEG MLIAKKNNWI TAIYLSEENS NESYEWGISK ADVSTGELIT LEGQSLSKLF DEIIKLDSSE IIVGSNAVRN LLIKGNSQIT YTVSQETNFG INEANYLIKN YFQIANLEGI GLKNLKNATR SLGGLLNYLE KINPSNLDKD SSVKISLDFP QIQYGHNKLI IDYQTQKNLE IKNTQRENNY VGSLLWSIDR TYTCMGARCL RRWIDSPLLN VNEIYKRQNI ITNFFESKKL RTDTQNLLRA MGDLERLAGR ACAGHASPRD LIAIAEGLKK LPRLKSIIEL FKYDLPNWTD QLINIDEGLL ELADTISFKL VENPPLSISE GGMIHDGVDN ILDGLRNLMD DYSEWLNKEE LKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPQHWIK RQTLTNEERY ITSEIKNKEN KIFQIKSRAS SKEYEIFCEL RNIVAEKTKQ IRSIAKSIAS LDALLGLSIT SIENNFIKPL LIPINDSMTK NSTKIIAGRN PIVEQLLSDK KFVANDISFE DNQKLIILTG PNASGKSCFI RQLGLIQILA QIGSFVPANN AEIKIADRIF TRIGAVDDQS SGQSTFMVEM SETASILNQA TSNSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIQCNTI FATHYHELNY LKNSNKNIQN FQVLVEQNDD QLIFSHRIVR GGSNKSYGIE AAKLAGVPKE VIEKAKSVLN SLEENNKLNH NIK
|
| |
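The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below is one hedged way to do that in Python, then compute a GC fraction and translate a CDS. It assumes the standard codon assignments, which translation table 11 shares with table 1 for all 64 codons (the tables differ in which codons may act as initiators; this gene starts with ATG, so that difference does not matter here). All names (`clean`, `gc_fraction`, `translate`) are illustrative, and only the first 30 nucleotides of the gene sequence are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_field: str) -> str:
    """Remove the block spacing and line breaks from a sequence field."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence in this record.
prefix = clean("ATGCAAGAAG ATTTGATAAT TCAAAAGAAT")
print(translate(prefix))  # prints: MQEDLIIQKN
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence above. The GC fraction of a 30-base prefix is not expected to equal the 29% reported for the whole gene; running `gc_fraction` on the full cleaned sequence would be the meaningful comparison.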