Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMT9312_1737 |
Symbol | |
ID | 3766563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9312 |
Kingdom | Bacteria |
Replicon accession | NC_007577 |
Strand | + |
Start bp | 1623061 |
End bp | 1625802 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 637798282 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_398234 |
Protein GI | 78780122 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0169322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAG ATACGATAAT TCAAAAGAAT TTATTTGCTA TTGATAATGA AAATAATGAG CAAAAAGAAA TAACAAAAAT TCCAGAAGAT TTATCTTGGG AAGATTTAAA AAAAGAATCG CAAAAAAGAC CTAGACAAAG AAAAAATACA ACTAATTTAA TAAATAAATT CAAGACTGAT TTGATTTCAA AAAATAAAAA TGTTTGTATC AATGAAGAAT CATATAGTTA CAAAACTGTT TCAAAACTGA AATTAACTCC TGTAATGAAA CATTATGTAA CTCTAAAAGA GGAAAATAAA GATAGGTTAT TGCTCTATAG ATTGGGAGAT TTTTTTGAAT GTTTTTTTGA GGACGCTGTA TTAATATCTA ACCTTTTAGA AATAACACTA ACCAGTAAAG ATGCTGGCAA AGAAATTGGT AAGATCCCCA TGGCAGGCGT TCCTTATCAT GCAATGGAGA GATACTGTGC TGATTTAATT AAAAAAAATT ATTCTGTGGT TATATGTGAC CAATTAGAAA AAAGTTCTGG TAATTATGGG ACTCCAATTA AAAGAGGAAT AACAAGAATC ATTACTCCTG GAACTGTAAT TGAAGAGGGC ATGTTAATAG CAAAGAAAAA TAATTGGATT ACTGCTATTT ACTTATCAGA AGAAAATTCA AATGAATCTT ATGAATGGGG TATATCAAAA GCTGATGTAA GCACAGGAGA ATTAATAACT ATGGAAGGTC AATCTCTGTC AAAACTATTT GATGAAATTA TTAAATTAGA TTCTTCAGAA ATCATTATAG GAAGCAATGA AGTAAGAAAT TTATTAATGA ATGGAAATAG TCAAATTTCA TATACTGTTT CACAAGAGAC TAATTTCGGC ATTAATGAAG CAAATTATCT AATAAAAAAA TATTTCCAAA TTGTAAGCCT AGAAGGAATA GGACTTAAAA ATTTAAACAA TGCGACTAGA TCACTTGGAG GTTTATTAAA TTATTTAAAA AAAATTAATC CTTTAAATTT AGATAAGGAT TCTTCTGTAA AAATCTCATT AGACTTTCCA CAAATTCAAT TTTGTCACAA CAAATTAATT ATTGATTATC AAACTCAAAA AAATTTAGAA ATAAAAAATA CACAACGAGA AAACAATTAT GTAGGTTCGC TACTATGGAG TATTGATAGA ACATATACCT GCATGGGTGC CAGGTGTTTA AGAAGATGGA TAGATTCACC ACTATTAAAC GTTAATGAAA TTTATAAAAG ACAAAATATA ATTTCAAACT TTATTGAATC GAAACAAGTC CGCATGGATA CCCAAAATTT ACTTAGAGCA ATGGGTGATT TAGAAAGACT TGCAGGAAGA GCTTGCGCAG GTCATGCAAG TCCTAGAGAC TTAATTGCAA TCGCGGAAGG TTTAAAAAAA TTGCCTAGAA TAAAATCCAT CATTGAATTA TTTAAATATG ATCTCCCAGA TTGGACTGAT CAATTAAAAA ATATTGATGA AGAACTCTTA GAATTAGCTG ATACGATTAG TTTTCAACTA ATAGAACATC CTCCTCTTAA TATAAGTGAA GGAGGCATGA TCCACGATGG TGTTGACAAT ATATTAGACG GTTTACGCAA TTTAATGGAT GATTACTCTG AGTGGCTAAA TCAAGAGGAA TTAAAAGAGA GGAAAATTAG CAAAATTTCA AACCTAAAAA TTCAATTTCA TAAAAATTTT GGCTACTACA TATCTATAAA TAAATCAAAA GTTAATTTAG CTCCACAACA TTGGATAAAA AGGCAAACAC TGACTAATGA AGAAAGGTAT ATCACTACAG ATATTAAAAA TAAAGAAAAT AAGATTTTCC AAATCAAAAG TCGAGCTTCA TCAAGGGAAT ATGAAATTTT TTGCGAATTA AGAAAAATGG TTGCTGAAAA AACAAAACAA ATAAGATCAA TTGCTAAATC CATAGCATCT CTTGATGCAT TACTTGGATT GTCAATTACT TCAGTAGAAA ACAATTTTAT AAAACCTGCG TTAATCCCAA TAAATGATTC ACAGAAAAAA AATAGTACTA AAATTATTGC AGGAAGAAAT CCAATAGTTG AGCAATTATT AAATGATAAA AAGTTTACAG CGAATGATAT TTGTTTTGAT GATAACCAGA AATTAATTAT TTTAACTGGT CCCAATGCAA GCGGGAAAAG TTGCTTTATA AGACAAATTG GTTTAATACA AATCCTTGCA CAAATTGGTA GCTTTATTCC GGCAAATAAA GCTGAAATCA AGATTGCAGA TAGGATTTTT ACAAGAATTG GGGCGGTTGA TGATCAATCT TCAGGACAAT CAACATTTAT GGTAGAAATG TCTGAAACTG CATCAATTTT AAATCAGGCA ACTTCGAGCT CACTTGTATT ACTTGATGAG ATAGGTAGAG GGACATCTAC TTTTGATGGC CTTTCCATAG CTTGGTCTGT AAGTGAATAT CTTGCAAAAA AAATTAAATG TAACACTATT TTTGCTACTC ACTATCATGA ATTGAATTAT TTAAAAAATT CAAATAAAAA TATAGAAAAT TTTCAAGTTT TAGTAGAGCA AAATAACGAT CAAATAATTT TTAGTCATAA GATTAAAAAA GGAGGTTCAA ACAAAAGTTA CGGAATAGAA GCAGCTAAAT TAGCAGGAGT TCCAAGAGAA GTTATAGAAA AAGCTAAATT AGTTTTAAAT TCTTTAGAAG AAAATAATAA ATTCAATAAA AATAATGATT AA
|
Protein sequence | MKEDTIIQKN LFAIDNENNE QKEITKIPED LSWEDLKKES QKRPRQRKNT TNLINKFKTD LISKNKNVCI NEESYSYKTV SKLKLTPVMK HYVTLKEENK DRLLLYRLGD FFECFFEDAV LISNLLEITL TSKDAGKEIG KIPMAGVPYH AMERYCADLI KKNYSVVICD QLEKSSGNYG TPIKRGITRI ITPGTVIEEG MLIAKKNNWI TAIYLSEENS NESYEWGISK ADVSTGELIT MEGQSLSKLF DEIIKLDSSE IIIGSNEVRN LLMNGNSQIS YTVSQETNFG INEANYLIKK YFQIVSLEGI GLKNLNNATR SLGGLLNYLK KINPLNLDKD SSVKISLDFP QIQFCHNKLI IDYQTQKNLE IKNTQRENNY VGSLLWSIDR TYTCMGARCL RRWIDSPLLN VNEIYKRQNI ISNFIESKQV RMDTQNLLRA MGDLERLAGR ACAGHASPRD LIAIAEGLKK LPRIKSIIEL FKYDLPDWTD QLKNIDEELL ELADTISFQL IEHPPLNISE GGMIHDGVDN ILDGLRNLMD DYSEWLNQEE LKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPQHWIK RQTLTNEERY ITTDIKNKEN KIFQIKSRAS SREYEIFCEL RKMVAEKTKQ IRSIAKSIAS LDALLGLSIT SVENNFIKPA LIPINDSQKK NSTKIIAGRN PIVEQLLNDK KFTANDICFD DNQKLIILTG PNASGKSCFI RQIGLIQILA QIGSFIPANK AEIKIADRIF TRIGAVDDQS SGQSTFMVEM SETASILNQA TSSSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIKCNTI FATHYHELNY LKNSNKNIEN FQVLVEQNND QIIFSHKIKK GGSNKSYGIE AAKLAGVPRE VIEKAKLVLN SLEENNKFNK NND
|
| |