Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_18351 |
Symbol | |
ID | 4720413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 1617910 |
End bp | 1620654 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640081535 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001012149 |
Protein GI | 123967068 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGACG AAACTATAAT TCAGAAAGAA TTATTTGCAA TCAATAATGA ACTTAAACTT CCAAAGAGCT CGAGTAAAAT TTCTGATAAT TTATCGACTG AAGAATTAAA AAAAGAGTCA CAGAGAAGGC CGAGACAAAG AAAAAATGCC ACCAATGTAA TTAACAAATT CAAGAATGAG TTACATAATA AAAGCAGGGA TAATTGCATT AATGAAAAAT CTTTCAGTTA TAAAACAGTT GAAAAAGAAA AATTAACTCC AATTCTCAAG CATTATGTAA AATTAAAAGA AGAAAACAGC GATAGATTGT TGTTATATAG ATTAGGTGAC TTTTTTGAAT GTTTTTTTGA AGATGCTGTA CTTATATCTA ACCTATTAGA AATAACATTA ACCAGCAAAG ATGCTGGGAA AGAAATTGGG AAAATCCCTA TGGCGGGGGT CCCTCATCAC GCAATGGAGA GATACTGCGC AGAGTTAATT AAAAAAAATA ATTCGGTTGT TATTTGCGAT CAGTTAGAAA AAAGTACTGG AAATTATGGT ACACCATTAA AAAGAGGAGT AACAAGAATA ATCACCCCTG GAACTGTAAT TGAAGAAGGT ATGCTGGTTG CGAAAAAAAA TAATTGGATA ACAGCAATCC ACTTATCAAA AAATATTTCT GAAAATCTAG ATGAATGGGG TATTTCAAGA GCAGATGTGA GTACAGGAGA GCTATTAACA ATGGAAGGTA AATCACTTCC AAAGTTATTC GATGAACTTG TTAAGTTAGA TACATCAGAG ATCATAATTG GAAGCAATGA AGAGAAAAGT TTATTAGAAG AACAAAATGA AAACATCACA TATACCGTTA CGCAAGAGAC TTTTTTCAGT ATTAATGAAG CAAGTTCAAC TATAAAAAAC TATTTCCAAA TTCTAAGTCT AGAAGGTCTA GGGCTCAAAA ATTTAAACAA TGCGACCAAA GCACTTGGTG GCCTACTAAA TTATCTAGAG AAAATCAATC CTTCCAATTT AGAGAACGAT TCTTCTCTAA GAATTTCTTT AGATTTCCCA CAAATACAAT TTCCAAAAGA TCATTTAATT ATTGATTATC AAACTCAAAA AAATTTAGAA ATAAAAAATA CACAACGAGA AAACAATTAT GCAGGTTCGC TTCTTTGGAG TATTGACAAA ACATATACCT GCATGGGCGC AAGATGTTTA AGAAGATGGA TAGATTCTCC ATTATTAGAT ATTGATGAAA TTTGTAAAAG ACAAAATATT ATTTCAAACT TTTTAGAGTC TAAAAAGTTA AGAATAGATA CCCAAAATAT ACTTAGAGCA ATGGGCGATT TAGAGAGACT TTCGGGAAGG GCATGCGCTG GTCATGCAAG TCCAAGAGAT TTGATAGCAA TATCTGAAGG CTTAAAAAAA TTACCTAAAT TAAAGTCAAT TGTTAATTTA TTTAAGTATA AAATCCCATC TTGGACGGAT CAATTAAAGA ATGTTGATAA TGAACTTCTA GAATTAGCAG ATCTAATAAG TTTCAAACTT GTAGGAAATC CCCCTTTAAA TACTAGTGAA GGCGGAATTA TCCATGATGG AGTTGACAAT GTACTAGATG GCTTACGTAA TTTAATCGAC GATTACTCAG ATTGGTTAAA TGAAGAAGAA TTAAAAGAAA GAAAAATTAG TAAAATCTCG AATTTAAAGA TTCAATTTCA TAAAAACTTT GGTTATTACA TTTCAATAAA CAAATCAAAA GTTAATTTAG CACCAGACCA TTGGATTAAA AGGCAAACTC TTACTAATGA GGAGAGATAT GTAACTACAG AAATAAAAAA TAAAGAAAGT AAAATATTTC AAGTAAAAAA CCGGGCAGCG GCAAAAGAGT ATGAATTATT TTGTGAGATA AGAAATCTTG TAGCTTCAAA AACAAAAAAA ATTAGATCCA TCGCAAAATC TATAGCGTGT ATAGATGCAT TACTCGGTTT ATCCATTACA TCATTAGAGA ATAATTTCAT AAAACCTACT CTTTTACCCA TTCAAAATTC AACGACCCAA CAAAGTACAG AAATTATTAA AGGACGGAAT CCCATCGTTG AACAATTATT AACTAATAAA GAATTTATAT CTAATGATAT TTTGTTTAAT AATAAGCAAA AATTAATAAT ATTAACTGGT CCAAATGCTA GTGGAAAAAG CTGCTTTATT AGACAAATAG GTTTAATTCA AATTTTATCT CAAATTGGTA GTTTCATCCC TGCAAGTAAA GCAAATATAC AAATTGCGGA TCGAATATTT ACAAGAATTG GAGCTGTAGA TGATCAATCT TCCGGGCAGT CAACATTCAT GGTAGAAATG TCAGAGACAG CATCAATACT AAACCAAGCG ACATCAAACT CTCTTGTTTT ACTTGATGAG ATCGGCAGAG GGACATCTAC TTTTGACGGA TTATCCATAG CGTGGTCAGT GAGTGAATAT CTTGCAAAAA AAATTGTATG TAATACCATT TTTGCTACTC ATTACCATGA ACTTAATTAT CTTAAAAATA CAAATAAAAA TGTAGAAAAT TTTCAAGTAT TAGTTAAACA GAAAAAGGAT CAACTATATT TTTGTCATAA AATAACAAAA GGTGGGGCAA ACAAAAGTTA CGGTATTGAA GCTGCAAAGT TAGCTGGGGT TCCCAAAGAA GTTATCGATA AAGCTAAATT AGTTTTAGAT TATTTAGAAA AAAATAATCA GTTAAATTCT CAAATACAAA TTTAA
|
Protein sequence | MQDETIIQKE LFAINNELKL PKSSSKISDN LSTEELKKES QRRPRQRKNA TNVINKFKNE LHNKSRDNCI NEKSFSYKTV EKEKLTPILK HYVKLKEENS DRLLLYRLGD FFECFFEDAV LISNLLEITL TSKDAGKEIG KIPMAGVPHH AMERYCAELI KKNNSVVICD QLEKSTGNYG TPLKRGVTRI ITPGTVIEEG MLVAKKNNWI TAIHLSKNIS ENLDEWGISR ADVSTGELLT MEGKSLPKLF DELVKLDTSE IIIGSNEEKS LLEEQNENIT YTVTQETFFS INEASSTIKN YFQILSLEGL GLKNLNNATK ALGGLLNYLE KINPSNLEND SSLRISLDFP QIQFPKDHLI IDYQTQKNLE IKNTQRENNY AGSLLWSIDK TYTCMGARCL RRWIDSPLLD IDEICKRQNI ISNFLESKKL RIDTQNILRA MGDLERLSGR ACAGHASPRD LIAISEGLKK LPKLKSIVNL FKYKIPSWTD QLKNVDNELL ELADLISFKL VGNPPLNTSE GGIIHDGVDN VLDGLRNLID DYSDWLNEEE LKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPDHWIK RQTLTNEERY VTTEIKNKES KIFQVKNRAA AKEYELFCEI RNLVASKTKK IRSIAKSIAC IDALLGLSIT SLENNFIKPT LLPIQNSTTQ QSTEIIKGRN PIVEQLLTNK EFISNDILFN NKQKLIILTG PNASGKSCFI RQIGLIQILS QIGSFIPASK ANIQIADRIF TRIGAVDDQS SGQSTFMVEM SETASILNQA TSNSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIVCNTI FATHYHELNY LKNTNKNVEN FQVLVKQKKD QLYFCHKITK GGANKSYGIE AAKLAGVPKE VIDKAKLVLD YLEKNNQLNS QIQI
|
| |