Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_1243 |
Symbol | |
ID | 3606637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | + |
Start bp | 1744289 |
End bp | 1747069 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637688119 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_292436 |
Protein GI | 72383081 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.32952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCCA GCCAAAACCC TATACAAGGC AGTCTATTTG GGGGGAATGA GGAAAGCGAT CTCAATAAAG CTGAAAAGCT GAAAGGTTCA GAAAGATCGA ACGTAAATCT GTCACATCAA CAACTAAAAG AAGACGCATC GCTTAGACCT CGCATAAAAC AGACTCCTAA AAATCCAAAT CAGATAATTG ATTTGGATGA GTTGTCTCGT CTGGCAATTG AGGAACCCAA ATGGTCACAT CACAATTTAC CAAAAATTGA TGATCTCACT CCTGCACTGA GACATTATGT CGAATTAAAA AAAGAGAATC CTGACCGAGT TTTGCTTTAT AGGCTTGGAG ACTTCTTCGA ATGTTTTTTT GAAGATGCAA TAACACTCTC AAAACTTTTA GAAATCACAC TCACAAGTAA AGAAGCTGGA AAAAAAATTG GCAAAATTCC AATGGCTGGA ATTCCTCATC ATGCATCTGA TCGTTATTGC ACAGAACTAA TTAAAAAAGG TTTATCTATT GCTATTTGTG ATCAACTTGA AGCTGCTCCA ACAAAAGGCA ATAAATTAAT AAAAAGAGGC ATAACAAGAT TAATAACCCC TGGAACAATT CTAGAAGAGG GAATGTTAAG TGCTAAAAAA AATAATTGGC TAGCTTCTGT TCTTCTAGAA GCCAAATCTA ATCAAGAAAT AATTGACTGG TCTTTAGCAA AAATAGATGT AAGCACCGGT GAATTTATTG TTCAAGAAGG TCAAGGAAGT AATAATTTAC GACACGAATT AATTAAATTG AATGCAGCTG AAGTTATTTC AGAGAAAAAG TCAATCTCAA ATAAAATCTG GTATGAAGGA TTAATAGAAA TAACAGAATT TAATAGAACA TCATTCTCTA ATTTAGAGGC AATAACAACT ATTAAAAATC ATTATTGTCT AAATAATATT GATAGCCTAG GGATACATTC TAACTCACTA TCAATTAGGA CTATTGGAGG ATTAATTTCA TATTTAAATA AAACGCACCC AAATATAGAT GATAAATCTA ACAATGAAAT AAAAACAAAT ATCTGTATTG ACTACCCTCG AATAAAAAAT AGTCGATCAG GATTAATTAT AGATAGTCAA ACAAGAAGAA ATTTAGAAAT TACCTCAACT CAGAAGGATG GAAAATTTCA AGGCTCATTA CTTTGGGCAA TTGATAAAAC ATTAACTGCA ATGGGTGCGA GATGCATTAG GAGGTGGATA GAAGAACCTA CAAAAGATGT TAATGCAATT AAAAATAGAC AGAATATAAT TGGATTTCTA GTCAAATCAT CGACATTAAG AAAAAATATT AGAAAAACCC TTAGAGCCAT GGGTGACTTA GAACGTCTTT CAGGTAGAGC AGGAGCTCAA CAAGCGGGAG CACGTGATTT AGTTGCTATC GCAGAAGGAA TTAACCGTTT ACCCTTAATT AAAAATTATC TAAATGAGCC CATATTTGAT AAAACTAAGT ATTTTGACTC TATTATAAAT ATAGATAAAG ACTTAATAGA ACTTGCATCA AAAATCAATA ATCAAATCAT AGACAATCCA CCACTTAGCC TTACAGAAGG GGGTCTATTT TATGATGGTA TAAACCCTGT ACTTGATGGA CTAAGAAACC AACTAGATGA TCATAATATA TGGCTCAACT CTCAGGAATT AGAAGAGAGA AAGAAAAGCA ATATAAATAA TTTAAAGCTT CAATATCATC GATCTTTTGG ATACTTTTTA GCTGTTAGCA AATCAAAGGC TATAAATGTT CCAGATCACT GGATCAGAAG ACAAACCTTA ACTAATGAAG AACGTTTTGT GACACCAGGA TTAAAAGAAA GAGAAGGAAA AATCTTTCAA GTTAGAGCAA GAATATCAGA ACTAGAATAT GAACTTTTTT GTGATTTAAG AAAACTTACA GGGAGTAAAT CAAACATTAT TAGACAAGCT GCAAAAGCAA TATCTCACTT AGATGTTTTA ACTGGACTAG CCGAACTAGC CGCTAATCAC AACTATATTC AACCTCAGAT AATAGATATG AATGAGCAAG ATAAATCAAG AAAATTATCT ATTATTGATG GCCGTCATCC TGTAGTTGAA CAAATTTTAG TTGATAAAGT TTTTGTACCT AATGATATAG AACTTGGCTC TAAGACCGAT CTAATTATTC TTTCAGGGCC AAATGCTAGT GGAAAAAGTT GTTATTTAAG ACAAGTAGGT CTCTTGCAAA TCATGGCTCA AATTGGAAGT TGGATCCCAG CTAAATCAGC ACATATGGGA ATAGCTGATC AAGTATTTAC ACGTGTTGGA GCAGTGGATG ATTTAGCTTC AGGCCAATCA ACTTTCATGG TCGAAATGAT TGAAACTGCC TTCATTCTTA ATAATGCTAC TGAAAACTCA TTAGTTTTAT TAGATGAAAT TGGAAGAGGA ACTTCAACTT TTGATGGGCT ATCTATTGCC TGGTCAGTAA GCGAGTTTTT AGCAAAAAAA ATTAAAAGTC GTTCAATCTT TGCAACTCAT TACCATGAAT TGAATCAAAT TTCTGAATAT ATTGAAAATG TCGAGAATTA CAAAGTTGTA GTTGAATATA AAAATCATTC CCTTTCATTC CTTCACAAGG TCGAAAGAGG AGGAGCAAAT AAAAGTTATG GAATTGAAGC TGCGAGGCTT GCGGGAGTCC CCCCAGACGT AGTCAATAAT GCAAGATTGA TATTAAAAAA TCTAGAAAAA AATAACTCCA ACACCATTCA AATCACTAAG CCAATTGAAA GTTGCAAATA A
|
Protein sequence | MAASQNPIQG SLFGGNEESD LNKAEKLKGS ERSNVNLSHQ QLKEDASLRP RIKQTPKNPN QIIDLDELSR LAIEEPKWSH HNLPKIDDLT PALRHYVELK KENPDRVLLY RLGDFFECFF EDAITLSKLL EITLTSKEAG KKIGKIPMAG IPHHASDRYC TELIKKGLSI AICDQLEAAP TKGNKLIKRG ITRLITPGTI LEEGMLSAKK NNWLASVLLE AKSNQEIIDW SLAKIDVSTG EFIVQEGQGS NNLRHELIKL NAAEVISEKK SISNKIWYEG LIEITEFNRT SFSNLEAITT IKNHYCLNNI DSLGIHSNSL SIRTIGGLIS YLNKTHPNID DKSNNEIKTN ICIDYPRIKN SRSGLIIDSQ TRRNLEITST QKDGKFQGSL LWAIDKTLTA MGARCIRRWI EEPTKDVNAI KNRQNIIGFL VKSSTLRKNI RKTLRAMGDL ERLSGRAGAQ QAGARDLVAI AEGINRLPLI KNYLNEPIFD KTKYFDSIIN IDKDLIELAS KINNQIIDNP PLSLTEGGLF YDGINPVLDG LRNQLDDHNI WLNSQELEER KKSNINNLKL QYHRSFGYFL AVSKSKAINV PDHWIRRQTL TNEERFVTPG LKEREGKIFQ VRARISELEY ELFCDLRKLT GSKSNIIRQA AKAISHLDVL TGLAELAANH NYIQPQIIDM NEQDKSRKLS IIDGRHPVVE QILVDKVFVP NDIELGSKTD LIILSGPNAS GKSCYLRQVG LLQIMAQIGS WIPAKSAHMG IADQVFTRVG AVDDLASGQS TFMVEMIETA FILNNATENS LVLLDEIGRG TSTFDGLSIA WSVSEFLAKK IKSRSIFATH YHELNQISEY IENVENYKVV VEYKNHSLSF LHKVERGGAN KSYGIEAARL AGVPPDVVNN ARLILKNLEK NNSNTIQITK PIESCK
|
| |