Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_17711 |
Symbol | |
ID | 5731650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1594509 |
End bp | 1597235 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641286157 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001551656 |
Protein GI | 159904312 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCCG AGAGCATTTC ACTACAAGGC AGCCTGTTTG GTGACCTAGC AGAAAGCCCG ACTAACAGAA AAATCTCACC TGACTCAACT AAAGACGATG ATTTTTCAGA TGCAGAATTA ATAAAAAACG CTCAAGCAAG GCCCAGGCCT TCAAAAAGTT CAACAGACAT AGAACCTTTA GATCATCCAA TTGGCGATGA AAATCCTCAA GGGAGTGATC TGACAAATGT TTCCCATCAC AGTGACGTAG AAATTACGCA ACTCACTCCA GTTCTTCGTC ATTATGTAGA GCTTAAAAAA GAAAATCCAG AAAGAATATT GCTATATAGA CTTGGTGATT TTTTTGAATG TTTTTTTGAA GATGCAATTC TTTTATCACA ACTTTTAGAA CTCACTCTCA CTGGAAAAGC AGCTGGAAAA GAAATTGGAA GAATACCGAT GGCAGGCATA CCTCATCATT CTGCAGAGAG ATATTGCTCA ATGCTTATCC AAAAAGGTCT CTCTGTAGCT TTGTGTGACC AATTAGAAAG CACTCAAAAC AAAGAAGGGA AACTGCTCAA AAGAGGGGTG ACAAGGGTAC TTACTCCTGG AACCGTTATC GAAGAAGGAA TGCTGCAGGC TAAAAAAAAT AATTGGTTGG CTGCTGTTCT AATAGAAGCT AAACCTGATA AAAATTCATA CCAATGGGGC TTAGCTTATG CTGATATCAG TACTGGAGAA TTCTTTGTCA AAGAAGGAAT TGACCTAGAT GCTTTAGAAC AGGAAATTGC AAAGATAGAG GCTTCCGAAG TTATTTGCCA ACAAATAGAT GATAGATCTC TAGTAAAAAA ATGGTGTCCA AAGAATATAG AACTCACTCA AACTGCAAAG ACTCCTTTTA CACTTCATGA AGCTAAATCT TCCCTAAAGA ACCATTACAA ATTAAACACT ATAAATGGAC TTGGATTCCA TGAATGGGAA TTAGCAATTA GAGCAGCAGG AGGTTTATTA GCCTATTTAC ATGAAACCAA CCCTATCAAT TCAGTAGAAA GAACCAGATC TGATATTCCT CTTGAAGTAC CTCAACTAAG TGTTTCAAAT GAGGCGTTAA TCGTTGATGC TCAAACCAGA CGTAATCTAG AAATACTTAG TACTCAAAAA GATGGGCGAT TTCAAGGTTC ACTCCTTTGG GCAATTGACA GAACTCTTAC AGCTATGGGA GGTAGATGCC TTCGCCGTTG GCTGGAAAGC CCTCTAATAG ACCTGAAAAG AATTAATGCT AGGCAAGAAA TTGTAAGTTT TTTAGTTAAA GAAAGTTCGC TTAGGCAAGT TTTAAGAAAA TTGCTTAGAT CGATGGCAGA CCTGGAAAGA TTGTCTGGAA GAGCTGGCGC GGGCCATGCA GGAGCTAGAG ACTTAGTAGC TATAGCAGAT GGTCTGGAAA GACTGCCTCT TTTAGCAGCA AATTTACAAA GTCTTCCTAA AAAATCGCCT TCTTGGCTAA TCCCGCTCCA GAATGTAGAC AAAGATCTAC TAAAACTAGC CAATACTATT AGAAATACTC TAATTAATAA TCCACCTTTA AATCTTAGTG AAGGCGGTCT AATTCATGAC GGAATAGATC CTATATTAGA TGGCCTCAGA AATATGCTCG ATGATCAAAA TGAATGGCTA AATAGCCAAG AAGAACAAGA GAGAAAAGCT AGTGGGAATA ACAATCTTCG GCTTCAATAT CATCGAACAT TTGGTTATTT TCTTGCAGTT AGCAAATCAA AGGCTAATGA TGTTCCATCC CATTGGATAC GAAGACAAAC ATTAGCTAAT GAAGAAAGAT TTATAACTCC AGATTTAAAA ACAAGAGAAG GAAAAATCTT TCAATTAAAA GCGAGGTCAG GTCAAAGAGA GTATGAATTA TTCTCAGACC TGAGGCAAAT AGTAGGAGAT CATGCCCACG CTATACGTAA AGCAGCCAAA TCAATAGCAG GTCTAGATGC ACTTGCTGGA CTAGCAGAAT TAGCCGCATC CAATAATTAT TGTGCTCCTA AAATACTAGA TTCAAAAAGT GGTGCAAACA AAATTTGTAT TGAGGCTTGT AGACATCCAG TAGTTGAACA GATGTTGGTA GAAAGAGAAT TTCAACCTAA CAATATTGAA ATTGGTGATA AGACAGATTT AATTATTTTG ACTGGACCAA ATGCTAGTGG TAAAAGTTGT TATCTTCGGC AAATTGGATT AATACAATTA CTCTCACAAG TTGGGAGTTG GATTCCTGCT ACTAAAGGTT TAATAAGTAT TTCAGATAGA ATATTTACAC GAGTAGGAGC TGTTGATGAT CTTGCTGCAG GTCAATCTAC CTTCATGGTA GAAATGGCAG AAACAGCTTA TATACTTAAT CAAGCCACAA ACAATTCTTT AGTCTTACTA GACGAGATTG GTAGAGGTAC TGCAACTTTT GACGGTTTAT CTATTGCATG GTCTGTAAGT GAATTTCTTG CTAAAGAAAT TAAAAGTAGA ACCATCTTTG CAACGCATTA TCATGAGTTA AATTCTCTTG CAAAAACTTT CACAAATATT TCAAACTCTC AAGTTCTTGT AAAGCAAAAT GGCAATGATC TTCACTTTCT TCATAAAGTT GTAGATGGAG GTGCAAATAG AAGCTATGGG ATAGAAGCAG CAAGGTTAGC AGGAGTTCCA AAGAAAGTAA TTGACACTGC AAATAAAGTT CTGGAAAGAT TGGAAAATAA TAATTAA
|
Protein sequence | MTAESISLQG SLFGDLAESP TNRKISPDST KDDDFSDAEL IKNAQARPRP SKSSTDIEPL DHPIGDENPQ GSDLTNVSHH SDVEITQLTP VLRHYVELKK ENPERILLYR LGDFFECFFE DAILLSQLLE LTLTGKAAGK EIGRIPMAGI PHHSAERYCS MLIQKGLSVA LCDQLESTQN KEGKLLKRGV TRVLTPGTVI EEGMLQAKKN NWLAAVLIEA KPDKNSYQWG LAYADISTGE FFVKEGIDLD ALEQEIAKIE ASEVICQQID DRSLVKKWCP KNIELTQTAK TPFTLHEAKS SLKNHYKLNT INGLGFHEWE LAIRAAGGLL AYLHETNPIN SVERTRSDIP LEVPQLSVSN EALIVDAQTR RNLEILSTQK DGRFQGSLLW AIDRTLTAMG GRCLRRWLES PLIDLKRINA RQEIVSFLVK ESSLRQVLRK LLRSMADLER LSGRAGAGHA GARDLVAIAD GLERLPLLAA NLQSLPKKSP SWLIPLQNVD KDLLKLANTI RNTLINNPPL NLSEGGLIHD GIDPILDGLR NMLDDQNEWL NSQEEQERKA SGNNNLRLQY HRTFGYFLAV SKSKANDVPS HWIRRQTLAN EERFITPDLK TREGKIFQLK ARSGQREYEL FSDLRQIVGD HAHAIRKAAK SIAGLDALAG LAELAASNNY CAPKILDSKS GANKICIEAC RHPVVEQMLV EREFQPNNIE IGDKTDLIIL TGPNASGKSC YLRQIGLIQL LSQVGSWIPA TKGLISISDR IFTRVGAVDD LAAGQSTFMV EMAETAYILN QATNNSLVLL DEIGRGTATF DGLSIAWSVS EFLAKEIKSR TIFATHYHEL NSLAKTFTNI SNSQVLVKQN GNDLHFLHKV VDGGANRSYG IEAARLAGVP KKVIDTANKV LERLENNN
|
| |