Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_02471 |
Symbol | |
ID | 4720383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 228029 |
End bp | 230440 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640079910 |
Product | DNA mismatch repair protein MutS family protein |
Protein accession | YP_001010563 |
Protein GI | 123965482 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.677193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGA AAAGTCATTA TAAAAATTTA TTATCGCACA AATCACTTTC AGAAGAATCT ATTGGTCTTT TAGAGTGGGA TACTTTGAAG ACTCAAATTG CCTCTTTTGC TTCTACAAAA ATGGGCAAGA ATGTAGTACT TCAGTTTGAA ATTCCCACTG AATATGAAAT TTCGAGGAGA TTATTGAAAG AAACTATAGA GATAAATGAG CTAGAAAAAA ACTTAGATAA ATCGATTAGT TTTTCTGGGG TGTTTGATAT TTGTAAAAAT ATCGAGATTT GTTCAAAAGG AGGAGTAATT AATTCTTCTG ATTTATTAGA GATAGCTGAA ACAATTTCAG CTTCAAGAAA TTTAAAAAAA ATCCTTTTGG ATTTTGAACA AAGACCTTTT ATTTCTGCTT TTTTAAAGGG TTTAATTGAC CATAATCAGA TTGAAAAAAT TTTGAAAAAT GGTATTGAAT CCAACGGTAG AATTTCGGAT CGAGCCAGTC AGAAATTAGC AAACCTTAGA CAAGAATTAT TATCTAAGAA ATCAGAAAGG AGGGTATTAG TAAATAAATT TATTCAAAAT AATCTACCTT ATATCCAAGA TACTATTGTT GGCGATAGAT ACGGAAGACC CGTTTTAGCA ATAAAAGTGC AATATGCAGA GAAATTTAAA GGTATAATTC ATGATTCCTC GGCATCAGGA AATACTATAT ATCTGGAGCC AGAATCTATA GTTTTAAAGG GTAATAAGAT CGCTTCTATG GAAGCTAGAG TTGCAGGAGA AGAATTTAAA TTATTAAAAG AGTGGTCACA TATTATTCGT GATAACAATC AAAGTCTTCT TGAAATGTCA AATATACTTT TGAGAGCGGA GTTTTCCCTA ACTCGTTCAA GATATTCAAA TTGGATTGGA GGCAATGCTC CAATAGTTGA AAATAGTCCA ATAGTTTCTT TAATGGGTTT TTCTCATCCT CTTTTGATTT GGGAAAATAA AAAAAAACAA GCTGCCAAAC CAGTTTCTAT TGATTTTCAT ATAGACAGAA ATACTAAGGT GGTAGCTATC ACAGGACCAA ATACTGGAGG TAAAACTGTT GCATTAAAGG GCCTTGGAAT AGCATTATTG ATGGCTAGAT CAGGATTATT TATTCCTTCA ATTCAAAAAC CAATAATTCC TTTTTGTCCA AATATATTTG TTGATATTGG GGATGATCAA TCTTTAGAGG GAAATTTATC TACTTTTAGT GGACACATAT TGCGCATTAA AAATATTTTG GAAGCTCTTA ATAATAAAAA GGGGTTTTCA GTTGTTTTAT TAGATGAGAT TGGATCTGGA ACCGATCCTT CTGAAGGTAC TGCACTTGCA ATAGCTTTAC TGAAAGAGTT TGCAATTGTC TCTGATATTA CCCTGGCTAC AACTCATTAT GGGGATATTA AAGCTTTGAA ATACAGTGAT AATAGATTTG AAAACGTTTC AGTTGCTTTT GACGAAGAAT CATTCAAGCC TAAATATACT CTTAATTGGG GAATACCGGG AAGAAGTAAT GCGTTATCAA TTTCAAAGAG AATTGGCATT AATGAAAAAA TACTGAATAA TGCTTCCAAC TATTTAAAAC CGAAAGAAGT TGAAAATATA AATAATATTA TTAAAGGATT AGAAGAGGAA AGGTTAAAGC AACAAAAATC GGCAGAAGAG GCCGCTGAAC TTATAGCCAG AACAGAAATA TTACATGATG AAATAAAGAA TAAATATGAA TTTCAAAAAC TCAATGCTTT AAAAATTCAG GAGGCTGAAA AGCAAAAACT ATCCAAACAT ATCAAAGAAG CTCAAAAAGA AGTTATTAAT TTGATCAAAA AATTAAAAGA CCAAAATGCA ACTGGAGAAG ATGCCAGATT AATTGGAATT AGGCTTAAAG AAATTGAGAC GGATCATCTT ACTCAATCAA ATGTTGAAAG GACAACATCT TGGAGTCCCA AAATAGGAGA TTTTATTAAA ATTAAAAGTT TAAATAGCTC TGGTCAAATA ATAGACATAG ATGAAAAAGC CAAGTCTTAC GAGGTTAAAT GTGGATCATT TAGAAGCACA TTATCAATTA ATGATTTTGA AGGACTTAAT GGCGAAAAAC CTAAGTTTAA AGATTCTCAA ATCCAAATTA GTTCAGTAAG AGAAGATTTT TCTTTTTCTA AAATAAGAAC AAGTAAAAAC ACCATAGATG TTAGGGGAAT GAGAGTTCAT GAAGCAGAAA TCATTATTGA AGAAAAATTT AAAAAGTTTC ATGGACCGCT CTGGATAGTT CATGGTATCG GAACTGGGAA ATTAAAAAAA GGATTACGTT TATGGTTATC AAGTTTGAAT TATGTTGATA AAGTTGAAGA TGCAGAAAAT AATGAGGGAG GAGCTGGTTG CAGTATTGCG TGGATAAAAT AA
|
Protein sequence | MKEKSHYKNL LSHKSLSEES IGLLEWDTLK TQIASFASTK MGKNVVLQFE IPTEYEISRR LLKETIEINE LEKNLDKSIS FSGVFDICKN IEICSKGGVI NSSDLLEIAE TISASRNLKK ILLDFEQRPF ISAFLKGLID HNQIEKILKN GIESNGRISD RASQKLANLR QELLSKKSER RVLVNKFIQN NLPYIQDTIV GDRYGRPVLA IKVQYAEKFK GIIHDSSASG NTIYLEPESI VLKGNKIASM EARVAGEEFK LLKEWSHIIR DNNQSLLEMS NILLRAEFSL TRSRYSNWIG GNAPIVENSP IVSLMGFSHP LLIWENKKKQ AAKPVSIDFH IDRNTKVVAI TGPNTGGKTV ALKGLGIALL MARSGLFIPS IQKPIIPFCP NIFVDIGDDQ SLEGNLSTFS GHILRIKNIL EALNNKKGFS VVLLDEIGSG TDPSEGTALA IALLKEFAIV SDITLATTHY GDIKALKYSD NRFENVSVAF DEESFKPKYT LNWGIPGRSN ALSISKRIGI NEKILNNASN YLKPKEVENI NNIIKGLEEE RLKQQKSAEE AAELIARTEI LHDEIKNKYE FQKLNALKIQ EAEKQKLSKH IKEAQKEVIN LIKKLKDQNA TGEDARLIGI RLKEIETDHL TQSNVERTTS WSPKIGDFIK IKSLNSSGQI IDIDEKAKSY EVKCGSFRST LSINDFEGLN GEKPKFKDSQ IQISSVREDF SFSKIRTSKN TIDVRGMRVH EAEIIIEEKF KKFHGPLWIV HGIGTGKLKK GLRLWLSSLN YVDKVEDAEN NEGGAGCSIA WIK
|
| |