Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_02381 |
Symbol | |
ID | 4911573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 221362 |
End bp | 223773 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640159804 |
Product | DNA mismatch repair protein MutS family protein |
Protein accession | YP_001090462 |
Protein GI | 126695576 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAGA AAAGTTATTC AAAAAAATCA TCTTCAAATA ACACCTTAGA AGAAGAATCT ATAAGCCTTT TAGAGTGGGA CTCATTAAAA ACGCATTTAT CTTCATTCGC CCTAACGGAA ATGGGTAAAA GAGCAATTTT AAGTTTTGAT ATCCCTTCAG AATACGAATC ATCTAAAAGA CTTTTGAATG AAACTGTTGA AATAACTCAG CTAGAAAATA ATTTAGATAA ATCAATTAGT TTTTCTGGTG TTTTTGATAT TAGTAGAAAT ATTGAAATAT GTTCTAAGGG AGGTGTAATT TCATCTTCTG AATTGTTAGA AATAGCAAAA ACAATTGCTG CAGCAAGAAA TTTAAAAAAA ATCTTATTAG ATTTTGAACA AAGGCCTTAT ATTTCATCAT TTACAAAAAA TTTAATTGAC CATCAGAATA TCGAAACGAT TTTTAAAAAA GGCATTGAAT CAAATGGAAG GATTTCAGAC AATGCTAGTA ATGAACTATC TATTCTTAGA AAAGAATTTT TATCTAAGAA ACTTGAAAGA AAAATATTAG TTGAGAAATT TATTCAAAAG AACTTAGCTT ATTTGCAAGA TACTACTATT GGAGATCGAT ATGGAAGGCC CGTTTTAGCA GTGAAAGTTA ATTATGTAGA TAAATTTAAA GGCATAATTC ATGACTCTTC ATCTTCAGGA AATACAGTAT ATTTTGAGCC TGATAGTGTA GTAAATAAAG GTAATAAGAT TGCTTCTTTA GAGGCTAGGA TCACAGCAGA AGAATTTAAA TTACTTAAAA AATGGTCTCT GGTTGTTAGT GATAATTCAG AAAATCTTAT TGAAATGGCA GCCATTTTAT TGAGATTAGA AAATGCCCTA ACTCGTTCAA GATATTCGAA ATGGATTGGA GGCAAAACTC CTACGTTTGA GAAAAGTCCT ATTATTTCTT TAATTGGCTT TACTCATCCG TTATTGATTT GGGAACATAA GAAAAAAGGA GCCTGTCCAC CAGTAGCTGT TGATTTTTAT ATAAATAGAA ATATTAAGGT TGTAGCTATT ACAGGTCCAA ATACTGGAGG TAAAACAGCA GCTTTAAAAG GCTTGGGCTT GTCTTTACTT ATGGCTAGAG CAGGATTATT GATACCTTCA ACTAATAATC CTATTATCCC TTTTTGTCCA AATATATATG TGGATATAGG AGATAATCAA TCATTAGAAG AAAATTTATC TACCTTCAGT GGGCATATAT CCCGCATAAA AGAGATATTA GATTTACTTG ATCATAAGAA AGGATTATCA GTTGTTTTGC TAGATGAGAT TGGATCTGGT ACAGATCCTC TTGAAGGTAG TGCTCTAGCG ATGGCTTTAT TAAAAGAATT TGCAAATAAA TCTGATATCA CTTTTGCCAC TACACATTAT GGGGATATTA AGGCTTTAAA ATATAACGAT TCAAGATTTG AAAACGTATC AGTTGCCTTT GATGAGGATT CTTTGAAGCC AAAATATATA CTCAACTGGG GTATTCCTGG GAGAAGTAAT GCTTTGTCAA TTTCAAAGAG AATTGGTCTT GATGAAAGCA TACTCAATGA AGCTGCAAAT TATCTAAAGC CAAAAGAAGT TGACAATATT AACAGTATTA TTAAAGGACT TGAGGAAGAG AAGATTAAAC AACAAAATTC TGCAGAAGCT GCTGCAGAAT TGATTGCAAG GACTGAAATA TTACATGATG AACTGAAGAG AAATTATGAA TATCAAAAAT TAAATGCTGA AAAAATCCAG GAAATTGAAA GGTCTAAATT ATCAAAACAT ATTGTATCCG CTAAAAAAGA GGTGATAGAT TTGATTAAAA AATTAAGAGA TAAAAATGTT AATGGAGAGG ATACGAGAAT TATTGGAAAA AGATTAAAGG AAATTGAGAC GGAACATTTA ACCCAAAAAA AATCTGAAAA GTCAATATCA TGGAACCCTC AGGTAGGTGA TTTTGTAAAG ATTAAAAGTC TAAATAGTAC GGGACAAATT GTAGGTTTAG ATAAAAAAGG TGGTTTTTAT GAAATTAAAT GTGGTTCATT TAGAAGCACA TTATCTGTAA ATGAATTTGA AGGTATTAAT GGAGAAAAGC CTAATTTCAA AAGTTCAAAA ATTGAGATCA AGTCTACAAG AGAAGATTTT TCTTTTTCTA AAATTAGAAC GAGTAAAAAT ACAATTGATG TAAGAGGGTT AAGAGTTCAT GAAGCCGAAA TAATTATTGA GGAGAAAATT AGAAAATTTC ATGGACCGCT ATGGATTGTT CATGGAATTG GCACAGGAAA ATTAAAAAAA GGACTAAGAA ATTGGTTATC AGGTTTAAAT TATGTTGATA AGATTGAAGA TGCAGCCAAT AACGAGGGTG GCCCTGGTTG CAGTATTGCG TGGATAAAAT AA
|
Protein sequence | MQEKSYSKKS SSNNTLEEES ISLLEWDSLK THLSSFALTE MGKRAILSFD IPSEYESSKR LLNETVEITQ LENNLDKSIS FSGVFDISRN IEICSKGGVI SSSELLEIAK TIAAARNLKK ILLDFEQRPY ISSFTKNLID HQNIETIFKK GIESNGRISD NASNELSILR KEFLSKKLER KILVEKFIQK NLAYLQDTTI GDRYGRPVLA VKVNYVDKFK GIIHDSSSSG NTVYFEPDSV VNKGNKIASL EARITAEEFK LLKKWSLVVS DNSENLIEMA AILLRLENAL TRSRYSKWIG GKTPTFEKSP IISLIGFTHP LLIWEHKKKG ACPPVAVDFY INRNIKVVAI TGPNTGGKTA ALKGLGLSLL MARAGLLIPS TNNPIIPFCP NIYVDIGDNQ SLEENLSTFS GHISRIKEIL DLLDHKKGLS VVLLDEIGSG TDPLEGSALA MALLKEFANK SDITFATTHY GDIKALKYND SRFENVSVAF DEDSLKPKYI LNWGIPGRSN ALSISKRIGL DESILNEAAN YLKPKEVDNI NSIIKGLEEE KIKQQNSAEA AAELIARTEI LHDELKRNYE YQKLNAEKIQ EIERSKLSKH IVSAKKEVID LIKKLRDKNV NGEDTRIIGK RLKEIETEHL TQKKSEKSIS WNPQVGDFVK IKSLNSTGQI VGLDKKGGFY EIKCGSFRST LSVNEFEGIN GEKPNFKSSK IEIKSTREDF SFSKIRTSKN TIDVRGLRVH EAEIIIEEKI RKFHGPLWIV HGIGTGKLKK GLRNWLSGLN YVDKIEDAAN NEGGPGCSIA WIK
|
| |