Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_02371 |
Symbol | |
ID | 4716921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 220393 |
End bp | 222804 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640077936 |
Product | DNA mismatch repair protein MutS family protein |
Protein accession | YP_001008632 |
Protein GI | 123967774 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAGA AAAGTTATTC TAAAAAATCA TATTCAGATA ACACCTTAGA AGAAGAATCT ATAAACCTTT TAGAGTGGGA TTCATTAAAA ACGCATTTAT CTTCGTTCGC CTCAACGGAA ATGGGTAAAC AATCAATTTT AAGTTTTGTT ATACCTTCAG AATACGAGGC ATCTAAAAGA CTTTTGAATG AAACTGTTGA AATTAATGAG CTAGAAAAAA ATTTAGATAA ATCAATTAGT TTTTCTGGTG TTTTTGATAT TAGTAGAAAT ATAGAAATTT GTTCGAAGGG AGGTGTAATT ACATCTTCTG AGTTGTTAGA AATAGCGAAA ACAATTGCTG CAGCAAGAAA TTTAAAAAAA ATCTTATTAG ATTTTGAACA AAGACCTTAT ATTTCATCAT TCACAAAAAA TTTAATTGAC CATCAGAATA TCGAAACGAT TTTTAAAAAA GGCATTGAAT CGAATGGAAG GATTTCAGAC AATGCTAGTA ATGAGTTATC TATTCTTAGA AAAGAATTTT TATCTAAGAA ACTCGAAAGA AAAATATTAG TTGAGAAATT TATTCAAAAG AATTTAGCTT ATTTGCAAGA TACTACTATT GGAGATCGGT ATGGAAGGCC TGTTTTAGCA GTGAAAGTTA ATTATGTAGA TAAATTTAAG GGAATAATTC ATGACTCTTC ATCTTCAGGA AATACAGTAT ATTTCGAGCC TGAAAGTGTA GTAACTAAAG GTAATAAGAT TGCTTCTTTA GAGGCTAGGA TCACAGCAGA AGAATTTAAA TTACTTAAGA AATGGTCTCA TGTTGTTAGT GATAATTCAA AAAATCTTAT TGAAATGGCG TCCATTTTAT TAAGATTAGA AAATGCCCTA ACTCGTTCAA GATATTCGAA ATGGATTGGA GGTAAAACTC CTACATTTGA GAAAAATCCT ATTATTTCTT TAATTGGTTT TTCTCATCCG TTATTGATTT GGGAACATAA GAAAAAAGGA GCCCCTCCAC CAGTAGCTGT CGATTTTTAT ATAAATCGAA ATATTAAGGT TGTAGCTATT ACAGGCCCAA ATACTGGAGG TAAAACAGCA GCTTTAAAAG GTTTGGGCTT GTCTTTACTT ATGGCTAGAG CAGGATTATT GATACCTTCA ACTAATAATC CTATTATCCC TTTCTGTCCA AATATATATG TGGATATAGG AGATAATCAA TCATTAGAAG AAAATTTATC TACCTTCAGT GGGCATATAT CCCGCATAAA AGGGATATTA GATTCACTTG ATTATAAGAA AGGATTATCA GTTGTTTTGT TAGATGAGAT TGGATCTGGT ACAGATCCTC TTGAAGGAAG CGCTCTTGCG ATGGCTTTAT TAAAAGAATT TGCAAATAAA TCTGATATCA CTTTGGCAAC TACACATTAT GGAGATATTA AGGCTTTAAA ATATAACGAC TCGAGATTTG AAAACGTATC AGTTGCCTTT GATGAGGAAT CTTTGAAGCC AAAATATATA CTCAACTGGG GTATTCCTGG GAGAAGTAAT GCTTTGTCAA TTTCAAAGAG AATTGGTCTC GATGAAAGCA TACTCAATGA AGCTGCAAAT TATCTAAAGC CAAAAGAAGT TGACAACATT AACAGTATTA TTAAAGGACT TGAGGAAGAG AGGATTAAAC AACAAAATTC TGCAGAAGCT GCTGCAGAAT TGATTGCAAG GACTGAAATA CTACATGATG AACTGAAGAG AAATTATGAA CATCAAAAAA TAAATGCTCA AAAAATTCAA GAACTTGAAA GGTCTAAATT ATCAAAACAT ATCATATCCG CTAAAAAAGA GGTGATAGAT TTGATTAAAA AATTAAGAGA TAAAAATGTT AATGGAGAGG ATACGAGAAT TATTGGAAAA AGATTAAAGG AAATTGAGAC GGAACATTTA ACCCAAAAAA AATCTGAAAA GTCAATATCA TGGAACCCTC AGGTAGGCGA TTTTGTAAAG ATTAAAAGTC TAAATAGTAC GGGACAAATT GTAGATTTAG ATAAAAAAGG TGGTTTTTAC GAAGTTAAAT GTGGTTCATT CAGAAGCACA TTATCTGCAA ATGACTTTGA AGGTATTAAT GGAGAAAAGC CTAATTTCAA AATGTCAAAA ATTGAGATCA AGGCTACAAG AGAGGATTTT TCTTTTTCTA AAATTAGAAC AAGTAAAAAT ACAATTGACG TAAGAGGATT AAGAGTGCAT GAAGCCGAAA TAATTATTGA GGAGAAAATT AGAAGATTTC ATGGACCACT ATGGATTGTT CATGGAATTG GCACAGGAAA ATTAAAGAAA GGACTAAGAA ATTGGTTATC AGGTTTAAAT TATGTTGATA AGATTGAAGA TGCAGCCAAC AACGAGGGTG GCCCTGGTTG CAGTATTGCG TGGATAAAAT AA
|
Protein sequence | MQEKSYSKKS YSDNTLEEES INLLEWDSLK THLSSFASTE MGKQSILSFV IPSEYEASKR LLNETVEINE LEKNLDKSIS FSGVFDISRN IEICSKGGVI TSSELLEIAK TIAAARNLKK ILLDFEQRPY ISSFTKNLID HQNIETIFKK GIESNGRISD NASNELSILR KEFLSKKLER KILVEKFIQK NLAYLQDTTI GDRYGRPVLA VKVNYVDKFK GIIHDSSSSG NTVYFEPESV VTKGNKIASL EARITAEEFK LLKKWSHVVS DNSKNLIEMA SILLRLENAL TRSRYSKWIG GKTPTFEKNP IISLIGFSHP LLIWEHKKKG APPPVAVDFY INRNIKVVAI TGPNTGGKTA ALKGLGLSLL MARAGLLIPS TNNPIIPFCP NIYVDIGDNQ SLEENLSTFS GHISRIKGIL DSLDYKKGLS VVLLDEIGSG TDPLEGSALA MALLKEFANK SDITLATTHY GDIKALKYND SRFENVSVAF DEESLKPKYI LNWGIPGRSN ALSISKRIGL DESILNEAAN YLKPKEVDNI NSIIKGLEEE RIKQQNSAEA AAELIARTEI LHDELKRNYE HQKINAQKIQ ELERSKLSKH IISAKKEVID LIKKLRDKNV NGEDTRIIGK RLKEIETEHL TQKKSEKSIS WNPQVGDFVK IKSLNSTGQI VDLDKKGGFY EVKCGSFRST LSANDFEGIN GEKPNFKMSK IEIKATREDF SFSKIRTSKN TIDVRGLRVH EAEIIIEEKI RRFHGPLWIV HGIGTGKLKK GLRNWLSGLN YVDKIEDAAN NEGGPGCSIA WIK
|
| |