Gene Information |
Locus tag | P9211_02381 |
Symbol | |
ID | 5731613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 228963 |
End bp | 231380 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284582 |
Product | DNA mismatch repair protein MutS family protein |
Protein accession | YP_001550123 |
Protein GI | 159902779 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
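
The coordinate fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon that is not counted in Protein Length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(228963, 231380)
print(length, protein_length(length))  # 2418 805
```

Under these assumptions the result matches the listed Gene Length (2418 bp) and Protein Length (805 aa).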
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGATT TGTTAAACAA TTCCGTATTG CGAAGCTCAG CCACCGCTTT CGAAGAAACA TTGGAGTTAT TGGATTGGCG AATACTGTGT AATCATCTCT CAACCTTTGC ACCGACTGCG AAAGGTAAGC GTGAGTGCAA GAATATTGAG ATTCCACAAG ATATTGAAAC AACAAGAAAA AGATTATCTG AAACTCTAGA AATCGGTACT TTAGATAAAA ATCTTGAAGC GGGCATAAGT TTTCAAGGCG TCAATGAATT GGATGGTGTG ATTTTGCATT GTTCAAAGGG TGGAATTGCA TCCGGGGAGG AATTGTTAAG CATTGCTGAG ACCCTTAGAG CGGTGAGACG CCTAAAAAAA ATTTTCGAGG ATCCAGTATC TCGCCCTTAT ACAACCTCTT TATTCATTGA TCTTGCCACT CACCATGAAC TTGAAAAAGT TCTTTTGTTT GGGATAGAAG AAGGAGGTAG AGTCGCAGAT AGAGCAAGCA ATCAACTCTC TCAGCTTCGT CGTCATTTGC AGGATTTACG GATAGGAAGA AGGAGTATTC TGCAGGATCT GATCAGAAGG AATGGTTCAA TCCTTCAAGA TACTGTTATC GCAGAAAGAT ATGGGAGACC TGTAATAGCA ATGAAAGTCG GTTCCGTAGA TCAAGTTCCT GGTGTTGTTC ATGACAGCTC ATCTTCTGGG AATACAATTT TTCTGGAACC ACAAATAGTT ATCTCTCTTG GCAATCAAAT CGTTGAAATT CAAACTAAGA TTTCCAAAGA GGAAGAACGT CTCCTTTCCA TATGGAGTCA ATTAGTCGCT AAGAATATTA ATTCATTGAA TCATCTGTCT AGTGTGCTTT TGCAATTGGA ACTTGGCTTG GCACGAGCTC GGTATGGCGA TTGGCTAGGA GGTGTGCTCC CTGTAATAAC AACTAAAGAG GACGATCCTT TCCTGATTAA AGATTTCTCC CATCCATTAT TGCTTTGGAA AAACAAAAAA CTTGGTGGTC ACAAAGTCAT TCCAATAACT TTCGATGTCT CCAAAGGACT AAAGGTAGTA GCTATTACTG GTCCCAATAC AGGCGGGAAA ACAATTGCTC TGAAAAGCTT TGGCTTGGCA GTTTTAATGG CAAGGTGTGG CATGCTTTTG CCATGTTCCT CGGAACCTAC TTTACCTTGG TGCAATCAGG TCTTAGCCGA TATAGGAGAT GAACAATCTC TTGAGCAAAA CCTTTCAACA TTTAGTGGAC ATATTGCTCG CATAGTTCGA ATACTTGATG TAATTGCTCA ATCTCCTGGA CCAACGGTAG TTCTATTGGA TGAGGTTGGT GCAGGCACTG ATCCCACTGA AGGGAGTGCT ATAGCTATTT CTCTATTACG AGCATTGGCC GATAGCGCAA GACTAACAAT CGCAACAACT CACCTTGGAG AATTGAAAGC ATTGAAATAT AGCGATTCTC GCTTTGAGAA TGCTTCGGTC GCCTTCGATA GTGAAACTAT TCGGCCTACC TATCATTTGT TATGGGGAAT TCCTGGCAGG AGCAATGCTG TTGCGATTGC TATTCGATTA GGGCTTGATT CTCAAATCAC AGAAACAGCA AAAAAATTAA TTGGACCCAA AGGATTGCAA GATGTTAATC AAGTGATCCT TGGACTCGAA GAACAGAGAG AGAGACAGCA GAAAGCTGCA GAAGACGCGG CAGCCCTTTT GGCTAGAACC GAATTGCTTT ATGAAGAATT GCTTGCTAGA TGGGAACAGC AACAAGAAAC CAATAGAAAA TGGCAGGAAG TTGGTAGATA TAAATTAGGT 
ACGTCTATAC GTGAAGGTCA AAGAGAAGTT AGGAATTTAA TCCGTCGTTT ACGTGCAGAA GGAGCAGATG GAGATATAGC AAGAAAAGCA GGGCAACGTT TAAAGCAGAT TGAATTTGAC TCGCGTCCAC AAGTTTCAAG AAGGAATGAT TTTAATTGGA GACCAAAAAT CGGAGATCGT GTGAGACTTA TAGCTCTTGG CAAGTCTGGA GAGATAATCT CCATTTCTGA AGATGGTTGC CATTTAACTG TTCTTTGTGG GATTTTTCGT AGCACTGTTG ATTTGTCTAG TATAGAAAGT CTTGATGGTC GCAAACCAAG TATTCCTAAG TCGTCGGTGA AAGTGACGAC TCCTCGAACT ATGGGAAGCT TTTCAACAGT GCGGACCGAT CGAAATACTT TGGATGTAAG AGGTTTAAGA GTTCATGAAG CTGAAGCAGT AGTTGAAGAA AGTTTGCGCA ATGCTATTGG GAAGGTCTGG GTAATTCATG GCATTGGAAC TGGAAAATTA AAAAGGGGCT TACGTCAATG GCTTGAGACT CTCCCCTATG TGGAGCGAGT CGTAGATGCA GAGCAAAATG ATGGCGGATC AGGTTGCAGT GTGATTTGGT TGCGATAG
Protein sequence | MDDLLNNSVL RSSATAFEET LELLDWRILC NHLSTFAPTA KGKRECKNIE IPQDIETTRK RLSETLEIGT LDKNLEAGIS FQGVNELDGV ILHCSKGGIA SGEELLSIAE TLRAVRRLKK IFEDPVSRPY TTSLFIDLAT HHELEKVLLF GIEEGGRVAD RASNQLSQLR RHLQDLRIGR RSILQDLIRR NGSILQDTVI AERYGRPVIA MKVGSVDQVP GVVHDSSSSG NTIFLEPQIV ISLGNQIVEI QTKISKEEER LLSIWSQLVA KNINSLNHLS SVLLQLELGL ARARYGDWLG GVLPVITTKE DDPFLIKDFS HPLLLWKNKK LGGHKVIPIT FDVSKGLKVV AITGPNTGGK TIALKSFGLA VLMARCGMLL PCSSEPTLPW CNQVLADIGD EQSLEQNLST FSGHIARIVR ILDVIAQSPG PTVVLLDEVG AGTDPTEGSA IAISLLRALA DSARLTIATT HLGELKALKY SDSRFENASV AFDSETIRPT YHLLWGIPGR SNAVAIAIRL GLDSQITETA KKLIGPKGLQ DVNQVILGLE EQRERQQKAA EDAAALLART ELLYEELLAR WEQQQETNRK WQEVGRYKLG TSIREGQREV RNLIRRLRAE GADGDIARKA GQRLKQIEFD SRPQVSRRND FNWRPKIGDR VRLIALGKSG EIISISEDGC HLTVLCGIFR STVDLSSIES LDGRKPSIPK SSVKVTTPRT MGSFSTVRTD RNTLDVRGLR VHEAEAVVEE SLRNAIGKVW VIHGIGTGKL KRGLRQWLET LPYVERVVDA EQNDGGSGCS VIWLR
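
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of that step, with a GC-fraction helper. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence. Running it on the full sequence would be one way to compare against the listed Gene Length and GC content, which is not done here.

```python
def clean_sequence(raw: str) -> str:
    """Strip block separators and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from this record (illustration only).
raw = "ATGGATGATT TGTTAAACAA TTCCGTATTG"
seq = clean_sequence(raw)
print(len(seq), round(gc_content(seq), 2))
```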