Gene P9211_02381 details


Gene Information

Locus tag: P9211_02381
Symbol:
ID: 5731613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 228963
End bp: 231380
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 42%
IMG OID: 641284582
Product: DNA mismatch repair protein MutS family protein
Protein accession: YP_001550123
Protein GI: 159902779
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
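
The length fields in this record agree with its coordinates, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(228963, 231380)
print(length_bp)                  # 2418
print(protein_length(length_bp))  # 805
```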


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGATT TGTTAAACAA TTCCGTATTG CGAAGCTCAG CCACCGCTTT CGAAGAAACA 
TTGGAGTTAT TGGATTGGCG AATACTGTGT AATCATCTCT CAACCTTTGC ACCGACTGCG
AAAGGTAAGC GTGAGTGCAA GAATATTGAG ATTCCACAAG ATATTGAAAC AACAAGAAAA
AGATTATCTG AAACTCTAGA AATCGGTACT TTAGATAAAA ATCTTGAAGC GGGCATAAGT
TTTCAAGGCG TCAATGAATT GGATGGTGTG ATTTTGCATT GTTCAAAGGG TGGAATTGCA
TCCGGGGAGG AATTGTTAAG CATTGCTGAG ACCCTTAGAG CGGTGAGACG CCTAAAAAAA
ATTTTCGAGG ATCCAGTATC TCGCCCTTAT ACAACCTCTT TATTCATTGA TCTTGCCACT
CACCATGAAC TTGAAAAAGT TCTTTTGTTT GGGATAGAAG AAGGAGGTAG AGTCGCAGAT
AGAGCAAGCA ATCAACTCTC TCAGCTTCGT CGTCATTTGC AGGATTTACG GATAGGAAGA
AGGAGTATTC TGCAGGATCT GATCAGAAGG AATGGTTCAA TCCTTCAAGA TACTGTTATC
GCAGAAAGAT ATGGGAGACC TGTAATAGCA ATGAAAGTCG GTTCCGTAGA TCAAGTTCCT
GGTGTTGTTC ATGACAGCTC ATCTTCTGGG AATACAATTT TTCTGGAACC ACAAATAGTT
ATCTCTCTTG GCAATCAAAT CGTTGAAATT CAAACTAAGA TTTCCAAAGA GGAAGAACGT
CTCCTTTCCA TATGGAGTCA ATTAGTCGCT AAGAATATTA ATTCATTGAA TCATCTGTCT
AGTGTGCTTT TGCAATTGGA ACTTGGCTTG GCACGAGCTC GGTATGGCGA TTGGCTAGGA
GGTGTGCTCC CTGTAATAAC AACTAAAGAG GACGATCCTT TCCTGATTAA AGATTTCTCC
CATCCATTAT TGCTTTGGAA AAACAAAAAA CTTGGTGGTC ACAAAGTCAT TCCAATAACT
TTCGATGTCT CCAAAGGACT AAAGGTAGTA GCTATTACTG GTCCCAATAC AGGCGGGAAA
ACAATTGCTC TGAAAAGCTT TGGCTTGGCA GTTTTAATGG CAAGGTGTGG CATGCTTTTG
CCATGTTCCT CGGAACCTAC TTTACCTTGG TGCAATCAGG TCTTAGCCGA TATAGGAGAT
GAACAATCTC TTGAGCAAAA CCTTTCAACA TTTAGTGGAC ATATTGCTCG CATAGTTCGA
ATACTTGATG TAATTGCTCA ATCTCCTGGA CCAACGGTAG TTCTATTGGA TGAGGTTGGT
GCAGGCACTG ATCCCACTGA AGGGAGTGCT ATAGCTATTT CTCTATTACG AGCATTGGCC
GATAGCGCAA GACTAACAAT CGCAACAACT CACCTTGGAG AATTGAAAGC ATTGAAATAT
AGCGATTCTC GCTTTGAGAA TGCTTCGGTC GCCTTCGATA GTGAAACTAT TCGGCCTACC
TATCATTTGT TATGGGGAAT TCCTGGCAGG AGCAATGCTG TTGCGATTGC TATTCGATTA
GGGCTTGATT CTCAAATCAC AGAAACAGCA AAAAAATTAA TTGGACCCAA AGGATTGCAA
GATGTTAATC AAGTGATCCT TGGACTCGAA GAACAGAGAG AGAGACAGCA GAAAGCTGCA
GAAGACGCGG CAGCCCTTTT GGCTAGAACC GAATTGCTTT ATGAAGAATT GCTTGCTAGA
TGGGAACAGC AACAAGAAAC CAATAGAAAA TGGCAGGAAG TTGGTAGATA TAAATTAGGT
ACGTCTATAC GTGAAGGTCA AAGAGAAGTT AGGAATTTAA TCCGTCGTTT ACGTGCAGAA
GGAGCAGATG GAGATATAGC AAGAAAAGCA GGGCAACGTT TAAAGCAGAT TGAATTTGAC
TCGCGTCCAC AAGTTTCAAG AAGGAATGAT TTTAATTGGA GACCAAAAAT CGGAGATCGT
GTGAGACTTA TAGCTCTTGG CAAGTCTGGA GAGATAATCT CCATTTCTGA AGATGGTTGC
CATTTAACTG TTCTTTGTGG GATTTTTCGT AGCACTGTTG ATTTGTCTAG TATAGAAAGT
CTTGATGGTC GCAAACCAAG TATTCCTAAG TCGTCGGTGA AAGTGACGAC TCCTCGAACT
ATGGGAAGCT TTTCAACAGT GCGGACCGAT CGAAATACTT TGGATGTAAG AGGTTTAAGA
GTTCATGAAG CTGAAGCAGT AGTTGAAGAA AGTTTGCGCA ATGCTATTGG GAAGGTCTGG
GTAATTCATG GCATTGGAAC TGGAAAATTA AAAAGGGGCT TACGTCAATG GCTTGAGACT
CTCCCCTATG TGGAGCGAGT CGTAGATGCA GAGCAAAATG ATGGCGGATC AGGTTGCAGT
GTGATTTGGT TGCGATAG
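
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined into one string before any analysis. A minimal sketch of that clean-up, together with a GC-fraction calculation of the kind that presumably underlies the GC content field, could look like this. The helper names are hypothetical, and how the database rounds its reported percentage is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full block to analyse the gene.
first_line = "ATGGATGATT TGTTAAACAA TTCCGTATTG CGAAGCTCAG CCACCGCTTT CGAAGAAACA"
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line) * 100))
```

Passing the complete gene sequence block to `gc_fraction` gives a value that can be compared with the GC content listed in the gene information.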
 
Protein sequence
MDDLLNNSVL RSSATAFEET LELLDWRILC NHLSTFAPTA KGKRECKNIE IPQDIETTRK 
RLSETLEIGT LDKNLEAGIS FQGVNELDGV ILHCSKGGIA SGEELLSIAE TLRAVRRLKK
IFEDPVSRPY TTSLFIDLAT HHELEKVLLF GIEEGGRVAD RASNQLSQLR RHLQDLRIGR
RSILQDLIRR NGSILQDTVI AERYGRPVIA MKVGSVDQVP GVVHDSSSSG NTIFLEPQIV
ISLGNQIVEI QTKISKEEER LLSIWSQLVA KNINSLNHLS SVLLQLELGL ARARYGDWLG
GVLPVITTKE DDPFLIKDFS HPLLLWKNKK LGGHKVIPIT FDVSKGLKVV AITGPNTGGK
TIALKSFGLA VLMARCGMLL PCSSEPTLPW CNQVLADIGD EQSLEQNLST FSGHIARIVR
ILDVIAQSPG PTVVLLDEVG AGTDPTEGSA IAISLLRALA DSARLTIATT HLGELKALKY
SDSRFENASV AFDSETIRPT YHLLWGIPGR SNAVAIAIRL GLDSQITETA KKLIGPKGLQ
DVNQVILGLE EQRERQQKAA EDAAALLART ELLYEELLAR WEQQQETNRK WQEVGRYKLG
TSIREGQREV RNLIRRLRAE GADGDIARKA GQRLKQIEFD SRPQVSRRND FNWRPKIGDR
VRLIALGKSG EIISISEDGC HLTVLCGIFR STVDLSSIES LDGRKPSIPK SSVKVTTPRT
MGSFSTVRTD RNTLDVRGLR VHEAEAVVEE SLRNAIGKVW VIHGIGTGKL KRGLRQWLET
LPYVERVVDA EQNDGGSGCS VIWLR
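
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact NCBI amino acid string, which for translation table 11 has the same amino acid assignments as the standard code. It is an illustration and not the database's own pipeline: `translate` is a hypothetical helper, it stops at the first stop codon, and it ignores the alternative start codons that table 11 permits, which is harmless here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGATGATT TGTTAAACAA TTCCGTATTG"))  # MDDLLNNSVL
print(translate("GTGATTTGGT TGCGATAG"))               # VIWLR
```

The two calls use the first thirty bases and the last eighteen bases of the gene sequence, and their output matches the start and end of the protein sequence shown above.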