Gene P9301_18351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_18351 
Symbol 
ID4911157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1555676 
End bp1558417 
Gene Length2742 bp 
Protein Length913 aa 
Translation table11 
GC content29% 
IMG OID640161440 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001092059 
Protein GI126697173 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAG ATTTGATAAT TCAAAAGAAT TTATTTGCGA TTGGTAATGA TAATAATAAG 
CAAAAGGAAA AAACAAAAAT TCCAGAAGAT TTATCTTTAG AAGATTTAAA AAAAGAATCG
CAAAAAAGAC CCAGACAAAG AAAAAATTCA ACTAATTTAA TAAATAAATT CAAGACTGAT
TTAATTTCAA ATAAGAAAAA TGTTTGCATC AATGAAGAAT CTTATAGCTA TAAAACAGTT
TCAAAACTGA AATTAACACC TGTAATGAAG CATTATGTAA CTCTAAAAGA AGAAAATAAA
GATAGGTTAT TACTTTATAG ATTAGGAGAT TTTTTTGAAT GTTTTTTTGA GGATGCTGTA
TTAATATCTA ATCTTTTAGA AATAACGCTT ACCAGTAAAG ATGCTGGCAA AGAGATTGGT
AAGATCCCTA TGGCAGGGGT TCCCCATCAT GCAATGGACA GATACTGTGC TGATTTAATT
AAAAAAAATT ATTCTGTGGT TATATGTGAT CAATTAGAAA AAAGTTCTGG AAATTATGGG
ACTCCAATTA AAAGAGGAAT AACACGAATA ATTACTCCTG GAACTGTAAT TGAAGAGGGG
ATGTTGATAG CAAAGAAAAA TAATTGGATT ACTGCTATTT ACTTATCAGA AGAAAACTCA
AATGAATCTT ATGAATGGGG TATATCAAAA GCTGATGTAA GCACAGGAGA ATTAATAACT
TTAGAAGGCC AATCTCTATC AAAACTATTT GATGAAATTA TTAAATTAGA TTCTTCAGAA
ATCATTGTAG GAAGCAATGC AGTAAGAAAT TTATTAATTA AAGGAAATAG TCAAATTACA
TATACTGTTT CTCAAGAGAC TAATTTTGGA ATTAATGAAG CAAATTATCT AATAAAAAAT
TATTTCCAAA TTGCAAACTT AGAGGGAATA GGACTTAAAA ATTTAAAAAA TGCAACTAGA
TCACTTGGAG GTTTATTAAA TTATTTAGAA AAAATTAATC CTTCAAATTT AGATAAAGAT
TCTTCTGTAA AAATCTCATT AGACTTTCCA CAAATCCAAT ATGGTCACAA CAAATTAATT
ATTGATTATC AAACTCAAAA AAACTTAGAA ATCAAAAATA CACAACGAGA AAACAATTAT
GTAGGTTCGC TACTATGGAG TATTGATAGA ACTTATACTT GCATGGGCGC AAGGTGTTTA
AGAAGGTGGA TAGATTCACC ACTATTAAAC GTTAATGAAA TTTATAAAAG ACAAAATATA
ATTACAAACT TTTTTGAATC TAAGAAATTA CGTACAGATA CCCAAAATTT ACTTAGAGCA
ATGGGGGATT TAGAAAGACT TGCAGGTAGA GCTTGTGCAG GTCATGCAAG TCCAAGAGAC
TTAATTGCAA TAGCTGAAGG TTTAAAAAAA TTGCCTAGAC TAAAATCCAT AATTGAATTA
TTTAAATATG ATCTCCCAAA TTGGACTGAT CAACTTATAA ATATTGATGA AGGACTCTTA
GAATTAGCTG ATACTATAAG TTTTAAACTC GTAGAAAATC CTCCTCTAAG TATTAGTGAA
GGAGGCATGA TCCACGATGG AGTTGACAAT ATATTAGATG GTTTACGCAA TTTAATGGAT
GATTACTCAG AGTGGCTAAA TAAAGAGGAA TTAAAGGAAA GGAAAATTAG CAAAATTTCA
AACCTAAAAA TTCAATTTCA TAAAAATTTT GGTTATTACA TTTCTATAAA TAAGTCAAAA
GTTAATTTAG CTCCACAACA TTGGATCAAA AGGCAAACAC TTACTAATGA AGAAAGGTAT
ATCACTTCAG AAATTAAAAA TAAAGAAAAT AAGATTTTCC AAATAAAAAG TAGAGCTTCA
TCAAAAGAAT ATGAAATTTT CTGCGAATTA AGAAATATAG TTGCTGAAAA AACAAAACAA
ATAAGATCAA TCGCAAAATC CATAGCATCT CTTGATGCAT TGCTTGGTTT ATCAATTACT
TCAATAGAAA ACAATTTTAT AAAACCTTTA TTAATACCAA TAAATGATTC AATGACAAAA
AATAGTACAA AAATTATCGC AGGAAGAAAT CCAATTGTAG AGCAATTGTT AAGTGATAAA
AAGTTTGTAG CAAACGATAT TTCTTTTGAG GATAATCAAA AATTAATTAT ATTAACCGGT
CCCAATGCAA GCGGAAAAAG TTGCTTTATA AGACAACTTG GTTTAATACA AATTCTCGCA
CAAATTGGTA GCTTTGTTCC TGCTAATAAT GCTGAAATCA AGATTGCAGA TAGGATTTTC
ACAAGAATTG GGGCAGTTGA TGATCAATCA TCTGGGCAAT CAACATTTAT GGTAGAAATG
TCTGAAACTG CATCAATTCT AAATCAAGCA ACTTCTAACT CACTAGTTTT ACTTGATGAG
ATAGGCAGAG GGACATCTAC TTTTGATGGA CTTTCAATAG CTTGGTCAGT AAGTGAATAT
CTTGCAAAAA AAATTCAATG TAATACTATT TTTGCTACGC ACTATCATGA GCTTAATTAT
TTAAAAAATT CAAATAAGAA TATACAAAAT TTTCAAGTTT TAGTAGAACA AAATGACGAT
CAGCTAATTT TTAGCCACAG AATTGTTAGA GGGGGCTCAA ACAAAAGCTA TGGCATAGAA
GCAGCTAAAT TAGCAGGAGT TCCAAAAGAA GTTATAGAAA AAGCAAAATC AGTTTTAAAT
TCTTTAGAAG AAAATAACAA GTTAAATCAT AATATTAAGT AG
 
Protein sequence
MQEDLIIQKN LFAIGNDNNK QKEKTKIPED LSLEDLKKES QKRPRQRKNS TNLINKFKTD 
LISNKKNVCI NEESYSYKTV SKLKLTPVMK HYVTLKEENK DRLLLYRLGD FFECFFEDAV
LISNLLEITL TSKDAGKEIG KIPMAGVPHH AMDRYCADLI KKNYSVVICD QLEKSSGNYG
TPIKRGITRI ITPGTVIEEG MLIAKKNNWI TAIYLSEENS NESYEWGISK ADVSTGELIT
LEGQSLSKLF DEIIKLDSSE IIVGSNAVRN LLIKGNSQIT YTVSQETNFG INEANYLIKN
YFQIANLEGI GLKNLKNATR SLGGLLNYLE KINPSNLDKD SSVKISLDFP QIQYGHNKLI
IDYQTQKNLE IKNTQRENNY VGSLLWSIDR TYTCMGARCL RRWIDSPLLN VNEIYKRQNI
ITNFFESKKL RTDTQNLLRA MGDLERLAGR ACAGHASPRD LIAIAEGLKK LPRLKSIIEL
FKYDLPNWTD QLINIDEGLL ELADTISFKL VENPPLSISE GGMIHDGVDN ILDGLRNLMD
DYSEWLNKEE LKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPQHWIK RQTLTNEERY
ITSEIKNKEN KIFQIKSRAS SKEYEIFCEL RNIVAEKTKQ IRSIAKSIAS LDALLGLSIT
SIENNFIKPL LIPINDSMTK NSTKIIAGRN PIVEQLLSDK KFVANDISFE DNQKLIILTG
PNASGKSCFI RQLGLIQILA QIGSFVPANN AEIKIADRIF TRIGAVDDQS SGQSTFMVEM
SETASILNQA TSNSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIQCNTI FATHYHELNY
LKNSNKNIQN FQVLVEQNDD QLIFSHRIVR GGSNKSYGIE AAKLAGVPKE VIEKAKSVLN
SLEENNKLNH NIK