Gene A9601_18541 details

Gene Information

Locus tag: A9601_18541
Symbol:
ID: 4718592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1583698
End bp: 1586439
Gene Length: 2742 bp
Protein Length: 913 aa
Translation table: 11
GC content: 29%
IMG OID: 640079588
Product: DNA mismatch repair protein MutS
Protein accession: YP_001010244
Protein GI: 123969386
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
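
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1583698
end_bp = 1586439
gene_length_bp = 2742
protein_length_aa = 913

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions all three checks print `True`: the span is 2742 bp, which is 914 codons, or 913 residues once the stop codon is excluded.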


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAAG ACACGATAAT TCAAAAGAAT TTATTTGCGA TTGGTAATGA TATTAATGAA 
CAAAAAAAAA TAACAAAAAT TCCAGAAAAT TTATCTTTGG AAGATTTAAA AAAAGAATCG
CAAAAAAGAC CCAGACAAAG AAAAAATTCA ACTAATTTAA TAAATAAATT CAAGACTGAT
TTAATTTCAA ATAACAAAAA TGTTTGCATC AATGAAGAAT CTTATAGCTA TAAAACAGTT
TCAAAACTGA AATTAACTCC TGTAATGAAG CATTATGTAA CTCTAAAAGA AGAAAATAAA
GATAGGTTAT TACTTTATAG ATTAGGAGAC TTTTTTGAAT GTTTTTTTGA GGACGCTGTA
TTAATATCTA ACCTTTTGGA AATAACGCTT ACCAGTAAAG ATGCTGGCAA AGAGATTGGT
AAGATCCCTA TGGCAGGGGT TCCCCATCAT GCAATGGAGA GATACTGTGC TGATTTAATT
AAAAAAAATT ATTCTGTGGT TATATGCGAT CAATTAGAAA AAAGTTCTGG AAATTATGGG
ACTCCAATTA AAAGAGGAAT AACAAGAATA ATTACTCCTG GAACTGTAAT TGAAGAGGGG
ATGTTGATAG CAAAGAAAAA TAATTGGATT AGTGCTATTT ACTTATCAGA AGAAAACTCA
GATGAATCTT ATGAATGGGG CATATCAAAA GCTGATGTAA GCACAGGAGA ATTAATAACT
TTAGAAGGCC AATCTCTGTC AAAACTATTT GATGAAATTA TTAAATTAGA TTCTTCAGAA
ATCATTGTAG GAAGCAATAC AGCAAGAGAT TTATTAATTA AAGGAAATAG TCAAATTACA
TATACTGTTT CTCAAGAGAC TAATTTTGGA ATTAATGAAG CAAATTATCT AATAAAAAAT
TATTTCCAAA TTGCAAACCT AGAGGGAATA GGACTTAAAA ATTTAAACAA TGCCACTAGA
TCACTTGGAG GTTTATTAAA TTATTTAGAA AAAATTAATC CTTCAAATTT AGATAAAGAT
TCTTCTTTAA AAATTTCATT AGACTTTCCA CAAATCCAAT ATGGTCACAA CAAATTAATT
ATTGATTATC AAACTCAAAA AAACTTAGAA ATCAAAAATA CGCAACGAGA AAACAATTAT
GTAGGTTCGC TACTATGGAG TATTGATAGG ACTTATACCT GCATGGGCGC AAGGTGTTTA
AGAAGGTGGA TAGATTCACC ACTATTAAAC GTTAATGAAA TTTATAAAAG GCAAAATATA
ATTACAAACT TTCTAGAATC TAAAAAATTA CGTATAGATA CCCAAAATTT ACTTAGAGCA
ATGGGGGATT TAGAAAGACT TGCAGGTAGA GCTTGTGCAG GTCATGCAAG TCCCAGAGAC
TTAATTGCAA TAGCAGAAGG TTTAAAAAAA TTGCCTAGAC TAAAATCCAT AATTGAATTA
TTTAAATATG ATCTCCCAGA TTGGACTGAT CAATTAAAAA ATATTGATGA AGGACTCTTA
GAATTAGCTG ATACTATAAG TTTTAAATTA GTAGAAAATC CTCCTCTAAA TATTAGTGAA
GGAGGCATGA TCCACGATGG TGTTGACAAT ATATTAGATG GTTTACGAAA TTTAATGGAT
GATTACTCTG AGTGGCTAAA TAAAGAGGAA TCAAAGGAAA GAAAAATTAG CAAAATTTCA
AACCTAAAAA TTCAATTTCA TAAAAATTTT GGTTATTACA TTTCTATTAA TAAGTCAAAA
GTTAATTTAG CTCCACAACA TTGGATCAAA AGGCAAACAC TTACTAATGA AGAAAGGTAT
ATCACTTCTG AAATTAAAAA TAAAGAAAAT AAGATTTTCC AAATAAAAAG TAGAGCTTCA
TCAAAAGAAT ATGAAATTTT CTGCGAATTA AGAAATATAG TTGCTGAAAA AACAAAACAA
ATAAGATCAA TCGCAAAATC CATAGCATCT CTTGATGCAC TACTTGGTTT ATCAATTACT
TCAGTAGAAA ACAATTTTAT AAAACCTTCA TTAATACCAA TAAATGATTC AATGACAAAA
AATAGTACAA AAATTATCGC AGGAAGAAAT CCAATTGTAG AGCAATTGTT AAGTGATAAA
AAGTTTGTAG CAAACGATAT CTCTTTTGAG GATAATCAAA AATTAATTAT ATTAACCGGT
CCCAATGCAA GCGGAAAAAG TTGCTTTATA AGACAACTTG GTTTAATACA AATTCTCACG
CAAATTGGTA GCTTTGTTCC TGCTAATAAT GCTGAAATCA AGATTGCAGA TAGGATTTTC
ACAAGAATTG GGGCAGTTGA TGATCAATCA TCTGGACAAT CAACATTTAT GGTAGAAATG
TCTGAAACTG CATCAATTCT AAATCAGGCA ACTTCTAGCT CACTAGTTTT ACTTGATGAG
ATTGGCAGAG GGACATCTAC TTTTGATGGA CTCTCAATAG CTTGGTCAGT AAGTGAATAT
CTTGCAAAAA AAATTCAATG TAATACTATT TTTGCTACGC ACTATCATGA GCTTAATTAT
TTAAAAAATA CAAATAAGAA TATACAAAAT TTTCAAGTTT TAGTAGAACA AAATAACGAT
CAGCTAATTT TTAGCCACAG GATTGTTAAA GGGGGCTCAA ACAAAAGCTA TGGCATAGAA
GCAGCTAAAT TAGCAGGAGT CCCAAAAGAA GTTATAGAAA AAGCAAAATC AGTTTTAAAT
TCTTTAGAAG AAAATAACAA ATTAAATTAT GATATAAAGT AG
 
Protein sequence
MQEDTIIQKN LFAIGNDINE QKKITKIPEN LSLEDLKKES QKRPRQRKNS TNLINKFKTD 
LISNNKNVCI NEESYSYKTV SKLKLTPVMK HYVTLKEENK DRLLLYRLGD FFECFFEDAV
LISNLLEITL TSKDAGKEIG KIPMAGVPHH AMERYCADLI KKNYSVVICD QLEKSSGNYG
TPIKRGITRI ITPGTVIEEG MLIAKKNNWI SAIYLSEENS DESYEWGISK ADVSTGELIT
LEGQSLSKLF DEIIKLDSSE IIVGSNTARD LLIKGNSQIT YTVSQETNFG INEANYLIKN
YFQIANLEGI GLKNLNNATR SLGGLLNYLE KINPSNLDKD SSLKISLDFP QIQYGHNKLI
IDYQTQKNLE IKNTQRENNY VGSLLWSIDR TYTCMGARCL RRWIDSPLLN VNEIYKRQNI
ITNFLESKKL RIDTQNLLRA MGDLERLAGR ACAGHASPRD LIAIAEGLKK LPRLKSIIEL
FKYDLPDWTD QLKNIDEGLL ELADTISFKL VENPPLNISE GGMIHDGVDN ILDGLRNLMD
DYSEWLNKEE SKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPQHWIK RQTLTNEERY
ITSEIKNKEN KIFQIKSRAS SKEYEIFCEL RNIVAEKTKQ IRSIAKSIAS LDALLGLSIT
SVENNFIKPS LIPINDSMTK NSTKIIAGRN PIVEQLLSDK KFVANDISFE DNQKLIILTG
PNASGKSCFI RQLGLIQILT QIGSFVPANN AEIKIADRIF TRIGAVDDQS SGQSTFMVEM
SETASILNQA TSSSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIQCNTI FATHYHELNY
LKNTNKNIQN FQVLVEQNND QLIFSHRIVK GGSNKSYGIE AAKLAGVPKE VIEKAKSVLN
SLEENNKLNY DIK
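
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene sequence. This is a minimal sketch rather than a full translation tool: it builds the codon table from the standard NCBI amino-acid string, whose assignments translation table 11 shares (table 11 differs in its permitted start codons, which this sketch ignores). The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above. Each full line holds 60
# bases (20 codons), so the last line also starts in frame.
first_line = "ATGCAAGAAG ACACGATAAT TCAAAAGAAT TTATTTGCGA TTGGTAATGA TATTAATGAA"
last_line = "TCTTTAGAAG AAAATAACAA ATTAAATTAT GATATAAAGT AG"

n_term = translate(first_line)
c_term = translate(last_line)
print(n_term)  # MQEDTIIQKNLFAIGNDINE
print(c_term)  # SLEENNKLNYDIK
```

Both fragments match the start and end of the protein sequence listed above, and the final codon `TAG` is a stop codon, which is why it contributes no residue.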