Gene P9515_18351 details

Gene Information

Locus tag: P9515_18351
Symbol:
ID: 4720413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 1617910
End bp: 1620654
Gene Length: 2745 bp
Protein Length: 914 aa
Translation table: 11
GC content: 30%
IMG OID: 640081535
Product: DNA mismatch repair protein MutS
Protein accession: YP_001012149
Protein GI: 123967068
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
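
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the CDS ends with a single stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1617910, 1620654)
print(length)                     # 2745
print(protein_length_aa(length))  # 914
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields in the record.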


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGACG AAACTATAAT TCAGAAAGAA TTATTTGCAA TCAATAATGA ACTTAAACTT 
CCAAAGAGCT CGAGTAAAAT TTCTGATAAT TTATCGACTG AAGAATTAAA AAAAGAGTCA
CAGAGAAGGC CGAGACAAAG AAAAAATGCC ACCAATGTAA TTAACAAATT CAAGAATGAG
TTACATAATA AAAGCAGGGA TAATTGCATT AATGAAAAAT CTTTCAGTTA TAAAACAGTT
GAAAAAGAAA AATTAACTCC AATTCTCAAG CATTATGTAA AATTAAAAGA AGAAAACAGC
GATAGATTGT TGTTATATAG ATTAGGTGAC TTTTTTGAAT GTTTTTTTGA AGATGCTGTA
CTTATATCTA ACCTATTAGA AATAACATTA ACCAGCAAAG ATGCTGGGAA AGAAATTGGG
AAAATCCCTA TGGCGGGGGT CCCTCATCAC GCAATGGAGA GATACTGCGC AGAGTTAATT
AAAAAAAATA ATTCGGTTGT TATTTGCGAT CAGTTAGAAA AAAGTACTGG AAATTATGGT
ACACCATTAA AAAGAGGAGT AACAAGAATA ATCACCCCTG GAACTGTAAT TGAAGAAGGT
ATGCTGGTTG CGAAAAAAAA TAATTGGATA ACAGCAATCC ACTTATCAAA AAATATTTCT
GAAAATCTAG ATGAATGGGG TATTTCAAGA GCAGATGTGA GTACAGGAGA GCTATTAACA
ATGGAAGGTA AATCACTTCC AAAGTTATTC GATGAACTTG TTAAGTTAGA TACATCAGAG
ATCATAATTG GAAGCAATGA AGAGAAAAGT TTATTAGAAG AACAAAATGA AAACATCACA
TATACCGTTA CGCAAGAGAC TTTTTTCAGT ATTAATGAAG CAAGTTCAAC TATAAAAAAC
TATTTCCAAA TTCTAAGTCT AGAAGGTCTA GGGCTCAAAA ATTTAAACAA TGCGACCAAA
GCACTTGGTG GCCTACTAAA TTATCTAGAG AAAATCAATC CTTCCAATTT AGAGAACGAT
TCTTCTCTAA GAATTTCTTT AGATTTCCCA CAAATACAAT TTCCAAAAGA TCATTTAATT
ATTGATTATC AAACTCAAAA AAATTTAGAA ATAAAAAATA CACAACGAGA AAACAATTAT
GCAGGTTCGC TTCTTTGGAG TATTGACAAA ACATATACCT GCATGGGCGC AAGATGTTTA
AGAAGATGGA TAGATTCTCC ATTATTAGAT ATTGATGAAA TTTGTAAAAG ACAAAATATT
ATTTCAAACT TTTTAGAGTC TAAAAAGTTA AGAATAGATA CCCAAAATAT ACTTAGAGCA
ATGGGCGATT TAGAGAGACT TTCGGGAAGG GCATGCGCTG GTCATGCAAG TCCAAGAGAT
TTGATAGCAA TATCTGAAGG CTTAAAAAAA TTACCTAAAT TAAAGTCAAT TGTTAATTTA
TTTAAGTATA AAATCCCATC TTGGACGGAT CAATTAAAGA ATGTTGATAA TGAACTTCTA
GAATTAGCAG ATCTAATAAG TTTCAAACTT GTAGGAAATC CCCCTTTAAA TACTAGTGAA
GGCGGAATTA TCCATGATGG AGTTGACAAT GTACTAGATG GCTTACGTAA TTTAATCGAC
GATTACTCAG ATTGGTTAAA TGAAGAAGAA TTAAAAGAAA GAAAAATTAG TAAAATCTCG
AATTTAAAGA TTCAATTTCA TAAAAACTTT GGTTATTACA TTTCAATAAA CAAATCAAAA
GTTAATTTAG CACCAGACCA TTGGATTAAA AGGCAAACTC TTACTAATGA GGAGAGATAT
GTAACTACAG AAATAAAAAA TAAAGAAAGT AAAATATTTC AAGTAAAAAA CCGGGCAGCG
GCAAAAGAGT ATGAATTATT TTGTGAGATA AGAAATCTTG TAGCTTCAAA AACAAAAAAA
ATTAGATCCA TCGCAAAATC TATAGCGTGT ATAGATGCAT TACTCGGTTT ATCCATTACA
TCATTAGAGA ATAATTTCAT AAAACCTACT CTTTTACCCA TTCAAAATTC AACGACCCAA
CAAAGTACAG AAATTATTAA AGGACGGAAT CCCATCGTTG AACAATTATT AACTAATAAA
GAATTTATAT CTAATGATAT TTTGTTTAAT AATAAGCAAA AATTAATAAT ATTAACTGGT
CCAAATGCTA GTGGAAAAAG CTGCTTTATT AGACAAATAG GTTTAATTCA AATTTTATCT
CAAATTGGTA GTTTCATCCC TGCAAGTAAA GCAAATATAC AAATTGCGGA TCGAATATTT
ACAAGAATTG GAGCTGTAGA TGATCAATCT TCCGGGCAGT CAACATTCAT GGTAGAAATG
TCAGAGACAG CATCAATACT AAACCAAGCG ACATCAAACT CTCTTGTTTT ACTTGATGAG
ATCGGCAGAG GGACATCTAC TTTTGACGGA TTATCCATAG CGTGGTCAGT GAGTGAATAT
CTTGCAAAAA AAATTGTATG TAATACCATT TTTGCTACTC ATTACCATGA ACTTAATTAT
CTTAAAAATA CAAATAAAAA TGTAGAAAAT TTTCAAGTAT TAGTTAAACA GAAAAAGGAT
CAACTATATT TTTGTCATAA AATAACAAAA GGTGGGGCAA ACAAAAGTTA CGGTATTGAA
GCTGCAAAGT TAGCTGGGGT TCCCAAAGAA GTTATCGATA AAGCTAAATT AGTTTTAGAT
TATTTAGAAA AAAATAATCA GTTAAATTCT CAAATACAAA TTTAA
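
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and a GC percentage computed. The helper names are hypothetical, and only the first line of the sequence is used as sample input; the sketch does not reproduce the database's own GC calculation.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = (
    "ATGCAAGACG AAACTATAAT TCAGAAAGAA "
    "TTATTTGCAA TCAATAATGA ACTTAAACTT"
)
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.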
 
Protein sequence
MQDETIIQKE LFAINNELKL PKSSSKISDN LSTEELKKES QRRPRQRKNA TNVINKFKNE 
LHNKSRDNCI NEKSFSYKTV EKEKLTPILK HYVKLKEENS DRLLLYRLGD FFECFFEDAV
LISNLLEITL TSKDAGKEIG KIPMAGVPHH AMERYCAELI KKNNSVVICD QLEKSTGNYG
TPLKRGVTRI ITPGTVIEEG MLVAKKNNWI TAIHLSKNIS ENLDEWGISR ADVSTGELLT
MEGKSLPKLF DELVKLDTSE IIIGSNEEKS LLEEQNENIT YTVTQETFFS INEASSTIKN
YFQILSLEGL GLKNLNNATK ALGGLLNYLE KINPSNLEND SSLRISLDFP QIQFPKDHLI
IDYQTQKNLE IKNTQRENNY AGSLLWSIDK TYTCMGARCL RRWIDSPLLD IDEICKRQNI
ISNFLESKKL RIDTQNILRA MGDLERLSGR ACAGHASPRD LIAISEGLKK LPKLKSIVNL
FKYKIPSWTD QLKNVDNELL ELADLISFKL VGNPPLNTSE GGIIHDGVDN VLDGLRNLID
DYSDWLNEEE LKERKISKIS NLKIQFHKNF GYYISINKSK VNLAPDHWIK RQTLTNEERY
VTTEIKNKES KIFQVKNRAA AKEYELFCEI RNLVASKTKK IRSIAKSIAC IDALLGLSIT
SLENNFIKPT LLPIQNSTTQ QSTEIIKGRN PIVEQLLTNK EFISNDILFN NKQKLIILTG
PNASGKSCFI RQIGLIQILS QIGSFIPASK ANIQIADRIF TRIGAVDDQS SGQSTFMVEM
SETASILNQA TSNSLVLLDE IGRGTSTFDG LSIAWSVSEY LAKKIVCNTI FATHYHELNY
LKNTNKNVEN FQVLVKQKKD QLYFCHKITK GGANKSYGIE AAKLAGVPKE VIDKAKLVLD
YLEKNNQLNS QIQI
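
The protein sequence corresponds to the gene sequence translated with translation table 11. Table 11 assigns the same amino acids to codons as the standard genetic code and differs from it only in the start codons it permits. The sketch below builds that codon table by hand and translates the first line of the gene sequence. The function and constant names are hypothetical; a library such as Biopython could be used instead.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these
# with the standard code; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    # Ignore a trailing partial codon, if any.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = (
    "ATGCAAGACG AAACTATAAT TCAGAAAGAA "
    "TTATTTGCAA TCAATAATGA ACTTAAACTT"
)
print(translate(first_line))  # MQDETIIQKELFAINNELKL
```

The printed residues match the first 20 residues of the protein sequence above.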