Gene P9515_02471 details

Gene Information

Locus tag: P9515_02471
Symbol:
ID: 4720383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 228029
End bp: 230440
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 31%
IMG OID: 640079910
Product: DNA mismatch repair protein MutS family protein
Protein accession: YP_001010563
Protein GI: 123965482
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
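
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it is the convention that reproduces the stated 2412 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(228029, 230440)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2412 803
```

Both values match the Gene Length and Protein Length fields listed above.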


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.677193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAGA AAAGTCATTA TAAAAATTTA TTATCGCACA AATCACTTTC AGAAGAATCT 
ATTGGTCTTT TAGAGTGGGA TACTTTGAAG ACTCAAATTG CCTCTTTTGC TTCTACAAAA
ATGGGCAAGA ATGTAGTACT TCAGTTTGAA ATTCCCACTG AATATGAAAT TTCGAGGAGA
TTATTGAAAG AAACTATAGA GATAAATGAG CTAGAAAAAA ACTTAGATAA ATCGATTAGT
TTTTCTGGGG TGTTTGATAT TTGTAAAAAT ATCGAGATTT GTTCAAAAGG AGGAGTAATT
AATTCTTCTG ATTTATTAGA GATAGCTGAA ACAATTTCAG CTTCAAGAAA TTTAAAAAAA
ATCCTTTTGG ATTTTGAACA AAGACCTTTT ATTTCTGCTT TTTTAAAGGG TTTAATTGAC
CATAATCAGA TTGAAAAAAT TTTGAAAAAT GGTATTGAAT CCAACGGTAG AATTTCGGAT
CGAGCCAGTC AGAAATTAGC AAACCTTAGA CAAGAATTAT TATCTAAGAA ATCAGAAAGG
AGGGTATTAG TAAATAAATT TATTCAAAAT AATCTACCTT ATATCCAAGA TACTATTGTT
GGCGATAGAT ACGGAAGACC CGTTTTAGCA ATAAAAGTGC AATATGCAGA GAAATTTAAA
GGTATAATTC ATGATTCCTC GGCATCAGGA AATACTATAT ATCTGGAGCC AGAATCTATA
GTTTTAAAGG GTAATAAGAT CGCTTCTATG GAAGCTAGAG TTGCAGGAGA AGAATTTAAA
TTATTAAAAG AGTGGTCACA TATTATTCGT GATAACAATC AAAGTCTTCT TGAAATGTCA
AATATACTTT TGAGAGCGGA GTTTTCCCTA ACTCGTTCAA GATATTCAAA TTGGATTGGA
GGCAATGCTC CAATAGTTGA AAATAGTCCA ATAGTTTCTT TAATGGGTTT TTCTCATCCT
CTTTTGATTT GGGAAAATAA AAAAAAACAA GCTGCCAAAC CAGTTTCTAT TGATTTTCAT
ATAGACAGAA ATACTAAGGT GGTAGCTATC ACAGGACCAA ATACTGGAGG TAAAACTGTT
GCATTAAAGG GCCTTGGAAT AGCATTATTG ATGGCTAGAT CAGGATTATT TATTCCTTCA
ATTCAAAAAC CAATAATTCC TTTTTGTCCA AATATATTTG TTGATATTGG GGATGATCAA
TCTTTAGAGG GAAATTTATC TACTTTTAGT GGACACATAT TGCGCATTAA AAATATTTTG
GAAGCTCTTA ATAATAAAAA GGGGTTTTCA GTTGTTTTAT TAGATGAGAT TGGATCTGGA
ACCGATCCTT CTGAAGGTAC TGCACTTGCA ATAGCTTTAC TGAAAGAGTT TGCAATTGTC
TCTGATATTA CCCTGGCTAC AACTCATTAT GGGGATATTA AAGCTTTGAA ATACAGTGAT
AATAGATTTG AAAACGTTTC AGTTGCTTTT GACGAAGAAT CATTCAAGCC TAAATATACT
CTTAATTGGG GAATACCGGG AAGAAGTAAT GCGTTATCAA TTTCAAAGAG AATTGGCATT
AATGAAAAAA TACTGAATAA TGCTTCCAAC TATTTAAAAC CGAAAGAAGT TGAAAATATA
AATAATATTA TTAAAGGATT AGAAGAGGAA AGGTTAAAGC AACAAAAATC GGCAGAAGAG
GCCGCTGAAC TTATAGCCAG AACAGAAATA TTACATGATG AAATAAAGAA TAAATATGAA
TTTCAAAAAC TCAATGCTTT AAAAATTCAG GAGGCTGAAA AGCAAAAACT ATCCAAACAT
ATCAAAGAAG CTCAAAAAGA AGTTATTAAT TTGATCAAAA AATTAAAAGA CCAAAATGCA
ACTGGAGAAG ATGCCAGATT AATTGGAATT AGGCTTAAAG AAATTGAGAC GGATCATCTT
ACTCAATCAA ATGTTGAAAG GACAACATCT TGGAGTCCCA AAATAGGAGA TTTTATTAAA
ATTAAAAGTT TAAATAGCTC TGGTCAAATA ATAGACATAG ATGAAAAAGC CAAGTCTTAC
GAGGTTAAAT GTGGATCATT TAGAAGCACA TTATCAATTA ATGATTTTGA AGGACTTAAT
GGCGAAAAAC CTAAGTTTAA AGATTCTCAA ATCCAAATTA GTTCAGTAAG AGAAGATTTT
TCTTTTTCTA AAATAAGAAC AAGTAAAAAC ACCATAGATG TTAGGGGAAT GAGAGTTCAT
GAAGCAGAAA TCATTATTGA AGAAAAATTT AAAAAGTTTC ATGGACCGCT CTGGATAGTT
CATGGTATCG GAACTGGGAA ATTAAAAAAA GGATTACGTT TATGGTTATC AAGTTTGAAT
TATGTTGATA AAGTTGAAGA TGCAGAAAAT AATGAGGGAG GAGCTGGTTG CAGTATTGCG
TGGATAAAAT AA
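
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined into one string before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of any database tool) strip the whitespace and compute GC content as the percentage of G and C bases. Only the first line of the sequence is used here for illustration; running the full sequence through the same functions is one way to compare against the GC content field of this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGAAAGAGA AAAGTCATTA TAAAAATTTA TTATCGCACA AATCACTTTC AGAAGAATCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```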
 
Protein sequence
MKEKSHYKNL LSHKSLSEES IGLLEWDTLK TQIASFASTK MGKNVVLQFE IPTEYEISRR 
LLKETIEINE LEKNLDKSIS FSGVFDICKN IEICSKGGVI NSSDLLEIAE TISASRNLKK
ILLDFEQRPF ISAFLKGLID HNQIEKILKN GIESNGRISD RASQKLANLR QELLSKKSER
RVLVNKFIQN NLPYIQDTIV GDRYGRPVLA IKVQYAEKFK GIIHDSSASG NTIYLEPESI
VLKGNKIASM EARVAGEEFK LLKEWSHIIR DNNQSLLEMS NILLRAEFSL TRSRYSNWIG
GNAPIVENSP IVSLMGFSHP LLIWENKKKQ AAKPVSIDFH IDRNTKVVAI TGPNTGGKTV
ALKGLGIALL MARSGLFIPS IQKPIIPFCP NIFVDIGDDQ SLEGNLSTFS GHILRIKNIL
EALNNKKGFS VVLLDEIGSG TDPSEGTALA IALLKEFAIV SDITLATTHY GDIKALKYSD
NRFENVSVAF DEESFKPKYT LNWGIPGRSN ALSISKRIGI NEKILNNASN YLKPKEVENI
NNIIKGLEEE RLKQQKSAEE AAELIARTEI LHDEIKNKYE FQKLNALKIQ EAEKQKLSKH
IKEAQKEVIN LIKKLKDQNA TGEDARLIGI RLKEIETDHL TQSNVERTTS WSPKIGDFIK
IKSLNSSGQI IDIDEKAKSY EVKCGSFRST LSINDFEGLN GEKPKFKDSQ IQISSVREDF
SFSKIRTSKN TIDVRGMRVH EAEIIIEEKF KKFHGPLWIV HGIGTGKLKK GLRLWLSSLN
YVDKVEDAEN NEGGAGCSIA WIK