Gene P9301_02381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02381 
Symbol 
ID4911573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp221362 
End bp223773 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content31% 
IMG OID640159804 
ProductDNA mismatch repair protein MutS family protein 
Protein accessionYP_001090462 
Protein GI126695576 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAGA AAAGTTATTC AAAAAAATCA TCTTCAAATA ACACCTTAGA AGAAGAATCT 
ATAAGCCTTT TAGAGTGGGA CTCATTAAAA ACGCATTTAT CTTCATTCGC CCTAACGGAA
ATGGGTAAAA GAGCAATTTT AAGTTTTGAT ATCCCTTCAG AATACGAATC ATCTAAAAGA
CTTTTGAATG AAACTGTTGA AATAACTCAG CTAGAAAATA ATTTAGATAA ATCAATTAGT
TTTTCTGGTG TTTTTGATAT TAGTAGAAAT ATTGAAATAT GTTCTAAGGG AGGTGTAATT
TCATCTTCTG AATTGTTAGA AATAGCAAAA ACAATTGCTG CAGCAAGAAA TTTAAAAAAA
ATCTTATTAG ATTTTGAACA AAGGCCTTAT ATTTCATCAT TTACAAAAAA TTTAATTGAC
CATCAGAATA TCGAAACGAT TTTTAAAAAA GGCATTGAAT CAAATGGAAG GATTTCAGAC
AATGCTAGTA ATGAACTATC TATTCTTAGA AAAGAATTTT TATCTAAGAA ACTTGAAAGA
AAAATATTAG TTGAGAAATT TATTCAAAAG AACTTAGCTT ATTTGCAAGA TACTACTATT
GGAGATCGAT ATGGAAGGCC CGTTTTAGCA GTGAAAGTTA ATTATGTAGA TAAATTTAAA
GGCATAATTC ATGACTCTTC ATCTTCAGGA AATACAGTAT ATTTTGAGCC TGATAGTGTA
GTAAATAAAG GTAATAAGAT TGCTTCTTTA GAGGCTAGGA TCACAGCAGA AGAATTTAAA
TTACTTAAAA AATGGTCTCT GGTTGTTAGT GATAATTCAG AAAATCTTAT TGAAATGGCA
GCCATTTTAT TGAGATTAGA AAATGCCCTA ACTCGTTCAA GATATTCGAA ATGGATTGGA
GGCAAAACTC CTACGTTTGA GAAAAGTCCT ATTATTTCTT TAATTGGCTT TACTCATCCG
TTATTGATTT GGGAACATAA GAAAAAAGGA GCCTGTCCAC CAGTAGCTGT TGATTTTTAT
ATAAATAGAA ATATTAAGGT TGTAGCTATT ACAGGTCCAA ATACTGGAGG TAAAACAGCA
GCTTTAAAAG GCTTGGGCTT GTCTTTACTT ATGGCTAGAG CAGGATTATT GATACCTTCA
ACTAATAATC CTATTATCCC TTTTTGTCCA AATATATATG TGGATATAGG AGATAATCAA
TCATTAGAAG AAAATTTATC TACCTTCAGT GGGCATATAT CCCGCATAAA AGAGATATTA
GATTTACTTG ATCATAAGAA AGGATTATCA GTTGTTTTGC TAGATGAGAT TGGATCTGGT
ACAGATCCTC TTGAAGGTAG TGCTCTAGCG ATGGCTTTAT TAAAAGAATT TGCAAATAAA
TCTGATATCA CTTTTGCCAC TACACATTAT GGGGATATTA AGGCTTTAAA ATATAACGAT
TCAAGATTTG AAAACGTATC AGTTGCCTTT GATGAGGATT CTTTGAAGCC AAAATATATA
CTCAACTGGG GTATTCCTGG GAGAAGTAAT GCTTTGTCAA TTTCAAAGAG AATTGGTCTT
GATGAAAGCA TACTCAATGA AGCTGCAAAT TATCTAAAGC CAAAAGAAGT TGACAATATT
AACAGTATTA TTAAAGGACT TGAGGAAGAG AAGATTAAAC AACAAAATTC TGCAGAAGCT
GCTGCAGAAT TGATTGCAAG GACTGAAATA TTACATGATG AACTGAAGAG AAATTATGAA
TATCAAAAAT TAAATGCTGA AAAAATCCAG GAAATTGAAA GGTCTAAATT ATCAAAACAT
ATTGTATCCG CTAAAAAAGA GGTGATAGAT TTGATTAAAA AATTAAGAGA TAAAAATGTT
AATGGAGAGG ATACGAGAAT TATTGGAAAA AGATTAAAGG AAATTGAGAC GGAACATTTA
ACCCAAAAAA AATCTGAAAA GTCAATATCA TGGAACCCTC AGGTAGGTGA TTTTGTAAAG
ATTAAAAGTC TAAATAGTAC GGGACAAATT GTAGGTTTAG ATAAAAAAGG TGGTTTTTAT
GAAATTAAAT GTGGTTCATT TAGAAGCACA TTATCTGTAA ATGAATTTGA AGGTATTAAT
GGAGAAAAGC CTAATTTCAA AAGTTCAAAA ATTGAGATCA AGTCTACAAG AGAAGATTTT
TCTTTTTCTA AAATTAGAAC GAGTAAAAAT ACAATTGATG TAAGAGGGTT AAGAGTTCAT
GAAGCCGAAA TAATTATTGA GGAGAAAATT AGAAAATTTC ATGGACCGCT ATGGATTGTT
CATGGAATTG GCACAGGAAA ATTAAAAAAA GGACTAAGAA ATTGGTTATC AGGTTTAAAT
TATGTTGATA AGATTGAAGA TGCAGCCAAT AACGAGGGTG GCCCTGGTTG CAGTATTGCG
TGGATAAAAT AA
 
Protein sequence
MQEKSYSKKS SSNNTLEEES ISLLEWDSLK THLSSFALTE MGKRAILSFD IPSEYESSKR 
LLNETVEITQ LENNLDKSIS FSGVFDISRN IEICSKGGVI SSSELLEIAK TIAAARNLKK
ILLDFEQRPY ISSFTKNLID HQNIETIFKK GIESNGRISD NASNELSILR KEFLSKKLER
KILVEKFIQK NLAYLQDTTI GDRYGRPVLA VKVNYVDKFK GIIHDSSSSG NTVYFEPDSV
VNKGNKIASL EARITAEEFK LLKKWSLVVS DNSENLIEMA AILLRLENAL TRSRYSKWIG
GKTPTFEKSP IISLIGFTHP LLIWEHKKKG ACPPVAVDFY INRNIKVVAI TGPNTGGKTA
ALKGLGLSLL MARAGLLIPS TNNPIIPFCP NIYVDIGDNQ SLEENLSTFS GHISRIKEIL
DLLDHKKGLS VVLLDEIGSG TDPLEGSALA MALLKEFANK SDITFATTHY GDIKALKYND
SRFENVSVAF DEDSLKPKYI LNWGIPGRSN ALSISKRIGL DESILNEAAN YLKPKEVDNI
NSIIKGLEEE KIKQQNSAEA AAELIARTEI LHDELKRNYE YQKLNAEKIQ EIERSKLSKH
IVSAKKEVID LIKKLRDKNV NGEDTRIIGK RLKEIETEHL TQKKSEKSIS WNPQVGDFVK
IKSLNSTGQI VGLDKKGGFY EIKCGSFRST LSVNEFEGIN GEKPNFKSSK IEIKSTREDF
SFSKIRTSKN TIDVRGLRVH EAEIIIEEKI RKFHGPLWIV HGIGTGKLKK GLRNWLSGLN
YVDKIEDAAN NEGGPGCSIA WIK