Gene A9601_02371 details

Gene Information

Locus tag: A9601_02371
Symbol:
ID: 4716921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 220393
End bp: 222804
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 32%
IMG OID: 640077936
Product: DNA mismatch repair protein MutS family protein
Protein accession: YP_001008632
Protein GI: 123967774
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
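
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 220393
end_bp = 222804

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not code for an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2412, matching "Gene Length"
print(gene_length % 3)  # 0, so the CDS is a whole number of codons
print(protein_length)   # 803, matching "Protein Length"
```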


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAGA AAAGTTATTC TAAAAAATCA TATTCAGATA ACACCTTAGA AGAAGAATCT 
ATAAACCTTT TAGAGTGGGA TTCATTAAAA ACGCATTTAT CTTCGTTCGC CTCAACGGAA
ATGGGTAAAC AATCAATTTT AAGTTTTGTT ATACCTTCAG AATACGAGGC ATCTAAAAGA
CTTTTGAATG AAACTGTTGA AATTAATGAG CTAGAAAAAA ATTTAGATAA ATCAATTAGT
TTTTCTGGTG TTTTTGATAT TAGTAGAAAT ATAGAAATTT GTTCGAAGGG AGGTGTAATT
ACATCTTCTG AGTTGTTAGA AATAGCGAAA ACAATTGCTG CAGCAAGAAA TTTAAAAAAA
ATCTTATTAG ATTTTGAACA AAGACCTTAT ATTTCATCAT TCACAAAAAA TTTAATTGAC
CATCAGAATA TCGAAACGAT TTTTAAAAAA GGCATTGAAT CGAATGGAAG GATTTCAGAC
AATGCTAGTA ATGAGTTATC TATTCTTAGA AAAGAATTTT TATCTAAGAA ACTCGAAAGA
AAAATATTAG TTGAGAAATT TATTCAAAAG AATTTAGCTT ATTTGCAAGA TACTACTATT
GGAGATCGGT ATGGAAGGCC TGTTTTAGCA GTGAAAGTTA ATTATGTAGA TAAATTTAAG
GGAATAATTC ATGACTCTTC ATCTTCAGGA AATACAGTAT ATTTCGAGCC TGAAAGTGTA
GTAACTAAAG GTAATAAGAT TGCTTCTTTA GAGGCTAGGA TCACAGCAGA AGAATTTAAA
TTACTTAAGA AATGGTCTCA TGTTGTTAGT GATAATTCAA AAAATCTTAT TGAAATGGCG
TCCATTTTAT TAAGATTAGA AAATGCCCTA ACTCGTTCAA GATATTCGAA ATGGATTGGA
GGTAAAACTC CTACATTTGA GAAAAATCCT ATTATTTCTT TAATTGGTTT TTCTCATCCG
TTATTGATTT GGGAACATAA GAAAAAAGGA GCCCCTCCAC CAGTAGCTGT CGATTTTTAT
ATAAATCGAA ATATTAAGGT TGTAGCTATT ACAGGCCCAA ATACTGGAGG TAAAACAGCA
GCTTTAAAAG GTTTGGGCTT GTCTTTACTT ATGGCTAGAG CAGGATTATT GATACCTTCA
ACTAATAATC CTATTATCCC TTTCTGTCCA AATATATATG TGGATATAGG AGATAATCAA
TCATTAGAAG AAAATTTATC TACCTTCAGT GGGCATATAT CCCGCATAAA AGGGATATTA
GATTCACTTG ATTATAAGAA AGGATTATCA GTTGTTTTGT TAGATGAGAT TGGATCTGGT
ACAGATCCTC TTGAAGGAAG CGCTCTTGCG ATGGCTTTAT TAAAAGAATT TGCAAATAAA
TCTGATATCA CTTTGGCAAC TACACATTAT GGAGATATTA AGGCTTTAAA ATATAACGAC
TCGAGATTTG AAAACGTATC AGTTGCCTTT GATGAGGAAT CTTTGAAGCC AAAATATATA
CTCAACTGGG GTATTCCTGG GAGAAGTAAT GCTTTGTCAA TTTCAAAGAG AATTGGTCTC
GATGAAAGCA TACTCAATGA AGCTGCAAAT TATCTAAAGC CAAAAGAAGT TGACAACATT
AACAGTATTA TTAAAGGACT TGAGGAAGAG AGGATTAAAC AACAAAATTC TGCAGAAGCT
GCTGCAGAAT TGATTGCAAG GACTGAAATA CTACATGATG AACTGAAGAG AAATTATGAA
CATCAAAAAA TAAATGCTCA AAAAATTCAA GAACTTGAAA GGTCTAAATT ATCAAAACAT
ATCATATCCG CTAAAAAAGA GGTGATAGAT TTGATTAAAA AATTAAGAGA TAAAAATGTT
AATGGAGAGG ATACGAGAAT TATTGGAAAA AGATTAAAGG AAATTGAGAC GGAACATTTA
ACCCAAAAAA AATCTGAAAA GTCAATATCA TGGAACCCTC AGGTAGGCGA TTTTGTAAAG
ATTAAAAGTC TAAATAGTAC GGGACAAATT GTAGATTTAG ATAAAAAAGG TGGTTTTTAC
GAAGTTAAAT GTGGTTCATT CAGAAGCACA TTATCTGCAA ATGACTTTGA AGGTATTAAT
GGAGAAAAGC CTAATTTCAA AATGTCAAAA ATTGAGATCA AGGCTACAAG AGAGGATTTT
TCTTTTTCTA AAATTAGAAC AAGTAAAAAT ACAATTGACG TAAGAGGATT AAGAGTGCAT
GAAGCCGAAA TAATTATTGA GGAGAAAATT AGAAGATTTC ATGGACCACT ATGGATTGTT
CATGGAATTG GCACAGGAAA ATTAAAGAAA GGACTAAGAA ATTGGTTATC AGGTTTAAAT
TATGTTGATA AGATTGAAGA TGCAGCCAAC AACGAGGGTG GCCCTGGTTG CAGTATTGCG
TGGATAAAAT AA
 
Protein sequence
MQEKSYSKKS YSDNTLEEES INLLEWDSLK THLSSFASTE MGKQSILSFV IPSEYEASKR 
LLNETVEINE LEKNLDKSIS FSGVFDISRN IEICSKGGVI TSSELLEIAK TIAAARNLKK
ILLDFEQRPY ISSFTKNLID HQNIETIFKK GIESNGRISD NASNELSILR KEFLSKKLER
KILVEKFIQK NLAYLQDTTI GDRYGRPVLA VKVNYVDKFK GIIHDSSSSG NTVYFEPESV
VTKGNKIASL EARITAEEFK LLKKWSHVVS DNSKNLIEMA SILLRLENAL TRSRYSKWIG
GKTPTFEKNP IISLIGFSHP LLIWEHKKKG APPPVAVDFY INRNIKVVAI TGPNTGGKTA
ALKGLGLSLL MARAGLLIPS TNNPIIPFCP NIYVDIGDNQ SLEENLSTFS GHISRIKGIL
DSLDYKKGLS VVLLDEIGSG TDPLEGSALA MALLKEFANK SDITLATTHY GDIKALKYND
SRFENVSVAF DEESLKPKYI LNWGIPGRSN ALSISKRIGL DESILNEAAN YLKPKEVDNI
NSIIKGLEEE RIKQQNSAEA AAELIARTEI LHDELKRNYE HQKINAQKIQ ELERSKLSKH
IISAKKEVID LIKKLRDKNV NGEDTRIIGK RLKEIETEHL TQKKSEKSIS WNPQVGDFVK
IKSLNSTGQI VDLDKKGGFY EVKCGSFRST LSANDFEGIN GEKPNFKMSK IEIKATREDF
SFSKIRTSKN TIDVRGLRVH EAEIIIEEKI RRFHGPLWIV HGIGTGKLKK GLRNWLSGLN
YVDKIEDAAN NEGGPGCSIA WIK
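
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last few codons. The following is a minimal sketch, not the database's own method. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as the standard genetic code, which holds for this gene because it starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
# Assumption: table 11 uses these same assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
# The last 12 bases are in frame because 2412 - 12 is divisible by 3.
gene_start = "ATGCAAGAGA AAAGTTATTC TAAAAAATCA"
gene_end = "TGGATAAAAT AA"

print(translate(gene_start))  # MQEKSYSKKS
print(translate(gene_end))    # WIK
```

Both outputs agree with the first ten and last three residues of the protein sequence listed in this record.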