Gene PMN2A_1243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1243 
Symbol 
ID3606637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1744289 
End bp1747069 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content33% 
IMG OID637688119 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_292436 
Protein GI72383081 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.32952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCCA GCCAAAACCC TATACAAGGC AGTCTATTTG GGGGGAATGA GGAAAGCGAT 
CTCAATAAAG CTGAAAAGCT GAAAGGTTCA GAAAGATCGA ACGTAAATCT GTCACATCAA
CAACTAAAAG AAGACGCATC GCTTAGACCT CGCATAAAAC AGACTCCTAA AAATCCAAAT
CAGATAATTG ATTTGGATGA GTTGTCTCGT CTGGCAATTG AGGAACCCAA ATGGTCACAT
CACAATTTAC CAAAAATTGA TGATCTCACT CCTGCACTGA GACATTATGT CGAATTAAAA
AAAGAGAATC CTGACCGAGT TTTGCTTTAT AGGCTTGGAG ACTTCTTCGA ATGTTTTTTT
GAAGATGCAA TAACACTCTC AAAACTTTTA GAAATCACAC TCACAAGTAA AGAAGCTGGA
AAAAAAATTG GCAAAATTCC AATGGCTGGA ATTCCTCATC ATGCATCTGA TCGTTATTGC
ACAGAACTAA TTAAAAAAGG TTTATCTATT GCTATTTGTG ATCAACTTGA AGCTGCTCCA
ACAAAAGGCA ATAAATTAAT AAAAAGAGGC ATAACAAGAT TAATAACCCC TGGAACAATT
CTAGAAGAGG GAATGTTAAG TGCTAAAAAA AATAATTGGC TAGCTTCTGT TCTTCTAGAA
GCCAAATCTA ATCAAGAAAT AATTGACTGG TCTTTAGCAA AAATAGATGT AAGCACCGGT
GAATTTATTG TTCAAGAAGG TCAAGGAAGT AATAATTTAC GACACGAATT AATTAAATTG
AATGCAGCTG AAGTTATTTC AGAGAAAAAG TCAATCTCAA ATAAAATCTG GTATGAAGGA
TTAATAGAAA TAACAGAATT TAATAGAACA TCATTCTCTA ATTTAGAGGC AATAACAACT
ATTAAAAATC ATTATTGTCT AAATAATATT GATAGCCTAG GGATACATTC TAACTCACTA
TCAATTAGGA CTATTGGAGG ATTAATTTCA TATTTAAATA AAACGCACCC AAATATAGAT
GATAAATCTA ACAATGAAAT AAAAACAAAT ATCTGTATTG ACTACCCTCG AATAAAAAAT
AGTCGATCAG GATTAATTAT AGATAGTCAA ACAAGAAGAA ATTTAGAAAT TACCTCAACT
CAGAAGGATG GAAAATTTCA AGGCTCATTA CTTTGGGCAA TTGATAAAAC ATTAACTGCA
ATGGGTGCGA GATGCATTAG GAGGTGGATA GAAGAACCTA CAAAAGATGT TAATGCAATT
AAAAATAGAC AGAATATAAT TGGATTTCTA GTCAAATCAT CGACATTAAG AAAAAATATT
AGAAAAACCC TTAGAGCCAT GGGTGACTTA GAACGTCTTT CAGGTAGAGC AGGAGCTCAA
CAAGCGGGAG CACGTGATTT AGTTGCTATC GCAGAAGGAA TTAACCGTTT ACCCTTAATT
AAAAATTATC TAAATGAGCC CATATTTGAT AAAACTAAGT ATTTTGACTC TATTATAAAT
ATAGATAAAG ACTTAATAGA ACTTGCATCA AAAATCAATA ATCAAATCAT AGACAATCCA
CCACTTAGCC TTACAGAAGG GGGTCTATTT TATGATGGTA TAAACCCTGT ACTTGATGGA
CTAAGAAACC AACTAGATGA TCATAATATA TGGCTCAACT CTCAGGAATT AGAAGAGAGA
AAGAAAAGCA ATATAAATAA TTTAAAGCTT CAATATCATC GATCTTTTGG ATACTTTTTA
GCTGTTAGCA AATCAAAGGC TATAAATGTT CCAGATCACT GGATCAGAAG ACAAACCTTA
ACTAATGAAG AACGTTTTGT GACACCAGGA TTAAAAGAAA GAGAAGGAAA AATCTTTCAA
GTTAGAGCAA GAATATCAGA ACTAGAATAT GAACTTTTTT GTGATTTAAG AAAACTTACA
GGGAGTAAAT CAAACATTAT TAGACAAGCT GCAAAAGCAA TATCTCACTT AGATGTTTTA
ACTGGACTAG CCGAACTAGC CGCTAATCAC AACTATATTC AACCTCAGAT AATAGATATG
AATGAGCAAG ATAAATCAAG AAAATTATCT ATTATTGATG GCCGTCATCC TGTAGTTGAA
CAAATTTTAG TTGATAAAGT TTTTGTACCT AATGATATAG AACTTGGCTC TAAGACCGAT
CTAATTATTC TTTCAGGGCC AAATGCTAGT GGAAAAAGTT GTTATTTAAG ACAAGTAGGT
CTCTTGCAAA TCATGGCTCA AATTGGAAGT TGGATCCCAG CTAAATCAGC ACATATGGGA
ATAGCTGATC AAGTATTTAC ACGTGTTGGA GCAGTGGATG ATTTAGCTTC AGGCCAATCA
ACTTTCATGG TCGAAATGAT TGAAACTGCC TTCATTCTTA ATAATGCTAC TGAAAACTCA
TTAGTTTTAT TAGATGAAAT TGGAAGAGGA ACTTCAACTT TTGATGGGCT ATCTATTGCC
TGGTCAGTAA GCGAGTTTTT AGCAAAAAAA ATTAAAAGTC GTTCAATCTT TGCAACTCAT
TACCATGAAT TGAATCAAAT TTCTGAATAT ATTGAAAATG TCGAGAATTA CAAAGTTGTA
GTTGAATATA AAAATCATTC CCTTTCATTC CTTCACAAGG TCGAAAGAGG AGGAGCAAAT
AAAAGTTATG GAATTGAAGC TGCGAGGCTT GCGGGAGTCC CCCCAGACGT AGTCAATAAT
GCAAGATTGA TATTAAAAAA TCTAGAAAAA AATAACTCCA ACACCATTCA AATCACTAAG
CCAATTGAAA GTTGCAAATA A
 
Protein sequence
MAASQNPIQG SLFGGNEESD LNKAEKLKGS ERSNVNLSHQ QLKEDASLRP RIKQTPKNPN 
QIIDLDELSR LAIEEPKWSH HNLPKIDDLT PALRHYVELK KENPDRVLLY RLGDFFECFF
EDAITLSKLL EITLTSKEAG KKIGKIPMAG IPHHASDRYC TELIKKGLSI AICDQLEAAP
TKGNKLIKRG ITRLITPGTI LEEGMLSAKK NNWLASVLLE AKSNQEIIDW SLAKIDVSTG
EFIVQEGQGS NNLRHELIKL NAAEVISEKK SISNKIWYEG LIEITEFNRT SFSNLEAITT
IKNHYCLNNI DSLGIHSNSL SIRTIGGLIS YLNKTHPNID DKSNNEIKTN ICIDYPRIKN
SRSGLIIDSQ TRRNLEITST QKDGKFQGSL LWAIDKTLTA MGARCIRRWI EEPTKDVNAI
KNRQNIIGFL VKSSTLRKNI RKTLRAMGDL ERLSGRAGAQ QAGARDLVAI AEGINRLPLI
KNYLNEPIFD KTKYFDSIIN IDKDLIELAS KINNQIIDNP PLSLTEGGLF YDGINPVLDG
LRNQLDDHNI WLNSQELEER KKSNINNLKL QYHRSFGYFL AVSKSKAINV PDHWIRRQTL
TNEERFVTPG LKEREGKIFQ VRARISELEY ELFCDLRKLT GSKSNIIRQA AKAISHLDVL
TGLAELAANH NYIQPQIIDM NEQDKSRKLS IIDGRHPVVE QILVDKVFVP NDIELGSKTD
LIILSGPNAS GKSCYLRQVG LLQIMAQIGS WIPAKSAHMG IADQVFTRVG AVDDLASGQS
TFMVEMIETA FILNNATENS LVLLDEIGRG TSTFDGLSIA WSVSEFLAKK IKSRSIFATH
YHELNQISEY IENVENYKVV VEYKNHSLSF LHKVERGGAN KSYGIEAARL AGVPPDVVNN
ARLILKNLEK NNSNTIQITK PIESCK