Gene PMN2A_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1585 
Symbol 
ID3606983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp255699 
End bp258113 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content38% 
IMG OID637688463 
ProductMutS 2 protein 
Protein accessionYP_292776 
Protein GI72383421 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTAA CAAAAAATCA TGATGATTCT AAAAAGACAC AAATAATATC AGAGTCTTTG 
GATTTGCTTG AATGGCCAAC TGTTTGTAGC CATTTGTCTA CATTCGCTCT TACTCAACAA
GGTCGTAAAA AATGTGAAAG CTTTGATTTG CCACGAAATC TATCTTTAAG CCAAGAGCTA
TTGTCCCAAA CATTAGAAAT TGGGTCATTA GATAGTTCTC TTAATGAAGG AATATCTTTT
GATGGTGTTC ATGATTTGGA AAATATACTT TTGATATGCT CCAAAGGAGG TATTGCTATT
GGTGAGGATT TATTAAAAGT AGCTGATACT TTAAGAGCTG CTAGAAAATT ACGAAAACTA
ATATTTGATC AAGTGATACG TCCACGACTT TCTGAATTAC TCAAAGATGT TGCAACTTTA
CCAGATTTGC AAAAACTCCT CGAATTTGGG CTTGATGAAG GTGGGCGAAT TGCAGATCGT
GCTAGCCCAA AGCTTTCTGA ATTACGACGT TATAGAAATT CCGTACGTCT TCAAAGAAAA
GATATTCTTC AAGATATCAT CCGGAAATAT GGTGGATTAC TTCAAGATAA TATTATTTCA
GAGAGGTATG GACGACCTGT TTTAGCTTTT AAGGCTGGGA CTTCTGATCA AATCAAAGGA
ATGGTTCATG ATAGTTCGGC CTCTGGGAAC ACGATATATG TCGAGCCCCA AGTTGTCATA
TCAATAGGAA ATCGTTTAGC TAAGATAGAT TCTGAAATCT CAGATGAAGA GAGGAGACTT
TTAGCTGATT GGAGTAAAGA GGTTGGTCTT AATGCAATTG TAATAGCTCA TTTAGTAGAG
ATCCTTTTGC AAATTGAGTT TGCATTGTCT CGAGCACGTT ATTCTAAATG GCTTAATGGG
GTCCCTGCAA TTCTAGATCA AGAAGAACAT TCACCCTTTG AGATCAAAGA TTTTCGTCAT
CCTTTATTAG TATGGAATGA CTTCTATGAG AAAAAGAATA CAGTAGTTCC AACTAGTTTT
GATGTCGCCC CTGATTTAAA AGTTGTTGCG ATTACAGGCC CTAATACTGG AGGGAAAACA
GTTGCTTTGA AAAGTATTGG TTTAGCAGTT TTAATGGCAA AAGCGGGGTT GCTTTTGCCA
TGTACAGGCT CACCAAGATT ACCATGGTGT AAAAATGTTT TCGCTGATAT TGGTGATGAG
CAATCTTTAC AGCAAAATTT GTCTACATTT AGCGGACATA TTCTTCGTAT AAGTCGAATA
CTCGACGCTA TAGATGTGTT CCCTGGTACG ACTCTCGTTC TTTTAGATGA AGTTGGAGCT
GGAACTGATC CAACTGAAGG CACAGCATTG GCCATGGCAC TCCTACAGGT AATGGCTAAT
AGAGCAAGAT TAACTATCGC GACTACTCAT TTTGGACAAT TAAAAGCGCT CAAATATAGT
GATTCAAGAT TTGAAAATGC TTCAGTTTCT TTTGATAGTG AAACTATACA ACCAACTTTT
CATTTGCAAT GGGGAATTCC TGGTCGAAGT AATGCAATTG AAATTTCAAA GAGACTTGGT
CTCGATGAGC AAGTAATTAT AAGTGCTCAA AAATTTATCA ATCCTGAAAG GGTTGATAAT
GTTAATCAAG TTATTCAAGG CTTAGAAAAA CAACGCGAGC GTCAGCAATC AGCAGCTGAA
GATGCTGCTG CATTATTGGC TAAAACCGAA TTACTACATG AGGAATTACT TAATAGTTGG
CAGAAACAAC GTCAACAATC GGAAGAGTTT AATGAACAAG GAAGGTTTAA ATTGGAGTCA
TCAATTCGTG AAGGTCAAAA AGAAGTTAGA CATTTAATTA AACGCTTGCG CGATCAAAAC
GCTAGTGGTG AGACAGCAAG AATTGCCGGT CAACGATTAC GGCAAATAGA AAAGGGATAT
CGAAACGACA AGCGAATTAA CCACACACAG AGTTGGACAC CAAAGATTGG GGAAAAAGTT
AGATTGTCTT CTATTGGTAA AGCAGGTGAA ATAATTTCTT TTTCAGATGA TGGAATGCAA
TTAACAGTGC TATGCGGCGT ATTTCGAAGC AAAGTCAATT TAACCGAAGT TGAAAGTCTT
GATGGTCAAA AGGTCGAAAT AAACCAAAGT GTGCAAGTAA AAACTTCGCA GGTAAGAAAG
AATTTATCTT TAGTAAGAAC TAAAAAAAAC ACCTTAGATG TAAGAGGGTT ACGCGTTCAT
GAAGCCGAGG GGGTAATTGA AGAAAAATTG AGAAATTGTT CCGGAGCTTT ATGGGTTATT
CATGGAATTG GTTCTGGAAA ACTGAAAAAA GGTTTGAGGA AATGGTTTGA TTCACTTCCA
TATATTGAAA AAGTAGCCGA TGCTGAACCT CATGATGGCG GCCCTGGATG TAGCGTTGTG
TGGATGGTTG ATTGA
 
Protein sequence
MGLTKNHDDS KKTQIISESL DLLEWPTVCS HLSTFALTQQ GRKKCESFDL PRNLSLSQEL 
LSQTLEIGSL DSSLNEGISF DGVHDLENIL LICSKGGIAI GEDLLKVADT LRAARKLRKL
IFDQVIRPRL SELLKDVATL PDLQKLLEFG LDEGGRIADR ASPKLSELRR YRNSVRLQRK
DILQDIIRKY GGLLQDNIIS ERYGRPVLAF KAGTSDQIKG MVHDSSASGN TIYVEPQVVI
SIGNRLAKID SEISDEERRL LADWSKEVGL NAIVIAHLVE ILLQIEFALS RARYSKWLNG
VPAILDQEEH SPFEIKDFRH PLLVWNDFYE KKNTVVPTSF DVAPDLKVVA ITGPNTGGKT
VALKSIGLAV LMAKAGLLLP CTGSPRLPWC KNVFADIGDE QSLQQNLSTF SGHILRISRI
LDAIDVFPGT TLVLLDEVGA GTDPTEGTAL AMALLQVMAN RARLTIATTH FGQLKALKYS
DSRFENASVS FDSETIQPTF HLQWGIPGRS NAIEISKRLG LDEQVIISAQ KFINPERVDN
VNQVIQGLEK QRERQQSAAE DAAALLAKTE LLHEELLNSW QKQRQQSEEF NEQGRFKLES
SIREGQKEVR HLIKRLRDQN ASGETARIAG QRLRQIEKGY RNDKRINHTQ SWTPKIGEKV
RLSSIGKAGE IISFSDDGMQ LTVLCGVFRS KVNLTEVESL DGQKVEINQS VQVKTSQVRK
NLSLVRTKKN TLDVRGLRVH EAEGVIEEKL RNCSGALWVI HGIGSGKLKK GLRKWFDSLP
YIEKVADAEP HDGGPGCSVV WMVD