Gene Information |
Locus tag | P9211_03641 |
Symbol | mutM |
ID | 5730109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 347587 |
End bp | 348447 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641284718 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_001550249 |
Protein GI | 159902905 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
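The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus P9211_03641 (mutM).
length_bp = gene_length(347587, 348447)
print(length_bp)                  # 861
print(protein_length(length_bp))  # 286
```

Both results agree with the Gene Length (861 bp) and Protein Length (286 aa) rows of the record.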
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.418667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGCCTGAGC TCCCTGAAGT AGAGACAGTC CGCAAAGGCT TAGAGAAGCG GCTGAAGAAT TTTTATATAG ATAATGTTGA AGTTTTATCA GAGAGATCTA TTGCCAGTAA TGGTGGTAGC AATGTGTTTA TATTTAATTT AAAAGATCTA GTTTTTGGCA GATGGAGCAG AAGAGGAAAG TACTTAATAG CTTCTCTATG CAAAGAAAGT GATCTCATTG AAGAGATCCC TTCAGGCACA CTAGTAGTCC ATTTAAGAAT GACAGGATAT TTCGAATGGC ATCAAAATAC CAAGGCTCCT TGCACTCATA CAAGAGTTCG TTTTTGGAAT AAAAAAGGTT CCGAAATCCG CTTTATAGAT ATTCGAAATT TTGGGCAAAT GTGGTGGATA CCTCCTAACA AGCTTCCAAG CGAAGTTATC AACGGATTAA AAAATTTAGG ACCAGAGCCA TTCAGCAAAG ATTTCAACCC CGAGTATCTA AAATATTGCT TAAAAGGAAG GAAGAGGTCT ATTAAATCAT CCCTATTAGA TCAATCTATA CTTGCAGGAG TAGGAAACAT ATATGCAGAT GAAAGCCTTT TCGAAGCAGG AATAACCCCT ATAAAAGCAT CTGGTGACCT CAATGGTTGT GAGCTGAAAA AGCTTTGCAA AAGCCTTACT AGAATTCTAA AAGCTAGCAT TGGGAAAGGT GGAACAACTT TCTCAGACTT TAGAGACCTA GAAGGGCTTA ATGGTACTTA TGGTGGTTAT GCCTGGGTCT ATCGACGTAA TCAAAAGCCT TGCAGAAAAT GTGGAACATT AATCGAAAAA ACTAAGGTAG CTGGCAGAAG CACTCATTGG TGTCCGAACT GTCAAAACTA G
Protein sequence | MPELPEVETV RKGLEKRLKN FYIDNVEVLS ERSIASNGGS NVFIFNLKDL VFGRWSRRGK YLIASLCKES DLIEEIPSGT LVVHLRMTGY FEWHQNTKAP CTHTRVRFWN KKGSEIRFID IRNFGQMWWI PPNKLPSEVI NGLKNLGPEP FSKDFNPEYL KYCLKGRKRS IKSSLLDQSI LAGVGNIYAD ESLFEAGITP IKASGDLNGC ELKKLCKSLT RILKASIGKG GTTFSDFRDL EGLNGTYGGY AWVYRRNQKP CRKCGTLIEK TKVAGRSTHW CPNCQN