Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_17541 |
Symbol | mutY |
ID | 5730628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1578725 |
End bp | 1579924 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641286139 |
Product | A/G-specific DNA glycosylase |
Protein accession | YP_001551639 |
Protein GI | 159904295 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR00586] mutator mutT protein [TIGR01084] A/G-specific adenine glycosylase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTG GCTTTAGTCA AAATGATTTA GCAAATGGGG TTTTGGAAAA TCCTCAGAAG ATAGACGCTC TTAGGTCAAC TCTTCTTCAA TGGTTTAAAT CAAATGGACG TCATTATATC CCTTGGAAAT TAACTAAGGA TGGAACTTTA CCAAACGAGA ATCAATATCT TGCTGTATAT CCAATTTTGG TTGCTGAAGT AATGCTGCAA CAGACTCAGT TGAAAGTTGT TTTACCATAT TGGGAAAAGT GGATGTTAGC TCTGCCAACT CTTGTTGATT TGGCAAAGGC TGAAGAGGAT AAAGTCCTTT TACTTTGGCA AGGACTTGGT TATTACTCCA GAGCAAGAAG ATTACATGTT ACTTCTAGGA TTCTTTTGAA TTTAATTGGC ATACCTAACT CTTTAAATCC AGCTAATTGG CCTAAGGATT TAGAAAGCTG GATGAATTTA CCTGGCATAG GGCGTAATAC GGCTGGCAGT ATCATTTCTT CAGCATTTAA CCTCCCTAGC CCTTTACTAG ATGGGAATGT TAAACGGGTT TTAACTAGAT TAATTGGTAG TACTAAAACT CCAAACAAAG ATTTGGCAAG GCTTTGGAAA TTGAGTGATT TGTTACTAGA TAAAAATCTT CCTAGAACAT TTAATCAAGC TTTAATGGAT CTAGGTGCGA CAATTTGTAC CAAGTACAAT CCTATTTGTA CCAATTGCCC ATGGCAAAAT TATTGTTCTG CTTATAATTC AGGAAATCCT GAAAATCTAC CAGTAAAAGG TCAAAAATTG ATTCTTTCAA AAGCTGTTAT AGGCGTTGGC TTGATTCTTA ATAAAAATCA GGATGTTTTG ATTGATCAGA GACTTGATGA AGGAAGTATG GGAGGAATGT GGGAATTTCC CGGAGGTAAA AAGGAAAAAG ATGAATCAAT AGAAATGACT ATTGCAAGAG AACTACGCGA AGAATTAGGC GTTGAGGTAA AGGTAGGGAA AAAGCTTATT GAATTTGATC ATTCCTATAC CCATAAGAAA TTACATTTTA TAGTTCATTT GTGTGAATTA ATTTCTGGGA AACCAAAACC TTTATCAAGC CAGGAAGTCC GATGGGTAAA ATTAAGTGAT CTTCAAAATT ATCCGTTTCC TAAAGCTAAC TCATATATGA TTTCTGCTCT TAAAGAATAT TTTCTTATAT CCAAGACAAA GATGAAGTAA
|
Protein sequence | MAVGFSQNDL ANGVLENPQK IDALRSTLLQ WFKSNGRHYI PWKLTKDGTL PNENQYLAVY PILVAEVMLQ QTQLKVVLPY WEKWMLALPT LVDLAKAEED KVLLLWQGLG YYSRARRLHV TSRILLNLIG IPNSLNPANW PKDLESWMNL PGIGRNTAGS IISSAFNLPS PLLDGNVKRV LTRLIGSTKT PNKDLARLWK LSDLLLDKNL PRTFNQALMD LGATICTKYN PICTNCPWQN YCSAYNSGNP ENLPVKGQKL ILSKAVIGVG LILNKNQDVL IDQRLDEGSM GGMWEFPGGK KEKDESIEMT IARELREELG VEVKVGKKLI EFDHSYTHKK LHFIVHLCEL ISGKPKPLSS QEVRWVKLSD LQNYPFPKAN SYMISALKEY FLISKTKMK
|
| |