Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20801 |
Symbol | mutY |
ID | 4779386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1722891 |
End bp | 1724045 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640085376 |
Product | adenine glycosylase |
Protein accession | YP_001015900 |
Protein GI | 124026785 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR00586] mutator mutT protein [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATTT TTGATTCTCC TCAAGATATC CAAAATTCAC TTCTGGCATG GTTTAGGGAA AATGGTAGAT ACTGGATACC ATGGAAATTA AAGAAAGATG GTTCCGTTCC TCGATCAGGA GAAAGTATAT CTCCTTATGG AATTTGGATT GCAGAGGTCA TGCTTCAGCA GACACAGTTG AAGGTCGTTA TTCCTTATTG GAAAAAATGG ATGAAGTTTT TTCCTACCTT GTCGTCTTTA GCAGAGGCTG ATTTGGAGAA TCTTCTAATG ATTTGGCAAG GTCTTGGTTA TTATTCACGT GCGAAACGAA TACATCAATC CTCTAAAATA TTAGTTGAAT TTGTTGGCAA AAATAGAGAT CAAGATCCAG ATTCTTGGCC TAATCAAATA GATAAGTGGA TGTCTCTTCC TGGTATCGGT AGAAGTACTG CAGGTAGTAT TATCTCATCT GCCTTTGACC TGCCAACCCC AATATTGGAT GGGAATGTAA AAAGAATTTT GTCAAGATTG CTAGCCATTG AACGAAAATC TATTAAAGAT GAAAGAAAAT TATGGGAATT CAGCTCGTTA TTGATTGAAA GGCTAAGTCC AAGGGATTTC AATCAGGCTT TGATGGATTT AGGGGCAATT ATTTGTACTC CCACAAAACC AAGTTGTTCT TCGTGCCCAC TACAAAATTT TTGTGTTGCT TATACAAAGT ACGATCCTGA AGATTTTCCT AAAAAAGAAA TGACCAAAAT AAAGCCTCTG CAAGAAATTG GAATTGGGCT TGTTTTTAAT CAAAAAGGTG AATTGCTTAT CGATCAGCGA TTAGAAAGCT CAAGTATGGG CGGAATGTGG GAATTTCCAG GAGGAAAAAA AATTCCCAAC GAATCGATTG AGACAACTAT CGAGCGAGAA TTAAAAGAAG AGCTTGGGAT TGTTGTCAAC GTTGGAGAAA AGCTTTTATC TTTTGAACAC GCTTATACCC ACAAGAGGCT GAATTTTACT GTTCATATTT GCGCATGGAT ATCAGGTCAG CCCAAACCTT TAGCTAGTCA AAAATTACTT TGGGTGTCTC CGGACAAACT TTTTGATTTT CCTTTTCCTG CTGCTAACAC TAAAATTATT TCTGAATTAC ATAAACATCT TTGTATTGGA AATAAAAATC TGTAA
|
Protein sequence | MGIFDSPQDI QNSLLAWFRE NGRYWIPWKL KKDGSVPRSG ESISPYGIWI AEVMLQQTQL KVVIPYWKKW MKFFPTLSSL AEADLENLLM IWQGLGYYSR AKRIHQSSKI LVEFVGKNRD QDPDSWPNQI DKWMSLPGIG RSTAGSIISS AFDLPTPILD GNVKRILSRL LAIERKSIKD ERKLWEFSSL LIERLSPRDF NQALMDLGAI ICTPTKPSCS SCPLQNFCVA YTKYDPEDFP KKEMTKIKPL QEIGIGLVFN QKGELLIDQR LESSSMGGMW EFPGGKKIPN ESIETTIERE LKEELGIVVN VGEKLLSFEH AYTHKRLNFT VHICAWISGQ PKPLASQKLL WVSPDKLFDF PFPAANTKII SELHKHLCIG NKNL
|
| |