Gene Information |
Locus tag | EcHS_A3122 |
Symbol | mutY |
ID | 5593178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3131901 |
End bp | 3132983 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640922241 |
Product | adenine DNA glycosylase |
Protein accession | YP_001459741 |
Protein GI | 157162423 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
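The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the listed values imply) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3131901, 3132983)
print(length)                  # 1083
print(protein_length(length))  # 360
```

Both results agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields of this record.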
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCCCCAA CAACAGTGAA TTCGGTGACC ATGCAAGCGT CGCAATTTTC AGCCCAGGTT CTGGACTGGT ACGATAAATA CGGGCGAAAA ACGCTGCCCT GGCAAATTGA CAAGACGCCC TACAAAGTAT GGCTCTCAGA AGTGATGTTG CAACAAACTC AGGTTGCGAC CGTTATCCCC TATTTTGAAC GCTTTATGGC GCGCTTCCCG ACGGTGACCG ATCTCGCCAA TGCGCCGCTC GACGAAGTTC TCCACTTGTG GACCGGGCTT GGCTATTACG CCCGCGCGCG CAATCTGCAT AAAGCGGCAC AACAAGTGGC GACCTTACAC GGCGGTAAAT TCCCGGAAAC CTTTGAAGAA GTCGCGGCGT TACCGGGCGT CGGGCGTTCC ACCGCAGGCG CGATTCTCTC GCTTTCTCTG GGTAAGCACT TTCCGATTCT CGACGGTAAC GTCAAACGCG TGCTGGCGCG CTGCTATGCT GTAAGCGGCT GGCCTGGGAA AAAAGAGGTC GAGAATAAAT TATGGAGTTT GAGCGAGCAG GTGACGCCCG CGGTTGGCGT GGAACGGTTT AATCAGGCGA TGATGGATTT GGGTGCGATG ATTTGTACGC GCTCGAAACC GAAATGTTCG CTCTGTCCGC TACAAAACGG ATGTATTGCC GCCGCCAACA ATAGCTGGGC GCTTTATCCG GGCAAAAAAC CGAAACAGAC GCTGCCGGAG CGCACCGGCT ACTTTTTGCT ATTACAGCAC GAAGATGAAG TATTGCTGGC GCAGCGTCCG CCGAGCGGAT TGTGGGGCGG TTTATACTGT TTCCCGCAGT TTGCCGACGA AGAAAGTTTG CGGCAGTGGC TGGCGCAACG GCAGATTGCT GCCGATAACC TGACGCAACT GACCGCGTTT CGGCATACCT TCAGCCATTT CCACTTAGAT ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA TTCACCGGCT GCATGGATGA AGGCAATGCG CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG TTACAGCAGT TACGCACTGG CGCGCCGGTT TAG
|
Protein sequence | MPPTTVNSVT MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP YFERFMARFP TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGKFPETFEE VAALPGVGRS TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ VTPAVGVERF NQAMMDLGAM ICTRSKPKCS LCPLQNGCIA AANNSWALYP GKKPKQTLPE RTGYFLLLQH EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIA ADNLTQLTAF RHTFSHFHLD IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
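The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This follows from translation table 11 (bacterial, archaeal and plant plastid code), where TTG is an accepted start codon and an initiating codon is read as methionine. The sketch below is a minimal illustration, not the database's own pipeline: it hard-codes the codon assignments of table 11 (identical to the standard code) and its start codons, and the function name `translate` is hypothetical. It translates only the first 30 bases and the last 12 bases of the gene sequence above.

```python
from itertools import product

# Amino acids for codons in NCBI order (bases T, C, A, G; third base fastest).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS fragment, reading a valid first codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"   # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


print(translate("TTGCCCCCAA CAACAGTGAA TTCGGTGACC"))  # MPPTTVNSVT
print(translate("GCGCCGGTT TAG"))                     # APV
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final three residues followed by the TAG stop codon. Note that the second call passes a fragment from the end of the gene, so its first codon (GCG) is not a start codon and is translated normally.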