Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3104 |
Symbol | mutY |
ID | 6146887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3189820 |
End bp | 3190872 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617972 |
Product | adenine DNA glycosylase |
Protein accession | YP_001745123 |
Protein GI | 170682894 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00400976 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAGCGT CGCAATTTTC AGCCCAGGTT CTGGACTGGT ACGATAAATA CGGGCGGAAA ACGCTGCCCT GGCAAATTGA CAAGACGCCC TACAAAGTAT GGCTCTCAGA AGTGATGTTG CAACAAACTC AGGTTGCGAC TGTTATCCCC TATTTTGAAC GCTTTATGGC GCGCTTCCCG ACGGTGACCG ATCTCGCCAA TGCACCGCTG GATGAAGTTC TCCACTTGTG GACCGGGCTT GGCTATTACG CCCGCGCGCG CAATCTGCAT AAAGCGGCAC AACAAGTGGC CACCTTACAC AGCGGTAAAT TCCCGGAAAC CTTTGAAGAA GTCGCGGCGT TACCAGGCGT CGGGCGTTCT ACCGCAGGCG CGATTCTCTC GCTTTCTCTG GGTAAGCACT TTCCGATTCT CGACGGTAAC GTCAAACGGG TGCTGGCGCG CTGCTATGCT GTAAGCGGCT GGCCTGGGAA AAAAGAGGTC GAGAATAAAC TGTGGAGTTT AAGCGAGCAG GTGACGCCCG CGGTTGGCGT GGAACGGTTT AATCAGGCGA TGATGGATTT GGGCGCGATG ATTTGCACGC GCTCGAAGCC GAAATGTTCG TTCTGTCCGC TACAAAACGG ATGTATTGCC ACCGCTAACA ATAGCTGGTC GCTTTATCCG GGCAAAAAAC CGAAACAGAC GCTGCCGGAG CGTACTGGCT ACTTTCTGCT GTTACAGCAC GAAGATGAAG TATTGCTGGC GCAGCGTCCG CCGAGCGGAT TGTGGGGCGG TTTATACTGT TTCCCGCAGT TTGCCGACGA AGAAAGTTTG CGGCAATGGC TGGCGCAACG GCAGATTGTT GCTGATAACC TGACGCAGCT GACCGCGTTT CGCCATACCT TCAGCCATTT CCACTTAGAT ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA TTCACCGGCT GCATGGATGA AGGCAATGCG CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG TTACAGCAGT TACGCACTGG CGCGCCGGTT TAG
|
Protein sequence | MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP YFERFMARFP TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH SGKFPETFEE VAALPGVGRS TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ VTPAVGVERF NQAMMDLGAM ICTRSKPKCS FCPLQNGCIA TANNSWSLYP GKKPKQTLPE RTGYFLLLQH EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIV ADNLTQLTAF RHTFSHFHLD IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
|
| |