Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3350 |
Symbol | mutY |
ID | 6488657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3261052 |
End bp | 3262104 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642743483 |
Product | adenine DNA glycosylase |
Protein accession | YP_002047099 |
Protein GI | 194447840 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 0.443349 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCGT CTCAATTTTC AGCCCAGGTT CTGGACTGGT ACGACAAATA CGGGCGGAAA ACGCTGCCCT GGCAAATTAA CAAGACGCCT TACAAAGTAT GGCTCTCGGA AGTCATGTTG CAACAAACGC AGGTGACGAC GGTGATTCCT TACTTTGAGC GATTTATGGC GCGCTTTCCC ACAGTGACGG ATTTAGCGAA TGCGCCGCTG GATGAAGTAC TCCATTTATG GACCGGGCTC GGCTATTACG CCCGCGCACG TAATTTGCAT AAAGCGGCGC AACAGGTGGC GACGCTTCAC GGTGGAGAAT TCCCGCAAAC TTTTGCCGAA ATCGCCGCGC TACCCGGCGT CGGGCGTTCA ACCGCCGGCG CAATTCTCTC CCTCGCGTTA GGTAAACATT ATCCGATTCT TGATGGAAAC GTTAAACGTG TGCTGGCTCG CTGTTATGCT GTTAGCGGCT GGCCTGGAAA AAAAGAGGTG GAGAATACGC TGTGGACGCT GAGCGAGCAA GTGACGCCCG CACACGGCGT GGAGCGTTTT AATCAGGCGA TGATGGATCT GGGCGCGATG GTTTGTACGC GTTCAAAGCC AAAGTGCACC CTGTGTCCGC TGCAAAACGG TTGTATCGCC GCTGCGCATG AAAGCTGGTC ACGCTATCCG GGCAAGAAAC CGAAACAGAC GTTGCCGGAG CGGACGGGTT ACTTTTTATT GTTACAGCAT AATCAGGAGA TTTTCCTGGC GCAGCGTCCT CCCAGCGGTT TATGGGGCGG ACTCTACTGC TTTCCGCAGT TCGCCAGAGA AGATGAATTA CGTGAATGGC TGGCGCAACG GCATGTTAAC GCTGATAATT TGACCCAGCT TAATGCGTTT CGCCACACAT TTAGCCATTT CCATCTGGAT ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA CTGGACGCCT GCATGGATGA AGGCAGCGCG CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTCGGACTGG CGGCCCCCGT GGAGCGCTTG TTACAGCAGT TACGTACCGG AGCGCCAGTT TAA
|
Protein sequence | MQASQFSAQV LDWYDKYGRK TLPWQINKTP YKVWLSEVML QQTQVTTVIP YFERFMARFP TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGEFPQTFAE IAALPGVGRS TAGAILSLAL GKHYPILDGN VKRVLARCYA VSGWPGKKEV ENTLWTLSEQ VTPAHGVERF NQAMMDLGAM VCTRSKPKCT LCPLQNGCIA AAHESWSRYP GKKPKQTLPE RTGYFLLLQH NQEIFLAQRP PSGLWGGLYC FPQFAREDEL REWLAQRHVN ADNLTQLNAF RHTFSHFHLD IVPMWLPVSS LDACMDEGSA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
|
| |