Gene ECH74115_4265 details


Gene Information

Locus tag: ECH74115_4265
Symbol: mutY
ID: 6969259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3949004
End bp: 3950056
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 55%
IMG OID: 643388003
Product: adenine DNA glycosylase
Protein accession: YP_002272442
Protein GI: 209395967
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
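
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3949004
end_bp = 3950056
gene_length_bp = 1053
protein_length_aa = 350

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1053 bp is 351 codons, of which 350 encode amino acids.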


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.528078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCGT CGCAATTTTC AGCCCAGGTT CTGGACTGGT ACGATAAATA CGGGCGAAAA 
ACTCTGCCCT GGCAAATTGA CAAGACGCCC TACAAAGTAT GGCTCTCAGA AGTGATGTTG
CAACAAACTC AGGTTGCGAC CGTTATCCCC TATTTTGAAC GCTTTATGGC GCGCTTCCCG
ACGGTGACCG ATCTCGCCAA TGCGCCGCTC GACGAAGTTC TCCACTTGTG GACGGGGCTT
GGCTATTACG CCCGCGCGCG CAATCTGCAT AAAGCGGCAC AACAAGTGGC GACCTTACAC
GGCGGTAAAT TCCCGGAAAC CTTTGAGGAA GTTGCAGCAC TGCCGGGCGT CGGGCGTTCC
ACCGCAGGCG CGATTCTATC GCTTTCTCTG GGTAAGCACT TTCCGATTCT CGACGGTAAC
GTCAAACGGG TGCTGGCGCG CTGCTATGCT GTAAGCGGCT GGCCTGGGAA AAAAGAGGTC
GAGAATAAAT TATGGAGTTT AAGCGAGCAG GTGACGCCCG CGGTCGGCGT GGAACGGTTT
AATCAGGCGA TGATGGATTT GGGCGCGATG ATTTGCACGC GTTCGAAGCC GAAATGTTCG
CTCTGTCCGC TACAAAACGG ATGTATTGCC GCCACCAACA ATAGCTGGTC GCTTTATCCG
GGCAAAAAAC CGAAACAGAC GCTGCCGGAG CGCACCGGCT ACTTTCTCCT GTTACAGCAC
GAAGATGAAG TATTGCTGGC GCAGCGTCCG CCGAGCGGAT TGTGGGGCGG TTTATACTGT
TTCCCGCAGT TTGCCGACGA AGAAAGTTTG CGGCAGTGGC TGGCGCAACG GCAGATTGCT
GCCGATAACC TGACGCAGCT GACCGCGTTT CGGCATACCT TCAGCCATTT CCACTTAGAT
ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA TTCACCGGCT GCATGGATGA AGGCAATGCG
CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG
TTACAGCAGT TACGCACTGG CGCGCCGGTT TAG
 
Protein sequence
MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP YFERFMARFP 
TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGKFPETFEE VAALPGVGRS
TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ VTPAVGVERF
NQAMMDLGAM ICTRSKPKCS LCPLQNGCIA ATNNSWSLYP GKKPKQTLPE RTGYFLLLQH
EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIA ADNLTQLTAF RHTFSHFHLD
IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
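
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The function name `translate` and the constants are hypothetical helpers written for this example. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (start of its first line).
print(translate("ATGCAAGCGT CGCAATTTTC AGCCCAGGTT"))     # MQASQFSAQV
# Last line of the gene sequence (bases 1021 to 1053, in frame).
print(translate("TTACAGCAGT TACGCACTGG CGCGCCGGTT TAG"))  # LQQLRTGAPV
```

Both outputs match the corresponding ends of the protein sequence listed above, and the final TAG codon is the untranslated stop.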