Gene EcHS_A3122 details


Gene Information

Locus tag: EcHS_A3122
Symbol: mutY
ID: 5593178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3131901
End bp: 3132983
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 55%
IMG OID: 640922241
Product: adenine DNA glycosylase
Protein accession: YP_001459741
Protein GI: 157162423
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
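
The listed gene and protein lengths follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 3131901
end_bp = 3132983

# Inclusive coordinates: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1083 bp
print(protein_length, "aa")  # 360 aa
```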


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCCCCAA CAACAGTGAA TTCGGTGACC ATGCAAGCGT CGCAATTTTC AGCCCAGGTT 
CTGGACTGGT ACGATAAATA CGGGCGAAAA ACGCTGCCCT GGCAAATTGA CAAGACGCCC
TACAAAGTAT GGCTCTCAGA AGTGATGTTG CAACAAACTC AGGTTGCGAC CGTTATCCCC
TATTTTGAAC GCTTTATGGC GCGCTTCCCG ACGGTGACCG ATCTCGCCAA TGCGCCGCTC
GACGAAGTTC TCCACTTGTG GACCGGGCTT GGCTATTACG CCCGCGCGCG CAATCTGCAT
AAAGCGGCAC AACAAGTGGC GACCTTACAC GGCGGTAAAT TCCCGGAAAC CTTTGAAGAA
GTCGCGGCGT TACCGGGCGT CGGGCGTTCC ACCGCAGGCG CGATTCTCTC GCTTTCTCTG
GGTAAGCACT TTCCGATTCT CGACGGTAAC GTCAAACGCG TGCTGGCGCG CTGCTATGCT
GTAAGCGGCT GGCCTGGGAA AAAAGAGGTC GAGAATAAAT TATGGAGTTT GAGCGAGCAG
GTGACGCCCG CGGTTGGCGT GGAACGGTTT AATCAGGCGA TGATGGATTT GGGTGCGATG
ATTTGTACGC GCTCGAAACC GAAATGTTCG CTCTGTCCGC TACAAAACGG ATGTATTGCC
GCCGCCAACA ATAGCTGGGC GCTTTATCCG GGCAAAAAAC CGAAACAGAC GCTGCCGGAG
CGCACCGGCT ACTTTTTGCT ATTACAGCAC GAAGATGAAG TATTGCTGGC GCAGCGTCCG
CCGAGCGGAT TGTGGGGCGG TTTATACTGT TTCCCGCAGT TTGCCGACGA AGAAAGTTTG
CGGCAGTGGC TGGCGCAACG GCAGATTGCT GCCGATAACC TGACGCAACT GACCGCGTTT
CGGCATACCT TCAGCCATTT CCACTTAGAT ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA
TTCACCGGCT GCATGGATGA AGGCAATGCG CTCTGGTATA ACTTAGCGCA ACCGCCGTCA
GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG TTACAGCAGT TACGCACTGG CGCGCCGGTT
TAG
 
Protein sequence
MPPTTVNSVT MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP 
YFERFMARFP TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGKFPETFEE
VAALPGVGRS TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ
VTPAVGVERF NQAMMDLGAM ICTRSKPKCS LCPLQNGCIA AANNSWALYP GKKPKQTLPE
RTGYFLLLQH EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIA ADNLTQLTAF
RHTFSHFHLD IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
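
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an initiating TTG is read as methionine. The following is a minimal Python sketch of that translation, not the pipeline that produced this record. The function name `translate_cds` is hypothetical, and treating every table-11 initiation codon at the first position as M is a simplifying assumption.

```python
# Codon table built in the usual TCAG order. Table 11 assigns the same
# amino acids as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # assumption: any start codon initiates with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCCCCCAA CAACAGTGAA TTCGGTGACC"))  # MPPTTVNSVT
```

The output matches the first ten residues of the protein sequence. Note that the internal GTG codons are read as valine, since only the first codon is treated as a start.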