Gene B21_02754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02754 
SymbolmutY 
ID8116401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2932986 
End bp2934038 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content55% 
IMG OID644848945 
Producthypothetical protein 
Protein accessionYP_003000518 
Protein GI251786214 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0736099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGCGT CGCAATTTTC AGCCCAGGTT CTGGACTGGT ACGATAAATA CGGGCGAAAA 
ACTCTGCCCT GGCAAATTGA CAAGACGCCC TACAAAGTAT GGCTCTCAGA AGTGATGTTG
CAACAAACTC AGGTTGCGAC CGTTATCCCC TATTTTGAAC GCTTTATGGC GCGCTTCCCG
ACGGTGACCG ATCTCGCCAA TGCGCCGCTC GACGAAGTTC TCCACTTGTG GACCGGGCTT
GGCTATTACG CCCGCGCGCG CAATCTGCAT AAAGCGGCAC AACAAGTGGC GACCTTACAC
GGCGGTAAAT TCCCGGAAAC CTTTGAGGAA GTTGCAGCAC TGCCGGGCGT CGGGCGTTCC
ACCGCAGGCG CGATTCTCTC GCTTTCTCTG GGTAAGCACT TTCCGATTCT CGACGGTAAC
GTCAAACGGG TGCTGGCGCG CTGCTATGCT GTAAGCGGCT GGCCAGGGAA AAAAGAGGTC
GAGAATAAAT TATGGAGTTT GAGCGAGCAG GTGACGCCCG CGGTTGGCGT GGAACGGTTT
AATCAGGCGA TGATGGATTT GGGTGCGATG ATTTGTACGC GCTCGAAACC GAAATGTTCG
CTCTGTCCGC TACAAAACGG ATGTATTGCC GCCGCCAACA ATAGCTGGGC GCTTTATCCG
GGCAAAAAAC CGAAACAGAC GCTGCCGGAG CGCACCGGCT ACTTTTTGCT ATTACAGCAC
GAAGATGAAG TATTGCTGGC GCAGCGTCCG CCGAGCGGAT TGTGGGGCGG TTTATACTGT
TTCCCGCAGT TTGCCGACGA AGAAAGTTTG CGGCAGTGGC TGGCGCAACG GCAGATTGCT
GCCGATAACC TGACGCAACT GACCGCGTTT CGGCATACCT TCAGCCATTT CCACTTAGAT
ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA TTCACCGGCT GCATGGATGA AGGCAATGCG
CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG
TTACAGCAGT TACGCACTGG CGCGCCGGTT TAG
 
Protein sequence
MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP YFERFMARFP 
TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGKFPETFEE VAALPGVGRS
TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ VTPAVGVERF
NQAMMDLGAM ICTRSKPKCS LCPLQNGCIA AANNSWALYP GKKPKQTLPE RTGYFLLLQH
EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIA ADNLTQLTAF RHTFSHFHLD
IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV