Gene EcSMS35_3104 details


Gene Information

Locus tag: EcSMS35_3104
Symbol: mutY
ID: 6146887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3189820
End bp: 3190872
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 55%
IMG OID: 641617972
Product: adenine DNA glycosylase
Protein accession: YP_001745123
Protein GI: 170682894
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
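
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3189820, 3190872)
print(length)                  # 1053
print(protein_length(length))  # 350
```

Both values agree with the Gene Length and Protein Length fields of this record.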


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.00400976
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAAGCGT CGCAATTTTC AGCCCAGGTT CTGGACTGGT ACGATAAATA CGGGCGGAAA 
ACGCTGCCCT GGCAAATTGA CAAGACGCCC TACAAAGTAT GGCTCTCAGA AGTGATGTTG
CAACAAACTC AGGTTGCGAC TGTTATCCCC TATTTTGAAC GCTTTATGGC GCGCTTCCCG
ACGGTGACCG ATCTCGCCAA TGCACCGCTG GATGAAGTTC TCCACTTGTG GACCGGGCTT
GGCTATTACG CCCGCGCGCG CAATCTGCAT AAAGCGGCAC AACAAGTGGC CACCTTACAC
AGCGGTAAAT TCCCGGAAAC CTTTGAAGAA GTCGCGGCGT TACCAGGCGT CGGGCGTTCT
ACCGCAGGCG CGATTCTCTC GCTTTCTCTG GGTAAGCACT TTCCGATTCT CGACGGTAAC
GTCAAACGGG TGCTGGCGCG CTGCTATGCT GTAAGCGGCT GGCCTGGGAA AAAAGAGGTC
GAGAATAAAC TGTGGAGTTT AAGCGAGCAG GTGACGCCCG CGGTTGGCGT GGAACGGTTT
AATCAGGCGA TGATGGATTT GGGCGCGATG ATTTGCACGC GCTCGAAGCC GAAATGTTCG
TTCTGTCCGC TACAAAACGG ATGTATTGCC ACCGCTAACA ATAGCTGGTC GCTTTATCCG
GGCAAAAAAC CGAAACAGAC GCTGCCGGAG CGTACTGGCT ACTTTCTGCT GTTACAGCAC
GAAGATGAAG TATTGCTGGC GCAGCGTCCG CCGAGCGGAT TGTGGGGCGG TTTATACTGT
TTCCCGCAGT TTGCCGACGA AGAAAGTTTG CGGCAATGGC TGGCGCAACG GCAGATTGTT
GCTGATAACC TGACGCAGCT GACCGCGTTT CGCCATACCT TCAGCCATTT CCACTTAGAT
ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA TTCACCGGCT GCATGGATGA AGGCAATGCG
CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG
TTACAGCAGT TACGCACTGG CGCGCCGGTT TAG
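
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is laid out as shown here, in whitespace-separated blocks of ten bases, and that GC content is the percentage of G and C bases over the full length; the helper names are hypothetical. It is demonstrated on the first line of the sequence only.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCAAGCGT CGCAATTTTC AGCCCAGGTT CTGGACTGGT ACGATAAATA CGGGCGGAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Passing the complete gene sequence to the same two functions gives a whole-gene value to compare with the reported GC content.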
 
Protein sequence
MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP YFERFMARFP 
TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH SGKFPETFEE VAALPGVGRS
TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ VTPAVGVERF
NQAMMDLGAM ICTRSKPKCS FCPLQNGCIA TANNSWSLYP GKKPKQTLPE RTGYFLLLQH
EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIV ADNLTQLTAF RHTFSHFHLD
IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
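
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The following is a minimal sketch, assuming the NCBI codon ordering (bases in the order T, C, A, G) and the table 11 amino acid string; it does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCAAGCGT CGCAATTTTC AGCCCAGGTT"))  # MQASQFSAQV
```

The output matches the first block of the protein sequence; translating the full gene sequence the same way allows a comparison against all 350 residues.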