Gene SbBS512_E3394 details

Gene Information

Locus tag: SbBS512_E3394
Symbol: mutY
ID: 6271779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3156206
End bp: 3157288
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 55%
IMG OID: 641727285
Product: adenine DNA glycosylase
Protein accession: YP_001881735
Protein GI: 187731126
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
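
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3156206
END_BP = 3157288
REPORTED_GENE_LENGTH_BP = 1083
REPORTED_PROTEIN_LENGTH_AA = 360


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH_BP)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: the span covers 1083 bp, which is 361 codons, or 360 residues once the stop codon is excluded.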


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCCCCAA CAACAGTGAA TTCGGTGACC ATGCAAGCGT CGCAATTTTC AGCCCAGGTT 
CTGGACTGGT ACGATAAATA CGGGCGAAAA ACGCTGCCCT GGCAAATTGA CAAGACGCCC
TACAAAGTAT GGCTCTCAGA AGTGATGTTG CAACAAACTC AGGTTGCGAC CGTTATCCCC
TATTTTGAAC GCTTTATGGC GCGCTTCCCG ACGGTGACCG ATCTCGCCAA TGCGCCGCTC
GACGAAGTTC TCCACTTGTG GACCGGGCTT GGCTATTACG CCCGCGCGCG CAATATGCAT
AAAGCGGCAC AACAAGTGGC GACCTTACAC GGCGGTAAAT TCCCGGAAAC CTTTGAAGAA
GTCGCGGCGT TACCGGGCGT CGGGCGTTCC ACCGCAGGCG CGATTCTCTC GCTTTCTCTG
GGTAAGCACT TTCCGATTCT CGACGGTAAC GTCAAACGCG TGCTGGCGCG CTGCTATGCT
GTAAGCGGCT GGCCTGGGAA AAAAGAGGTC GAGAATAAAT TATGGAGTTT GAGCGAGCAG
GTGACGCCCG CGGTCGGCGT GGAACGGTTT AATCAGGCGA TGATGGATTT GGGCGCGATG
ATTTGTACGC GCTCGAAGCC GAAATGTTCG CTCTGTCCGC TACAAAACGG ATGTATTGCC
GCCGCCAATA ATAGCTGGTC GCTTTATCCG GGCAAAAAAC CGAAACAGAC GCTGCCGGAG
CGCACCGGCT ACTTTTTGCT GTTACAGCAC GAAGATGAAG TATTGCTGGC GCAGCGTCCG
CCGAGCGGAT TGTGGGGCGG TTTATACTGT TTCCCGCAGT TTGCCGACGA AGAAAGTTTG
CGGCAGTGGC TGGCGCAACG GCAGATTGCT GCCGATAACC TGACGCAACT GACCGCGTTT
CGGCATACCT TCAGCCATTT CCACTTAGAT ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA
TTCACCGGCT GCATGGATGA AGGCAATGCG CTCTGGTATA ACTTAGCGCA ACCGCCGTCA
GTTGGCCTGG CGGCTCCCGT GGAGCGTTTG TTACAGCAGT TACGCACTGG CGCGCCGGTT
TAG
 
Protein sequence
MPPTTVNSVT MQASQFSAQV LDWYDKYGRK TLPWQIDKTP YKVWLSEVML QQTQVATVIP 
YFERFMARFP TVTDLANAPL DEVLHLWTGL GYYARARNMH KAAQQVATLH GGKFPETFEE
VAALPGVGRS TAGAILSLSL GKHFPILDGN VKRVLARCYA VSGWPGKKEV ENKLWSLSEQ
VTPAVGVERF NQAMMDLGAM ICTRSKPKCS LCPLQNGCIA AANNSWSLYP GKKPKQTLPE
RTGYFLLLQH EDEVLLAQRP PSGLWGGLYC FPQFADEESL RQWLAQRQIA ADNLTQLTAF
RHTFSHFHLD IVPMWLPVSS FTGCMDEGNA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV