Gene SeD_A3453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3453 
SymbolmutY 
ID6873877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3315746 
End bp3316798 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content54% 
IMG OID642786447 
Productadenine DNA glycosylase 
Protein accessionYP_002217085 
Protein GI198243866 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.849569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCGT CTCAATTTTC AGCCCAGGTT CTGGACTGGT ACGACAAATA CGGGCGGAAA 
ACGCTGCCCT GGCAAATTAA CAAGACGCCT TACAAAGTAT GGCTCTCGGA AGTCATGTTG
CAACAAACGC AGGTGACGAC GGTGATTCCT TACTTTGAGC GATTTATGGC GCGCTTTCCG
ACGGTAACGG ATTTAGCGAA TGCGCCGCTG GATGAAGTAC TCCATTTATG GACCGGGCTC
GGCTATTACG CCCGCGCACG TAATTTGCAT AAAGCGGCGC AACAGGTGGC GACGCTTCAC
GGTGGAGAAT TCCCGCAAAC TTTTGCCGAA ATCGCCGCGC TCCCCGGCGT CGGACGTTCA
ACCGCCGGCG CAATTCTCTC CCTCGCGTTA GGTAAACATT ATCCGATTCT TGATGGAAAC
GTTAAACGTG TGCTGGCTCG CTGTTATGCT GTTAGCGGCT GGCCTGGAAA AAAAGAGGTG
GAGAATACGC TGTGGACGTT GAGCGAGCAA GTGACGCCCG CACGCGGCGT GGAGCGTTTT
AATCAGGCGA TGATGGATCT GGGCGCAATG GTTTGTACGC GTTCAAAGCC AAAGTGCACC
CTGTGTCCGC TGCAAAACGG TTGTATCGCC GCTGCGCATG AAAGCTGGTC ACGCTATCCG
GGCAAGAAAC CGAAACAGAC GTTGCCGGAG CGGACGGGTT ACTTTTTATT GTTACAGCAT
AATCAGGAGA TTTTCCTGGC GCAGCGCCCT CCCAGCGGTT TATGGGGCGG ACTCTACTGC
TTCCCGCAGT TCGCCAGAGA AGATGAATTA CGTGAATGGC TGGCGCAACG GCATGTTAAC
GCTGATAATT TGACCCAGCT TAATGCGTTT CGCCACACAT TTAGCCATTT CCATCTGGAT
ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA CTGGACGCCT GCATGGATGA AGGCAGCGCG
CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTCGGACTGG CGGCCCCCGT GGAGCGCTTG
TTACAGCAGT TACGTACCGG AGCGCCAGTT TAA
 
Protein sequence
MQASQFSAQV LDWYDKYGRK TLPWQINKTP YKVWLSEVML QQTQVTTVIP YFERFMARFP 
TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGEFPQTFAE IAALPGVGRS
TAGAILSLAL GKHYPILDGN VKRVLARCYA VSGWPGKKEV ENTLWTLSEQ VTPARGVERF
NQAMMDLGAM VCTRSKPKCT LCPLQNGCIA AAHESWSRYP GKKPKQTLPE RTGYFLLLQH
NQEIFLAQRP PSGLWGGLYC FPQFAREDEL REWLAQRHVN ADNLTQLNAF RHTFSHFHLD
IVPMWLPVSS LDACMDEGSA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV