Gene SNSL254_A3358 details


Gene Information

Locus tag: SNSL254_A3358
Symbol: mutY
ID: 6484179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3258019
End bp: 3259071
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 55%
IMG OID: 642738649
Product: adenine DNA glycosylase
Protein accession: YP_002042370
Protein GI: 194444686
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
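
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive coordinates (the record does not say so, but the listed gene length is consistent with that reading) and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the final codon is the stop codon


# Values copied from the record above.
gene_length = span_length(3258019, 3259071)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1053 bp) and Protein Length (350 aa).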


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.173815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCGT CTCAATTTTC AGCCCAGGTT CTGGACTGGT ACGACAAATA CGGGCGGAAA 
ACGCTGCCCT GGCAAATTAA CAAGACGCCT TACAAAGTAT GGCTCTCGGA AGTCATGTTG
CAACAAACGC AGGTGACGAC GGTGATTCCT TACTTTGAGC GATTTATGGC GCGCTTTCCG
ACAGTGACGG ATTTAGCGAA TGCGCCGCTG GATGAAGTGC TCCATTTATG GACCGGGCTC
GGCTATTACG CCCGCGCGCG TAATTTGCAT AAAGCGGCGC AACAGGTGGC GACGCTTCAC
GGTGGAGAAT TCCCGCAAAC TTTTGCCGAA ATCGCCGCGC TACCCGGCGT CGGGCGCTCA
ACCGCCGGCG CGATTCTCTC CCTCGCGTTA GGTAAACATT ATCCGATTCT TGATGGAAAC
GTTAAACGTG TGCTGGCTCG CTGTTATGCT GTTAGCGGCT GGCCTGGAAA AAAAGAGGTG
GAGAATACGC TGTGGACGTT GAGCGAGCAA GTGACGCCCG CACGCGGCGT GGAGCGTTTT
AATCAGGCGA TGATGGATCT GGGCGCGATG GTTTGTACGC GTTCAAAGCC AAAGTGCACC
CTGTGTCCGC TGCAAAACGG TTGTATCGCC GCTGCGCATG AAAGCTGGTC ACGCTATCCG
GGCAAGAAAC CGAAACAGAC GTTGCCGGAG CGGACGGGTT ACTTTTTATT GTTACAGCAT
AATCAGGAGA TTTTCCTGGC GCAGCGCCCT CCCAGCGGTT TATGGGGCGG ACTCTACTGC
TTCCCGCAGT TCGCCAGCGA AGATGAATTA CGTGAATGGC TGGCGCAACG GCATGTTAAC
GCTGATAATT TGACCCAGCT TAACGCGTTT CGCCACACAT TTAGCCATTT CCATCTGGAT
ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA CTGGGCGTCT GCATGGATGA AGGCAGCGCG
CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTCGGACTGG CGGCCCCCGT GGAGCGCTTG
TTACAGCAGT TACGTACCGG AGCGCCAGTT TAA
 
Protein sequence
MQASQFSAQV LDWYDKYGRK TLPWQINKTP YKVWLSEVML QQTQVTTVIP YFERFMARFP 
TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGEFPQTFAE IAALPGVGRS
TAGAILSLAL GKHYPILDGN VKRVLARCYA VSGWPGKKEV ENTLWTLSEQ VTPARGVERF
NQAMMDLGAM VCTRSKPKCT LCPLQNGCIA AAHESWSRYP GKKPKQTLPE RTGYFLLLQH
NQEIFLAQRP PSGLWGGLYC FPQFASEDEL REWLAQRHVN ADNLTQLNAF RHTFSHFHLD
IVPMWLPVSS LGVCMDEGSA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV
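
The sequence blocks above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be recomputed from the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may act as start codons, and this sketch does not handle alternative start codons (the gene here begins with ATG, so that does not matter for this record).

```python
# Amino acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCAAGCGT CTCAATTTTC AGCCCAGGTT"))  # MQASQFSAQV
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.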