Gene SeHA_C3350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3350 
SymbolmutY 
ID6488657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3261052 
End bp3262104 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content54% 
IMG OID642743483 
Productadenine DNA glycosylase 
Protein accessionYP_002047099 
Protein GI194447840 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.443349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCGT CTCAATTTTC AGCCCAGGTT CTGGACTGGT ACGACAAATA CGGGCGGAAA 
ACGCTGCCCT GGCAAATTAA CAAGACGCCT TACAAAGTAT GGCTCTCGGA AGTCATGTTG
CAACAAACGC AGGTGACGAC GGTGATTCCT TACTTTGAGC GATTTATGGC GCGCTTTCCC
ACAGTGACGG ATTTAGCGAA TGCGCCGCTG GATGAAGTAC TCCATTTATG GACCGGGCTC
GGCTATTACG CCCGCGCACG TAATTTGCAT AAAGCGGCGC AACAGGTGGC GACGCTTCAC
GGTGGAGAAT TCCCGCAAAC TTTTGCCGAA ATCGCCGCGC TACCCGGCGT CGGGCGTTCA
ACCGCCGGCG CAATTCTCTC CCTCGCGTTA GGTAAACATT ATCCGATTCT TGATGGAAAC
GTTAAACGTG TGCTGGCTCG CTGTTATGCT GTTAGCGGCT GGCCTGGAAA AAAAGAGGTG
GAGAATACGC TGTGGACGCT GAGCGAGCAA GTGACGCCCG CACACGGCGT GGAGCGTTTT
AATCAGGCGA TGATGGATCT GGGCGCGATG GTTTGTACGC GTTCAAAGCC AAAGTGCACC
CTGTGTCCGC TGCAAAACGG TTGTATCGCC GCTGCGCATG AAAGCTGGTC ACGCTATCCG
GGCAAGAAAC CGAAACAGAC GTTGCCGGAG CGGACGGGTT ACTTTTTATT GTTACAGCAT
AATCAGGAGA TTTTCCTGGC GCAGCGTCCT CCCAGCGGTT TATGGGGCGG ACTCTACTGC
TTTCCGCAGT TCGCCAGAGA AGATGAATTA CGTGAATGGC TGGCGCAACG GCATGTTAAC
GCTGATAATT TGACCCAGCT TAATGCGTTT CGCCACACAT TTAGCCATTT CCATCTGGAT
ATTGTGCCTA TGTGGCTTCC CGTGTCGTCA CTGGACGCCT GCATGGATGA AGGCAGCGCG
CTCTGGTATA ACTTAGCGCA ACCGCCGTCA GTCGGACTGG CGGCCCCCGT GGAGCGCTTG
TTACAGCAGT TACGTACCGG AGCGCCAGTT TAA
 
Protein sequence
MQASQFSAQV LDWYDKYGRK TLPWQINKTP YKVWLSEVML QQTQVTTVIP YFERFMARFP 
TVTDLANAPL DEVLHLWTGL GYYARARNLH KAAQQVATLH GGEFPQTFAE IAALPGVGRS
TAGAILSLAL GKHYPILDGN VKRVLARCYA VSGWPGKKEV ENTLWTLSEQ VTPAHGVERF
NQAMMDLGAM VCTRSKPKCT LCPLQNGCIA AAHESWSRYP GKKPKQTLPE RTGYFLLLQH
NQEIFLAQRP PSGLWGGLYC FPQFAREDEL REWLAQRHVN ADNLTQLNAF RHTFSHFHLD
IVPMWLPVSS LDACMDEGSA LWYNLAQPPS VGLAAPVERL LQQLRTGAPV