Gene Amir_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_0343 
Symbol 
ID8324501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp372629 
End bp373618 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content76% 
IMG OID644940888 
ProductHhH-GPD family protein 
Protein accessionYP_003098158 
Protein GI256374498 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCA CGACGAGTCC CACCACGAGC CGCGCGACGA ACCGGACCGC GAGCGCCACC 
ACCGCCACCG CCGCAGCGGG CGCTGCCGAG GACGCCGCCA AGCCGGAGAG CGCCCACCTT
CCCCCGTCGG TGCTGAACAC CTGGTTCGCC GCCACCGCGC GCGACCTGCC CTGGCGCGAC
CCCGAGTGCA CCGCCTGGGG CGTCCTGGTC AGCGAGATCA TGCTCCAGCA GACCCCCGTC
GCCCGCGTCG AGCCGATCTG GCGGGTCTGG CTGGACAAGT GGCCGAGGCC CAGCGACATG
GCCGCCGCCT CCCAGGGCGA GGTGCTGCGC ATGTGGGGCA AGCTCGGCTA CCCGCGCCGC
GCCCTGCGCC TGCACGCCGC CGCCCAGGCC GTCGCCGCCG AGCACGACGA CGTCGTCCCG
GACGACGTGG AGACCCTGCT GGCCCTGCCC GGCATCGGCG CGTACACCGC GCGGGCCGTC
GCCGCCTTCG CCTACGGCCG CCGCTGCCCG GTGGTGGACA CCAACGTCCG CCGCGTCGTG
GCGCGGGCCG TGCACGGGGC CGGGGACGCG GGCCCGCCGT CGACCACCAG GGACCTGCGG
GACGTGGAGG CGCTGCTGCC CGAGGACGAG GCGTCGGCCG CGACCTACTC GGCGGCGCTG
ATGGAGCTGG GCGCGCTGGT GTGCACGGCC AGGACCCCGC GCTGCTCGGC GTGCCCGGTG
CTGGGCTCGT GCCAGTGGCA GCGCAACGGG CGGCCCGCGT ACGACGGGCC CGCGAAGGCG
GTGCAGAAGT TCGCGGGCAC CGACCGGCAG GTGCGCGGGC GGCTGCTGGA CGTGCTGCGC
GGCACGTCCG AGCCGGTCGC CAAGGAGGTG CTGGACCGGG CCTGGTCGGA CGCCGGTCAG
CGGGACCGGT GCCTGCACTC GCTGCTGGTG GACGGGCTGG TCGAGCAGAC CGCCGCCGGG
CTGTTCGCGC TGCCCGGCGA GCACGAGTGA
 
Protein sequence
MSRTTSPTTS RATNRTASAT TATAAAGAAE DAAKPESAHL PPSVLNTWFA ATARDLPWRD 
PECTAWGVLV SEIMLQQTPV ARVEPIWRVW LDKWPRPSDM AAASQGEVLR MWGKLGYPRR
ALRLHAAAQA VAAEHDDVVP DDVETLLALP GIGAYTARAV AAFAYGRRCP VVDTNVRRVV
ARAVHGAGDA GPPSTTRDLR DVEALLPEDE ASAATYSAAL MELGALVCTA RTPRCSACPV
LGSCQWQRNG RPAYDGPAKA VQKFAGTDRQ VRGRLLDVLR GTSEPVAKEV LDRAWSDAGQ
RDRCLHSLLV DGLVEQTAAG LFALPGEHE