Gene Arth_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1507 
Symbol 
ID4445970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1670602 
End bp1672116 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content70% 
IMG OID639689318 
ProductDNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / transcriptional regulator Ada 
Protein accessionYP_831001 
Protein GI116670068 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.102956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTCT GGCAGCGGTA CCGGGCAATC GATGCCCGCG ACCCCCGTTT CGACGGGCAG 
TTCTACACAG CCGTCCGGAC CACAGGCATC TATTGCCGAC CCTCGTGCCC GGCCCGGACC
CCCAAGGCGG AGAACGTCAC GTTCTACGAA ACCTCCGCTG CGGCCCACGA CGCGGGGTAC
CGGGCCTGCA AGCGCTGCCT TCCCGAAGCC GTGCCCGGAA CGCCGGCGTG GAACATCCGC
TCAGACATCG CGGGGCGGGC CATGCGGCTC ATCAACGACG GCGTCATCAA CCGCGACGGC
GTAGAAGGGC TCGCGTCGCG GCTCGGCTAC TCTTCGCGGC AGCTCAACCG GATCCTCACC
CACGAACTCG GCGCCGGTCC GCTGTCGCTG GCACGGGCAG GCAGGGCCCA GACGGCCCGG
ACGCTGCTGG TGTCCACGCA GATGAAGCTC GCCGACGTGG CGTTCGCGGC CGGATTCAGC
AGCGTCCGCC AGTTCAACGA GACAATCGGC GAGGTCTTCG ACCTGACGCC GACGGCCCTC
CGGGGCACCG CGCGCCACCA CCGGACGCCT ACTGCAACGA CGGCTCTGAC GCTGAACCTG
CCCTACCGTG AACCGTTCGA TCCGGGCATC TTCCAATTCC TCGCCGTGCG CTCCATTCCC
GGGATCGAGA CCGGCACCGG CACCTCCTAC GCGCGGACCT TGCGGCTGCC GCACGCTGAT
GCCCGCTTCA GCGTTGAGTA CGACGCCGAC GCCCCGGGGC GGCCGCTGGT TCTCACCATC
GGGGCCGTGG ACCTACGGGA CCTGCCGTCG CTGCTGAGCC GTGTCCGGCG GCTCCTGGAC
CTCGACGCAG ACCCCGTGGC CATCGACAAC GCGCTGGAAG CCGATCCGCG GCTGGCACCG
GCGGTCAAAG CCTTTCCGGG CATGCGGATG CCCGGGGCCG TGGATCCGCA GGAGTTGCTG
ATCAGGGCGA TGATCGGCCA GCAGATCACG GTCGCGGCCG CCCGGACCGC CCTCACCCAG
CTCTCCGCCT GCGGAAGCGA GAGCCTGGTG CCGGCGGACG GCCTGCACCG TCTCTTCCCC
ACTGCGGCCC AGATCGCCGA CCCGGGATTC GGCCTGCTGC GCGGTCCGCA GCGGCGGATC
GACTCGGTAA GGGCCGCTGC CGGCGCCATG GCCGCCGGAA ACCTCGACTT CGGTTACGGA
GACGACCTGG CGGGCCTGCA GTCCAAGCTC CTGCCGCTGC CCGGGGTGGG ACCCTGGACG
GTGGGGTACG TTGCCATGCG CGTGATCGGT GCACCGGATG TGTTCCTGGC CAATGACGCC
GCCGTGCGCA ACGGCATCCT CGCCCTCGAC ACCGGCCCGC AGGCGGGTGA ACGGCCGCCC
GGCGTGCAGC CCGCGGACTT CACGGACGTG AGCCCCTGGC GTTCCTACGC CACTATGCAC
CTCTGGCGTG CTGCCGCCAT GCGCCCTCAA GCCAGGCCGA GGCGGCAGGC CGAGTCTGCC
GCCTCGATGA GTTAA
 
Protein sequence
MDFWQRYRAI DARDPRFDGQ FYTAVRTTGI YCRPSCPART PKAENVTFYE TSAAAHDAGY 
RACKRCLPEA VPGTPAWNIR SDIAGRAMRL INDGVINRDG VEGLASRLGY SSRQLNRILT
HELGAGPLSL ARAGRAQTAR TLLVSTQMKL ADVAFAAGFS SVRQFNETIG EVFDLTPTAL
RGTARHHRTP TATTALTLNL PYREPFDPGI FQFLAVRSIP GIETGTGTSY ARTLRLPHAD
ARFSVEYDAD APGRPLVLTI GAVDLRDLPS LLSRVRRLLD LDADPVAIDN ALEADPRLAP
AVKAFPGMRM PGAVDPQELL IRAMIGQQIT VAAARTALTQ LSACGSESLV PADGLHRLFP
TAAQIADPGF GLLRGPQRRI DSVRAAAGAM AAGNLDFGYG DDLAGLQSKL LPLPGVGPWT
VGYVAMRVIG APDVFLANDA AVRNGILALD TGPQAGERPP GVQPADFTDV SPWRSYATMH
LWRAAAMRPQ ARPRRQAESA ASMS