Gene Information |
Locus tag | Arth_1507 |
Symbol | |
ID | 4445970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1670602 |
End bp | 1672116 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639689318 |
Product | DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / transcriptional regulator Ada |
Protein accession | YP_831001 |
Protein GI | 116670068 |
COG category | [F] Nucleotide transport and metabolism; [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase; [COG2169] Adenosine deaminase |
TIGRFAM ID | |
|
|
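The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1670602, 1672116)
print(length, protein_length(length))  # prints "1515 504"
```

The strand field does not affect this check, since start and end are reported in replicon coordinates regardless of orientation.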
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.102956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTCT GGCAGCGGTA CCGGGCAATC GATGCCCGCG ACCCCCGTTT CGACGGGCAG TTCTACACAG CCGTCCGGAC CACAGGCATC TATTGCCGAC CCTCGTGCCC GGCCCGGACC CCCAAGGCGG AGAACGTCAC GTTCTACGAA ACCTCCGCTG CGGCCCACGA CGCGGGGTAC CGGGCCTGCA AGCGCTGCCT TCCCGAAGCC GTGCCCGGAA CGCCGGCGTG GAACATCCGC TCAGACATCG CGGGGCGGGC CATGCGGCTC ATCAACGACG GCGTCATCAA CCGCGACGGC GTAGAAGGGC TCGCGTCGCG GCTCGGCTAC TCTTCGCGGC AGCTCAACCG GATCCTCACC CACGAACTCG GCGCCGGTCC GCTGTCGCTG GCACGGGCAG GCAGGGCCCA GACGGCCCGG ACGCTGCTGG TGTCCACGCA GATGAAGCTC GCCGACGTGG CGTTCGCGGC CGGATTCAGC AGCGTCCGCC AGTTCAACGA GACAATCGGC GAGGTCTTCG ACCTGACGCC GACGGCCCTC CGGGGCACCG CGCGCCACCA CCGGACGCCT ACTGCAACGA CGGCTCTGAC GCTGAACCTG CCCTACCGTG AACCGTTCGA TCCGGGCATC TTCCAATTCC TCGCCGTGCG CTCCATTCCC GGGATCGAGA CCGGCACCGG CACCTCCTAC GCGCGGACCT TGCGGCTGCC GCACGCTGAT GCCCGCTTCA GCGTTGAGTA CGACGCCGAC GCCCCGGGGC GGCCGCTGGT TCTCACCATC GGGGCCGTGG ACCTACGGGA CCTGCCGTCG CTGCTGAGCC GTGTCCGGCG GCTCCTGGAC CTCGACGCAG ACCCCGTGGC CATCGACAAC GCGCTGGAAG CCGATCCGCG GCTGGCACCG GCGGTCAAAG CCTTTCCGGG CATGCGGATG CCCGGGGCCG TGGATCCGCA GGAGTTGCTG ATCAGGGCGA TGATCGGCCA GCAGATCACG GTCGCGGCCG CCCGGACCGC CCTCACCCAG CTCTCCGCCT GCGGAAGCGA GAGCCTGGTG CCGGCGGACG GCCTGCACCG TCTCTTCCCC ACTGCGGCCC AGATCGCCGA CCCGGGATTC GGCCTGCTGC GCGGTCCGCA GCGGCGGATC GACTCGGTAA GGGCCGCTGC CGGCGCCATG GCCGCCGGAA ACCTCGACTT CGGTTACGGA GACGACCTGG CGGGCCTGCA GTCCAAGCTC CTGCCGCTGC CCGGGGTGGG ACCCTGGACG GTGGGGTACG TTGCCATGCG CGTGATCGGT GCACCGGATG TGTTCCTGGC CAATGACGCC GCCGTGCGCA ACGGCATCCT CGCCCTCGAC ACCGGCCCGC AGGCGGGTGA ACGGCCGCCC GGCGTGCAGC CCGCGGACTT CACGGACGTG AGCCCCTGGC GTTCCTACGC CACTATGCAC CTCTGGCGTG CTGCCGCCAT GCGCCCTCAA GCCAGGCCGA GGCGGCAGGC CGAGTCTGCC GCCTCGATGA GTTAA
|
Protein sequence | MDFWQRYRAI DARDPRFDGQ FYTAVRTTGI YCRPSCPART PKAENVTFYE TSAAAHDAGY RACKRCLPEA VPGTPAWNIR SDIAGRAMRL INDGVINRDG VEGLASRLGY SSRQLNRILT HELGAGPLSL ARAGRAQTAR TLLVSTQMKL ADVAFAAGFS SVRQFNETIG EVFDLTPTAL RGTARHHRTP TATTALTLNL PYREPFDPGI FQFLAVRSIP GIETGTGTSY ARTLRLPHAD ARFSVEYDAD APGRPLVLTI GAVDLRDLPS LLSRVRRLLD LDADPVAIDN ALEADPRLAP AVKAFPGMRM PGAVDPQELL IRAMIGQQIT VAAARTALTQ LSACGSESLV PADGLHRLFP TAAQIADPGF GLLRGPQRRI DSVRAAAGAM AAGNLDFGYG DDLAGLQSKL LPLPGVGPWT VGYVAMRVIG APDVFLANDA AVRNGILALD TGPQAGERPP GVQPADFTDV SPWRSYATMH LWRAAAMRPQ ARPRRQAESA ASMS
|
| |
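The sequence fields are printed in space-separated blocks of ten characters. As a minimal sketch, the snippet below strips that spacing, computes GC content, and translates the first codons of the gene sequence. It builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and only the first three blocks of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence field above.
head = "ATGGACTTCT GGCAGCGGTA CCGGGCAATC"
print(translate(head))  # prints "MDFWQRYRAI", the first block of the protein field
print(gc_content(head))
```

Passing the full gene sequence field to the same functions should let the reported GC content and protein sequence be checked against the record.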