Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0960 |
Symbol | |
ID | 3915742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1006872 |
End bp | 1007951 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640443694 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_496239 |
Protein GI | 87198982 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.446889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCCA GAGGACAAGC CACAGCCAAG TTCGATCCGC AGGCGATCGC GCCCGCCTTG CTCGACTGGT ACGATGCCCA TGCACGCAAG CTGCCGTGGC GGCGGTTGCC GGGAGAAGCG CGGCAGGACC CCTACCGGGT GTGGCTGTCC GAGGTCATGC TGCAGCAGAC GACCGTGGCG GCGGTGGGCC CCTATTTCGA GAAGTTCACG CGTTTGTGGC CGACGGTTGG CGACCTGGCG GCGGCGGACG ACGGCGATGT CATGGCTGCC TGGGCCGGGC TGGGTTATTA TGCCCGGGCC CGCAACCTGC TGGCATGTGC GCGGGCGGTG GCGGCCATGG GCGGGACTTT CCCCGATAGC GAGGACGGTC TTCGCGCGCT GCCCGGACTG GGCGAATATA CGGCGGCGGC GGTGGCTGCG ATCGCGTTCG GCCGTCGGGC GGTGGTGGTC GATGCCAATG TCGAGCGCGT CATTGCCCGG CTTTTCGCCA TCGATGAGCC CTTGCCGGCG GGGAAAGCGG CGATCCGGCT GGCGGCGGGG CAAGTGACTC CGGAGGAGCG GGCGGGCGAT TTCGCCCAGG CGATGATGGA CCTTGGCGCT ACGGTGTGCA CCGCGCGGTC GCCCCGGTGC ATGTTGTGTC CACTGCGCGA ACATTGCCGC GCGCTTGCCG AAGGTGCGCC CGAGCGCCTG CCGGTGAAGG CCGCGCGCAA GGCAAAGCCG GTGCGGCAGG GGCGCGCCTA CTGGATCGAG CGCGAGGGCA GGGTGCTGCT GGTGCGGCGG CCGGGGCGCG GGATGCTGGG CGGAATGCGC GCGCTGCCCG ACGACGGCTG GTCGGCGCGA GGCGACGGCG CCGACGCCAT CGGCGGCGAA TGGCGCGGGG GCGGCGTGGT TCGCCACGGC TTCACGCATT TCGATCTCGA ATTGCAATTG ATGCTTTGCG TTCAGGCGGA AGCGGCTAGT CTGCCCGGCC TGAACGATAT CGAGGGAGAA TGGTGGCCAG TCGACGAGAT CGAGGCCGCC GGATTGCCGA CCGTTTTCGC CAAGGCGGCG CGGCTGGCGA TTGCCGAAAG GATTGGCTGA
|
Protein sequence | MQARGQATAK FDPQAIAPAL LDWYDAHARK LPWRRLPGEA RQDPYRVWLS EVMLQQTTVA AVGPYFEKFT RLWPTVGDLA AADDGDVMAA WAGLGYYARA RNLLACARAV AAMGGTFPDS EDGLRALPGL GEYTAAAVAA IAFGRRAVVV DANVERVIAR LFAIDEPLPA GKAAIRLAAG QVTPEERAGD FAQAMMDLGA TVCTARSPRC MLCPLREHCR ALAEGAPERL PVKAARKAKP VRQGRAYWIE REGRVLLVRR PGRGMLGGMR ALPDDGWSAR GDGADAIGGE WRGGGVVRHG FTHFDLELQL MLCVQAEAAS LPGLNDIEGE WWPVDEIEAA GLPTVFAKAA RLAIAERIG
|
| |