Gene Saro_0960 details


Gene Information

Locus tag: Saro_0960
Symbol:
ID: 3915742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1006872
End bp: 1007951
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 69%
IMG OID: 640443694
Product: A/G-specific DNA-adenine glycosylase
Protein accession: YP_496239
Protein GI: 87198982
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
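
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(1006872, 1007951)
print(length_bp, protein_length(length_bp))
```

With the values from this record, this prints `1080 359`, which agrees with the Gene Length and Protein Length fields.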


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.446889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGCCA GAGGACAAGC CACAGCCAAG TTCGATCCGC AGGCGATCGC GCCCGCCTTG 
CTCGACTGGT ACGATGCCCA TGCACGCAAG CTGCCGTGGC GGCGGTTGCC GGGAGAAGCG
CGGCAGGACC CCTACCGGGT GTGGCTGTCC GAGGTCATGC TGCAGCAGAC GACCGTGGCG
GCGGTGGGCC CCTATTTCGA GAAGTTCACG CGTTTGTGGC CGACGGTTGG CGACCTGGCG
GCGGCGGACG ACGGCGATGT CATGGCTGCC TGGGCCGGGC TGGGTTATTA TGCCCGGGCC
CGCAACCTGC TGGCATGTGC GCGGGCGGTG GCGGCCATGG GCGGGACTTT CCCCGATAGC
GAGGACGGTC TTCGCGCGCT GCCCGGACTG GGCGAATATA CGGCGGCGGC GGTGGCTGCG
ATCGCGTTCG GCCGTCGGGC GGTGGTGGTC GATGCCAATG TCGAGCGCGT CATTGCCCGG
CTTTTCGCCA TCGATGAGCC CTTGCCGGCG GGGAAAGCGG CGATCCGGCT GGCGGCGGGG
CAAGTGACTC CGGAGGAGCG GGCGGGCGAT TTCGCCCAGG CGATGATGGA CCTTGGCGCT
ACGGTGTGCA CCGCGCGGTC GCCCCGGTGC ATGTTGTGTC CACTGCGCGA ACATTGCCGC
GCGCTTGCCG AAGGTGCGCC CGAGCGCCTG CCGGTGAAGG CCGCGCGCAA GGCAAAGCCG
GTGCGGCAGG GGCGCGCCTA CTGGATCGAG CGCGAGGGCA GGGTGCTGCT GGTGCGGCGG
CCGGGGCGCG GGATGCTGGG CGGAATGCGC GCGCTGCCCG ACGACGGCTG GTCGGCGCGA
GGCGACGGCG CCGACGCCAT CGGCGGCGAA TGGCGCGGGG GCGGCGTGGT TCGCCACGGC
TTCACGCATT TCGATCTCGA ATTGCAATTG ATGCTTTGCG TTCAGGCGGA AGCGGCTAGT
CTGCCCGGCC TGAACGATAT CGAGGGAGAA TGGTGGCCAG TCGACGAGAT CGAGGCCGCC
GGATTGCCGA CCGTTTTCGC CAAGGCGGCG CGGCTGGCGA TTGCCGAAAG GATTGGCTGA
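
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be normalized before a figure such as the GC content can be computed from it. The following is a rough Python sketch of that calculation. The helper names are hypothetical, and only the first displayed line of the sequence is used here for brevity, so the printed percentage is for that line alone and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence from this record.
first_line = "ATGCAGGCCA GAGGACAAGC CACAGCCAAG TTCGATCCGC AGGCGATCGC GCCCGCCTTG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full 1080 bp sequence to `clean_sequence` and `gc_fraction` in the same way gives a value that can be compared with the GC content field.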
 
Protein sequence
MQARGQATAK FDPQAIAPAL LDWYDAHARK LPWRRLPGEA RQDPYRVWLS EVMLQQTTVA 
AVGPYFEKFT RLWPTVGDLA AADDGDVMAA WAGLGYYARA RNLLACARAV AAMGGTFPDS
EDGLRALPGL GEYTAAAVAA IAFGRRAVVV DANVERVIAR LFAIDEPLPA GKAAIRLAAG
QVTPEERAGD FAQAMMDLGA TVCTARSPRC MLCPLREHCR ALAEGAPERL PVKAARKAKP
VRQGRAYWIE REGRVLLVRR PGRGMLGGMR ALPDDGWSAR GDGADAIGGE WRGGGVVRHG
FTHFDLELQL MLCVQAEAAS LPGLNDIEGE WWPVDEIEAA GLPTVFAKAA RLAIAERIG
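
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal Python sketch of such a translation, assuming the sequence is read in frame from its first base and translation stops at the first stop codon. The codon table is built from the compact amino-acid string used for NCBI table 11. Handling of alternative start codons is omitted, which does not matter here because this gene starts with ATG. The `translate` function is illustrative, not a library API.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (third base varies fastest),
# as listed for NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence from this record.
print(translate("ATGCAGGCCA GAGGACAAGC CACAGCCAAG"))
print(translate("CGGCTGGCGA TTGCCGAAAG GATTGGCTGA"))
```

The two calls print `MQARGQATAK` and `RLAIAERIG`, which match the first and last residues of the protein sequence shown above.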