Gene Daro_0064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0064 
Symbol 
ID3568055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp74598 
End bp75635 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content60% 
IMG OID637678493 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_283293 
Protein GI71905706 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000000461825 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.645907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCACCC CGAACCCGTT CACCGAACAA TTGATCGCCT GGCAGAAGAT CGCCGGGCGC 
CATGACCTGC CCTGGCAGAA TACCTGCGAT CCTTACCGGG TCTGGCTTTC CGAAATTATG
TTGCAGCAAA CACAGGTCAG CACGGCGACC CCCTACTACC TGCGTTTCCT GAGCAGTTTT
CCCGATGTAA CAGCACTGGC TACCGCGCCG ATCGAAGTCG TGATCGAGCA CTGGGCCGGC
CTTGGCTATT ACGCAAGAGC ACGCAACCTC CATCGCTGTG CTCAGCAGAT TGTCACGGTC
TATGCCGGGA GCTTTCCGGA CTCTGTAGAA AAACTCGCAG AATTGCCTGG TATCGGCCGG
TCAACTGCCG CTGCAATTGC GGCATTCTCA TTCGGAAAAC GGGCCGCAAT CCTCGACGGT
AACGTCAAAC GGGTACTGTG TCGGCAATTC GGCATCGATG GCTTTCCCGG TTCGGTGACT
ATCGACCGCA AGCTGTGGAC GCTGGCCGAA AGCCTGCTGC CAGAACGGGA TATCGAGGTA
TACACACAGG GCTTGATGGA TCTCGGTGCC ACGTTATGTA CCCGGAGCAA GCCACGTTGC
GGCGACTGTC CTGTTGCTGC GGCCTGTATC GCCCGATGCG AAGGCAGGCA GGCTGAGCTG
CCAACGGCAA AACCACGCAC CAAGGTGCCT GAACGAACCG CGACCTACGT GTTGCTCAGC
GATGGGCACC GTCTGCTACT TGAGCGACGC CCCCCAAGCG GTCTGTGGGG TGGCCTGCTG
GTGCCGCCCG AGGGCGAGCC GGATCAAGTC GCCGCCCGCT TTGGCTTGCA ACTGGGCGAG
CAGTCGAAAC TGCCTGCACT GAAGCATACC TTTACGCATT TCAAGCTGAC GCTGGAACCG
GTGCTGTGCC GCATTGAGCC GCGCACCGAC CTGGGTGAGG CGGGACTCGA GTGGGTCAAT
ATCGACAAAG CAGCCCAAGC CGGCGTACCG ACCCCGATCC GGAAACTGAT CAAGCAGGTT
GCCAGCGCAG GGGGCTGA
 
Protein sequence
MATPNPFTEQ LIAWQKIAGR HDLPWQNTCD PYRVWLSEIM LQQTQVSTAT PYYLRFLSSF 
PDVTALATAP IEVVIEHWAG LGYYARARNL HRCAQQIVTV YAGSFPDSVE KLAELPGIGR
STAAAIAAFS FGKRAAILDG NVKRVLCRQF GIDGFPGSVT IDRKLWTLAE SLLPERDIEV
YTQGLMDLGA TLCTRSKPRC GDCPVAAACI ARCEGRQAEL PTAKPRTKVP ERTATYVLLS
DGHRLLLERR PPSGLWGGLL VPPEGEPDQV AARFGLQLGE QSKLPALKHT FTHFKLTLEP
VLCRIEPRTD LGEAGLEWVN IDKAAQAGVP TPIRKLIKQV ASAGG