Gene Gdia_2032 details

Gene Information

Locus tag: Gdia_2032
Symbol:
ID: 6975459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2253626
End bp: 2254690
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 73%
IMG OID: 643391562
Product: A/G-specific adenine glycosylase
Protein accession: YP_002276407
Protein GI: 209544178
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
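
The gene and protein lengths listed above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the "Start bp" and "End bp" fields of this record.
start_bp = 2253626
end_bp = 2254690

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the Gene Length (1065 bp) and Protein Length (354 aa) fields.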


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0489232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCCTT CATCCGCCGA CCTGCTGCAT TGGTACGACC GCCACCGGCG AACCCTGCCC 
TGGCGGGCCC TGCCCGGCCA CAGCGCCGAT CCCTACCATG TCTGGCTGAG CGAAATCATG
CTGCAGCAGA CGACGGTCAC GGCGGTCATC CCCTATTATC GCCGATTCCT GGACCGGTTT
CCCACTGTCG TGGACCTGGC GCAGGCCGAT TCCGACACCG TCATGGCCGC CTGGGCCGGT
CTGGGCTATT ACGCCCGGGC GCGCAACCTG CATGACTGCG CGCGGGTGGT GGCGGCGGCC
GGCCGCTTTC CCGACGACAT GCCGAGGCTG CTGGCCCTGC CGGGGGTGGG GGCCTATACC
GCCGCCGCCA TCGCCGCCAT CGCCTTCGGC CGGCCGGTGG TCCCGGTGGA CGGCAATGTG
GAGCGCGTGA CCAGCCGGCT GTTCGCCCTG TCCGACCCGC TGCCGGGCGC CCGCAAATCC
ATCGCCCGCC AGGCGGCCAC CCTGAACCAT TCCGCCGAGG CGCAGGCGCG GCCGTCCGAT
TTCGCGCAGG CGCTGTTCGA CCTGGGCGCC GGGGTCTGCA CGCCGCGAAG CCCGGCCTGC
GCCCTGTGTC CATGGCGGGA GGCCTGCGCC GGGTTCCGCC AGGGCATCGC GGCGAACCTG
CCCGTCAAGG CGCCCCGCGC GACGAAGCCG GTGCGCTACG GCGCGCATTT CCACGTCACC
GACGCGGCCG GCCACATCCT GCTGCGCCGC CGGGCGGCGA AGGGATTGCT GGGCGGCATG
CTGGAACTGC CGGGGACCGA CTGGCGCGCC GCCCCCTGGA CGCCGGCCGA GGCCCTGGCC
CATGCCCCCC TGGCGGCATC CTGGCAGGCG GCCGGGCGGG TGACGCATGT CTTCACCCAT
TTCACCCTGC ATGTGGACCT GTATGACGCG GCGGTGGGGC ACTTCCCCAA CAGCGCGGCG
CGGGCGGGCG GCCTGGCCTT CGCCGGGCAG GCCCTGGACG GGCTGGCCCT GCCGTCGCTG
ATGCGCAAAT GCCTGGCCGC GATCCGTCCC GCCATGACGG CATGA
 
Protein sequence
MPPSSADLLH WYDRHRRTLP WRALPGHSAD PYHVWLSEIM LQQTTVTAVI PYYRRFLDRF 
PTVVDLAQAD SDTVMAAWAG LGYYARARNL HDCARVVAAA GRFPDDMPRL LALPGVGAYT
AAAIAAIAFG RPVVPVDGNV ERVTSRLFAL SDPLPGARKS IARQAATLNH SAEAQARPSD
FAQALFDLGA GVCTPRSPAC ALCPWREACA GFRQGIAANL PVKAPRATKP VRYGAHFHVT
DAAGHILLRR RAAKGLLGGM LELPGTDWRA APWTPAEALA HAPLAASWQA AGRVTHVFTH
FTLHVDLYDA AVGHFPNSAA RAGGLAFAGQ ALDGLALPSL MRKCLAAIRP AMTA
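
The gene and protein sequences above can be cross-checked against each other. The following is a rough sketch, not the database's own method: it strips the spacing between 10-base blocks, computes GC content, and translates with the codon assignments of translation table 11. Table 11 uses the standard amino acid assignments, and this sketch assumes an initiating GTG (or any other table 11 start codon) is read as M. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for table 11; assumed here to be read as M at position 0.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGCCCCCTT CATCCGCCGA CCTGCTGCAT"))
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and the GC content field.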