Gene Aave_3604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_3604 
Symbol 
ID4666754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp3976291 
End bp3977382 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID639824796 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_971929 
Protein GI120612251 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.211751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00640648 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCGCG AGGTCCCCGA CATCGCCACC GAAGTGGTGC GCTGGCAGGC CGTGCACGGC 
CGCAACCACC TGCCGTGGCA GCAGACGCGC GACCCCTACC GGGTCTGGCT GTCCGAAATC
ATGCTGCAGC AGACGCAGGT CAACACGGTG CTGGACTATT ACACCCGGTT CCTGGAGCGG
TTCCCCGACG TGCGCGCCCT GGCCGCGGCG CCGGAGGACG ACGTCATGGC CCTCTGGAGC
GGGCTGGGCT ACTACAGCCG CGCCCGCAAC CTGCACCGCT GCGCCAGGGA GGTCGTGGAT
CGGTACGGCG GGGAATTTCC GCGCTCCGCC GAGGCCCTGG CCGGCCTGCC TGGCATCGGC
CGTTCCACGG CCGGCGCGAT CGCCTCCTTC TGCTTCGCGG AGCGCGTGCC CATTCTGGAC
GCCAATGTCC GGCGGGTGCT CACGCGGGTG CTCGGCTTCG ATGCCGACCT GGCCGTCGCC
CGCAACGAGC GTGACCTGTG GGACCGTGCC AGCGAACTCC TGCCGCACGA CGATCTGCAG
GAGGCCATGC CCCGCTACAC GCAGGGCCTG ATGGATCTGG GCGCGAGCCT CTGCACGCCC
CGCAAGCCCG CCTGCATTCT CTGCCCCCTG CAACCGCAAT GCGTGGCCGC CGTGGCCGGC
AATCCCGAGG ATTACCCCGT GCGCACGCGC AAGCTGCTGC GGCGGGCGCA GGCATGGTGG
TTTCCGCTGC TGCACGACGG CGAGGGGCGC CTCTGGCTGC AGCGCAGGCC TTCCGAGGGC
ATCTGGGCCG GCCTGCATTG CCCGCCCATG TTCGACAGCC GGGAGGATGC GCTGCAATGG
CTCGCGCAGC GCGGCGCGGG CCGCACGCCG CGGGAACTGG ACACCGTGTT CCATGTCCTC
ACGCACCGGG ACCTGCACCT GCATCCCCTG CTGGTGCGCG GGCCGGAAAC TGCCGCGCCC
GGCCAGGCCG AAGCGGCGCA GGAGGGCGGC TGGTACACAG CCGCGCAATG GAAGGCGCTG
GGATTGCCGG CCCCCGTGCG CAAGCTGCTG GAACAGTTGC AGCTGCCCGC TGCGGGAGCC
GTGGAGGCCT GA
 
Protein sequence
MKREVPDIAT EVVRWQAVHG RNHLPWQQTR DPYRVWLSEI MLQQTQVNTV LDYYTRFLER 
FPDVRALAAA PEDDVMALWS GLGYYSRARN LHRCAREVVD RYGGEFPRSA EALAGLPGIG
RSTAGAIASF CFAERVPILD ANVRRVLTRV LGFDADLAVA RNERDLWDRA SELLPHDDLQ
EAMPRYTQGL MDLGASLCTP RKPACILCPL QPQCVAAVAG NPEDYPVRTR KLLRRAQAWW
FPLLHDGEGR LWLQRRPSEG IWAGLHCPPM FDSREDALQW LAQRGAGRTP RELDTVFHVL
THRDLHLHPL LVRGPETAAP GQAEAAQEGG WYTAAQWKAL GLPAPVRKLL EQLQLPAAGA
VEA