Gene Achl_2238 details


Gene Information

Locus tag: Achl_2238
Symbol:
ID: 7293706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2513076
End bp: 2514047
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 67%
IMG OID: 643590640
Product: formamidopyrimidine-DNA glycosylase
Protein accession: YP_002488292
Protein GI: 220912983
COG category: [L] Replication, recombination and repair
COG ID: [COG0266] Formamidopyrimidine-DNA glycosylase
TIGRFAM ID: [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg)
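
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2513076
end_bp = 2514047

# Inclusive span of the gene on the replicon.
gene_length = end_bp - start_bp + 1

# Number of complete codons, and any leftover bases.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 972 0 323
```

Under these assumptions the result agrees with the listed Gene Length (972 bp) and Protein Length (323 aa).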


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000000000837179
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCTGAAC TGCCCGAGGT GGAGGTGGTC CGCCGCGGCC TGGTGAGCTG GGTCCGCGGC 
AGGACCATCG AGTCTGTCGA TGTGCTGGAC CCCCGCTCCA TCCGCCGCCA CGCCCTCGGT
GTGGAGGACT TCATCGGCAA CCTGCAGGGT GCCACCGTGT CTGATGTCGT CCGGCGCGGC
AAATTCCTGT GGCTGCCGTT AGTCGATGGT TCCGCCCACC AGCAGGCGGC AGGCAACGGA
CAAGCCCCAC CCTCGCGGGT GGCGCTGATG GCCCACCTTG GCATGAGCGG GCAACTGCTC
ATGCAGGACG CGGGCGTTCC GGACGAAAAG CACCTGAAAG TCCGGCTCCA TCTCAGCCCC
AGTGCCGGCA TGCCTGGCCA GCTGCGCTTC GTGGACCAGC GCATTTTCGG TGGGCTCTTT
GTCACCGCGT TGGTTCCCAC CGACGACGGC GGCCCCGGCG GCCTCGCGGA GTCGCCCCTT
CCGCTGATCG CCGGGGAGGC GTCACACATT GCCCGGGATC CGCTGGACCC GGCCTTCTCA
TTCGACCGTT TCTACCAGCG CCTGCGTGCG CGGAAAACCG GGTTGAAACG GGCGCTCCTG
GACCAGGGGC TGGTTTCGGG AATCGGTAAC ATCTACGCCG ATGAGGCCCT CTGGCGTGCA
CGGTTGCACT ACGCGCGTCC CACCGACAAG CTGCGGCGGG CTGACGCCTT CCGGCTTATC
GACAGCGCCC GGGCGGTCAT GCTCGATGCC CTGGACGCCG GCGGAACCAG CTTCGATTCC
CTGTACGTAA ACGTCAACGG GGCCTCCGGA TATTTTGACC GGTCGCTCAA TGCCTACGGG
CGCGAAGGCG AGCCCTGCAA ACGGTGCACG GCTGCGGGAA TCCACGCCAC CATCCGCCGT
GAACAGTTCA TGAACCGGTC CTCCTACACG TGCCCGGTAT GCCAGCCGCG GCCCCGCAAC
GGACGCTGGT AA
 
Protein sequence
MPELPEVEVV RRGLVSWVRG RTIESVDVLD PRSIRRHALG VEDFIGNLQG ATVSDVVRRG 
KFLWLPLVDG SAHQQAAGNG QAPPSRVALM AHLGMSGQLL MQDAGVPDEK HLKVRLHLSP
SAGMPGQLRF VDQRIFGGLF VTALVPTDDG GPGGLAESPL PLIAGEASHI ARDPLDPAFS
FDRFYQRLRA RKTGLKRALL DQGLVSGIGN IYADEALWRA RLHYARPTDK LRRADAFRLI
DSARAVMLDA LDAGGTSFDS LYVNVNGASG YFDRSLNAYG REGEPCKRCT AAGIHATIRR
EQFMNRSSYT CPVCQPRPRN GRW
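
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence contains only A, C, G and T, and that the first codon is read as methionine when it is one of the common bacterial initiators ATG, GTG or TTG (translation table 11 permits a few more start codons, which are omitted here). The helper names `clean`, `gc_fraction` and `translate` are hypothetical and are not part of any database tool. Only the first 60 bases of the gene are used to keep the example short.

```python
# Codon-to-amino-acid mapping in the conventional TCAG order.
# Table 11 shares this mapping with the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a reduced set of initiators; table 11 lists additional ones.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an alternative start codon is still read as Met
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
first_row = "GTGCCTGAAC TGCCCGAGGT GGAGGTGGTC CGCCGCGGCC TGGTGAGCTG GGTCCGCGGC"

print(translate(first_row))  # prints: MPELPEVEVVRRGLVSWVRG
print(gc_fraction(first_row))
```

The translated fragment matches the first 20 residues of the protein sequence, with the leading GTG read as M. Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content of 67%; the 60-base fragment alone is not representative of the whole gene.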