Gene Arth_0177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0177 
Symbol 
ID4447370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp180855 
End bp181898 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content71% 
IMG OID639687972 
ProductHhH-GPD family protein 
Protein accessionYP_829678 
Protein GI116668745 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGGCACC ACCACGCCGG GAGGCGGACC GCGGGCCATA TCCACTACAG TATTTGCATC 
GAAGCCTTAG CCGTACCCAC TGCCCTGCCC CCGGCGAGTC GCCCCGCATT ACCGCCGCTT
GCCGCCCTCC ATGACGCCCT TGACGACTGG TTCGGAACCA CAGCCAGGGA TCTGCCGTGG
CGCGACCCCG AGTGCTCCCC GTGGGGTGTC CTGGTCAGCG AGATTATGCT CCAGCAGACG
CCCGTTGTCA GGGTCCTGCC CGTCTGGGAA GACTGGCTCC GCCGCTGGCC GTCGCCGGCG
CACCTGGCGA CCGAGGCCTC CGGCGAGGCA GTCCGGCACT GGGGCAGGCT TGGCTATCCC
CGGCGGGCCC TGCGCCTGCA TGCAGCCGCC GTCGCCATCG TGGAGAAGCA CGACGGCGGC
GTGCCGGGAA CGTACGACGA ACTGCTGGAA CTCCCCGGGG TGGGCAGCTA CACGGCGGCC
GCCGTCGCCG CCTTCGCCTT TGGCCGCCGC GAAACCGTGG TGGACACCAA CATCCGCCGC
GTCCACGCGC GGCTCTTTTC CGGCACCGCA CTGCCCTCGC AGTCACTGAC AGCGGCCGAA
ATGCGACTGG CCGCCGAACT GCTGCCGGCC GACGTCGGAC TCTCCGTCCG CTGGAACGCG
GCGGTCATGG AGCTGGGGGC ACTGGTCTGC ACGGCGAGGG CGCCGAAGTG CGGTGAATGC
CCTGTGCGGG GGGCGTGCGC GTGGCTGGCG GCCGGCGAGC CACCGCCGTC GTACACCCCG
AAGGGCCAGT CCTGGCACGG CACCGACCGG CAGGTACGGG GAGCCGTGAT GGCCGTCCTC
CGGCTGGCTG ACGCACCGGT GGCTCCGGAC ATGTTCCATC AGCCCGCCGC GGACCTTGGC
TTCGAAGCCG AAGGCATCGG TGTTCCGCTG GCAGCGCTGC ACCGGCTGAA CTCCGCACCC
GAGCAGCTGG AGCGCGCCCT GGCCGGACTG GTCAGCGACG GCCTGGCGGA ACTGCACCCG
GCCGGCCTGA CGCTGCCCGC CTGA
 
Protein sequence
MGHHHAGRRT AGHIHYSICI EALAVPTALP PASRPALPPL AALHDALDDW FGTTARDLPW 
RDPECSPWGV LVSEIMLQQT PVVRVLPVWE DWLRRWPSPA HLATEASGEA VRHWGRLGYP
RRALRLHAAA VAIVEKHDGG VPGTYDELLE LPGVGSYTAA AVAAFAFGRR ETVVDTNIRR
VHARLFSGTA LPSQSLTAAE MRLAAELLPA DVGLSVRWNA AVMELGALVC TARAPKCGEC
PVRGACAWLA AGEPPPSYTP KGQSWHGTDR QVRGAVMAVL RLADAPVAPD MFHQPAADLG
FEAEGIGVPL AALHRLNSAP EQLERALAGL VSDGLAELHP AGLTLPA