Gene Acry_1502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1502 
Symbol 
ID5159866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1663732 
End bp1664775 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content68% 
IMG OID640553415 
ProductHhH-GPD family protein 
Protein accessionYP_001234629 
Protein GI148260502 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTCGGT CCCTGCCATC CGCCGAAAAT CTGCTCCGCT GGTATCATGT TCACCGGCGC 
ATCCTGCCTT GGCGCGCCGG CCCGGGTACC CTGCCCGATC CCTATCATGT CTGGCTGAGC
GAAATCATGC TGCAGCAGAC AGTCGTAGCG ACTGTCATAC CTTATTTCCA TCGCTTCATC
GAGCGTTTCC CCACGATCAG CGACCTCGCG GTCGCGGTGG ATGATGAGAT TCTGGGCCTG
TGGGCGGGGC TTGGCTATTA CGCGCGGGCA CGCAACCTGA TCCGCTGCGC GAGGGCCGTC
GCCGAGGCGG GCGGGTTTCC CGTCACGCTC GACGGGCTAC GTGCGCTGCC CGGCATCGGC
CCTTATACGG CTGCGGCGAT CGGCGCGATC GCCTTCGATA TTCCGGTGGT TCCGGTGGAC
GGCAATGTCG AGCGGGTTAC CGCCAGGATG TTCGCGATCG AGGAGGCGTT GCCCGCGGCG
AAGGACGCGA TTGCGGTCGC CGCTGCCCGC CTTGGCGCGC AGGCGGCAGC GCAATCCAGC
CCAGGTGACT TTGCGCAGGC ATTGTTCGAT CTCGGAGCCA CCGTCTGCAC GCCGCGCAGT
CCATCATGCA TGGTCTGCCC GTGGCGCGAC GGATGCGCGG CACATGCCCG GGGGCTGTCC
GCCGACCTGC CGCGCAAGGC GAAGCGCGCG GCGCGGCCCG TGCGGCGCGG CACCGTGTTC
GTGATGCAGG ATCGATCCGG CATGATTGGC CTGCGCCGGC GGCCACCACG CGGATTGCTC
GGAGGGATGC TGGAGGTGCC GGGCACGGAT TGGGAGGCGA CAGCTCCGCC CCCGGTGCCG
CCATGCGCCG CGCATTGGCT TGATGCCGGC ACGATCATTC ACGTTTTCAC CCATTTCGAG
TTGCGCCTCA CCGTGAAGGC GGGCCGCGTC GCGGCGCTAC CCGGCGGGAT CGTCGCCGCG
CCGCCCGATA CGCCTCTGCC GACCGTGATG CGCAAGGCGC TGGAGGCCGG GCTTGCTGTT
CTCGATGAGC GGTCGCCGAA ATAA
 
Protein sequence
MSRSLPSAEN LLRWYHVHRR ILPWRAGPGT LPDPYHVWLS EIMLQQTVVA TVIPYFHRFI 
ERFPTISDLA VAVDDEILGL WAGLGYYARA RNLIRCARAV AEAGGFPVTL DGLRALPGIG
PYTAAAIGAI AFDIPVVPVD GNVERVTARM FAIEEALPAA KDAIAVAAAR LGAQAAAQSS
PGDFAQALFD LGATVCTPRS PSCMVCPWRD GCAAHARGLS ADLPRKAKRA ARPVRRGTVF
VMQDRSGMIG LRRRPPRGLL GGMLEVPGTD WEATAPPPVP PCAAHWLDAG TIIHVFTHFE
LRLTVKAGRV AALPGGIVAA PPDTPLPTVM RKALEAGLAV LDERSPK