Gene Hlac_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2002 
Symbol 
ID7402021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1995972 
End bp1996964 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content70% 
IMG OID643709073 
ProductHhH-GPD family protein 
Protein accessionYP_002566650 
Protein GI222480413 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0546697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.289046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT CGGACGCGAC CGCCGTAGCG GGGGCCGGCG TCGACGGCGA CGCCCCCGAG 
CTGCCGGCCG ATCTCGATGC GGTCCGCGAT GCACTCGTCG ACTGGTACGA GGCGGACCAC
CGCGAGTTCC CGTGGCGACG CACAGAGGAC CCCTACGAGA TCCTCGTCAG CGAGGTGATG
AGCCAGCAGA CACAGCTCGA CCGAGTCGTC CCCGCGTGGG AGGACTTCGT CGAGGAGTGG
CCGACGACCG AGGAGTTGGC CGAGGCCGAC CGCGGCGGCG TGGTCGCGTT CTGGTCCGAC
CACTCGCTCG GCTACAACAA CCGCGCGAAG TACCTCCACG AGGCCGCCGG ACAGGTAGAA
GGGGAGTACG GCGGGACGTT CCCGGAGACG CCCGAGGAAC TACAGGAGCT GATGGGCGTC
GGCCCGTACA CCGCGAACGC GGTGGCGTCG TTCGCGTTCG ACAACGGCGA CGCCGTCGTC
GACACCAACG TGAAGCGGGT GCTCCACCGC GCGTTCGCGG TCCCGGACGA CGACGCGGCG
TTCGCGCAGG TCGCATCGGA CGTGATGCCC GACGGCGAGT CCCGTATCTG GAACAACGCG
ATCATGGAGC TCGGTGGCGT CGCCTGCGGG ACGACCCCGC GGTGTGACGA GGCCGGCTGT
CCGTGGCGGA GATGGTGTCA CGCCTACGAA ACCGGCGACT TCACCGCGCC CGACGTGCCC
GAGCAGCCGA GCTTCGAGGG AAGTCGTCGG CAGTTCCGAG GTCGGATCGT CCGACTCCTC
GGCGAGTACG ACGAGCTGGC GCTCGACGAT CTCGGCCCCC GCGTCCGAGT CGACTATTCG
CCCGACGGCG AGCACGGCCG AGAGTGGCTG CGCGGGCTCG TCGACGACCT CGCGGACGAC
GGGCTCGTGG CGATCGAAGA GCGCGCAGGG GCGGACGAAG GGCGTTCGGC GGACGACGGA
GCGAGCGAGG TCGTCGTCTC TCTGCGGCGG TGA
 
Protein sequence
MTDSDATAVA GAGVDGDAPE LPADLDAVRD ALVDWYEADH REFPWRRTED PYEILVSEVM 
SQQTQLDRVV PAWEDFVEEW PTTEELAEAD RGGVVAFWSD HSLGYNNRAK YLHEAAGQVE
GEYGGTFPET PEELQELMGV GPYTANAVAS FAFDNGDAVV DTNVKRVLHR AFAVPDDDAA
FAQVASDVMP DGESRIWNNA IMELGGVACG TTPRCDEAGC PWRRWCHAYE TGDFTAPDVP
EQPSFEGSRR QFRGRIVRLL GEYDELALDD LGPRVRVDYS PDGEHGREWL RGLVDDLADD
GLVAIEERAG ADEGRSADDG ASEVVVSLRR