Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2002 |
Symbol | |
ID | 7402021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1995972 |
End bp | 1996964 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643709073 |
Product | HhH-GPD family protein |
Protein accession | YP_002566650 |
Protein GI | 222480413 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0546697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.289046 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT CGGACGCGAC CGCCGTAGCG GGGGCCGGCG TCGACGGCGA CGCCCCCGAG CTGCCGGCCG ATCTCGATGC GGTCCGCGAT GCACTCGTCG ACTGGTACGA GGCGGACCAC CGCGAGTTCC CGTGGCGACG CACAGAGGAC CCCTACGAGA TCCTCGTCAG CGAGGTGATG AGCCAGCAGA CACAGCTCGA CCGAGTCGTC CCCGCGTGGG AGGACTTCGT CGAGGAGTGG CCGACGACCG AGGAGTTGGC CGAGGCCGAC CGCGGCGGCG TGGTCGCGTT CTGGTCCGAC CACTCGCTCG GCTACAACAA CCGCGCGAAG TACCTCCACG AGGCCGCCGG ACAGGTAGAA GGGGAGTACG GCGGGACGTT CCCGGAGACG CCCGAGGAAC TACAGGAGCT GATGGGCGTC GGCCCGTACA CCGCGAACGC GGTGGCGTCG TTCGCGTTCG ACAACGGCGA CGCCGTCGTC GACACCAACG TGAAGCGGGT GCTCCACCGC GCGTTCGCGG TCCCGGACGA CGACGCGGCG TTCGCGCAGG TCGCATCGGA CGTGATGCCC GACGGCGAGT CCCGTATCTG GAACAACGCG ATCATGGAGC TCGGTGGCGT CGCCTGCGGG ACGACCCCGC GGTGTGACGA GGCCGGCTGT CCGTGGCGGA GATGGTGTCA CGCCTACGAA ACCGGCGACT TCACCGCGCC CGACGTGCCC GAGCAGCCGA GCTTCGAGGG AAGTCGTCGG CAGTTCCGAG GTCGGATCGT CCGACTCCTC GGCGAGTACG ACGAGCTGGC GCTCGACGAT CTCGGCCCCC GCGTCCGAGT CGACTATTCG CCCGACGGCG AGCACGGCCG AGAGTGGCTG CGCGGGCTCG TCGACGACCT CGCGGACGAC GGGCTCGTGG CGATCGAAGA GCGCGCAGGG GCGGACGAAG GGCGTTCGGC GGACGACGGA GCGAGCGAGG TCGTCGTCTC TCTGCGGCGG TGA
|
Protein sequence | MTDSDATAVA GAGVDGDAPE LPADLDAVRD ALVDWYEADH REFPWRRTED PYEILVSEVM SQQTQLDRVV PAWEDFVEEW PTTEELAEAD RGGVVAFWSD HSLGYNNRAK YLHEAAGQVE GEYGGTFPET PEELQELMGV GPYTANAVAS FAFDNGDAVV DTNVKRVLHR AFAVPDDDAA FAQVASDVMP DGESRIWNNA IMELGGVACG TTPRCDEAGC PWRRWCHAYE TGDFTAPDVP EQPSFEGSRR QFRGRIVRLL GEYDELALDD LGPRVRVDYS PDGEHGREWL RGLVDDLADD GLVAIEERAG ADEGRSADDG ASEVVVSLRR
|
| |