Gene Information |
Locus tag | Htur_2122 |
Symbol | |
ID | 8742722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2193012 |
End bp | 2193971 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646512704 |
Product | HhH-GPD family protein |
Protein accession | YP_003403678 |
Protein GI | 284165399 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
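The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2193012
END_BP = 2193971
GENE_LENGTH_BP = 960
PROTEIN_LENGTH_AA = 319


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 960
    print(expected_protein_length(GENE_LENGTH_BP))  # 319
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.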
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGA GCGACGATGA TGCCGACGCC GGTTCGGGTA CGGGCGACGG GGACTGGTCG CTCCCCAACG ACCACGAGTC CGTCCGCGAG GCCCTGATCG CGTGGTACGA GGACGGCCAC CGCGAGTTCC CGTGGCGGCG GACCGACGAC CCCTACGAGA TCCTCGTCAG CGAGGTGATG AGCCAGCAGA CCCAACTGGA CCGCGTCGTC GAGGCCTGGG AAGGGTTCCT CGAGCGCTGG CCGACGACCG CGGCGCTGGC GGACGCCGAC CGAGCGGACG TCGTCGGCTT CTGGACGGAC CACAGCCTCG GCTACAACAA CCGGGCGAAG TACCTCCACG AGGCGGCCGG GCAGGTGGAA AACGAATACG ACGGCGAGTT CCCGACCGCG CCCGACGAAC TGCAGGAGCT CATGGGCGTC GGCCCCTACA CCGCCAACGC CGTGGCGAGT TTCGCCTTCA ACAACGGCGA CGCGGTCGTC GACACGAACG TCAAGCGTGT CGCCTACCGC GCGTTTTCGA TCCCCGACGA CGACGCGGCA TTCGAGGCGG CGGCGAGCGA GCTCATGCCC GACGGCGAGT CGCGAGTCTG GAACAACGCG ATCATGGAAC TGGGCGGCGT CGCCTGCACG CAGACACCGA AGTGTGACGA GGTCGGCTGC CCCTGGCGCG AGTGGTGTGA CGCCTACGCC AGCGGCGACT TCACCGCGCC GGACGTCCCG ACACAGCCCT CCTTCGAGGG GAGTCGCCGT CAGTTCCGCG GCCGCGTGAT CGGCACCCTG CGGGAGTACG ACGAACTCGA GTTGGACACC CTGGGCCATC GCATTCGCGT CGATTACGCA CCCGACGGCG AGTACGGACG CGAGTGGCTC ACGGGGCTGC TCGAAGACCT CGAGTCGGAC GGGTTAGTCG ACCTCGAGAC GGGCGAAGAC GGGGCGCTCG TGGCCCGTCT CCGTCGATAG
|
Protein sequence | MSESDDDADA GSGTGDGDWS LPNDHESVRE ALIAWYEDGH REFPWRRTDD PYEILVSEVM SQQTQLDRVV EAWEGFLERW PTTAALADAD RADVVGFWTD HSLGYNNRAK YLHEAAGQVE NEYDGEFPTA PDELQELMGV GPYTANAVAS FAFNNGDAVV DTNVKRVAYR AFSIPDDDAA FEAAASELMP DGESRVWNNA IMELGGVACT QTPKCDEVGC PWREWCDAYA SGDFTAPDVP TQPSFEGSRR QFRGRVIGTL REYDELELDT LGHRIRVDYA PDGEYGREWL TGLLEDLESD GLVDLETGED GALVARLRR
|
| |
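As a rough way to relate the gene sequence to the protein sequence and the GC content field, the following sketch strips the 10-base spacing used in the record, computes the GC fraction, and translates codon by codon. It assumes an unambiguous A/C/G/T sequence read in frame from the first base. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so a standard codon table is used here. The helper names are hypothetical, and only the first 30 bases and the last 6 bases of the gene are embedded as examples.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence above.
GENE_START = "ATGAGTGAGA GCGACGATGA TGCCGACGCC"
GENE_END = "CGATAG"

if __name__ == "__main__":
    print(translate(GENE_START))  # MSESDDDADA
    print(translate(GENE_END))    # R
    print(gc_content(GENE_START))
```

The translated prefix matches the first ten residues of the protein sequence, and the final codon TAG is read as a stop. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and protein sequence fields.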