Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2201 |
Symbol | |
ID | 8411740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2113978 |
End bp | 2114895 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645020543 |
Product | HhH-GPD family protein |
Protein accession | YP_003178021 |
Protein GI | 257388248 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGC TGCCGGGCGC AGTCCCCGAG AACCGCGAGG CCGTCCAGCG GGCCCTGATC GAGTGGTACG AGGCCGACCA CCGGTCGTTT CCCTGGCGCG AGACCGACGA CGCCTACGAG ATCCTCGTCT CCGAGGTGAT GAGCCAGCAG ACCCAGCTGG GCCGCGTCGT CGAGGCCTGG CGGGCGTTCC TCGATCGGTG GCCCGACGCC GAGGCGCTGG CGGCCACCGA CCAGTCCGAC GTGGTCGCGT TCTGGACGGC CCACTCGCTG GGGTACAACA ACCGGGCGAA GTACCTCCAC ACCGCCGCGA ACCAGATCAT CGACGAGTGG GACGGCGCGT TCCCCGAGAC GCCCGCCGAA CTGCAGGAGC TCCACGGCGT CGGCCCCTAC ACCGCCAACG CGGTCGCCTC GTTCGCGTTC AACGCGGGCG ACGCCGTCGT CGACACGAAC GTCAAGCGCG TCCTCCACCG CGCGTTCGAC GTGCCCGACG ACGACGCGGC GTTCGAGGAA GTCGCGGGGG CGCTCATGCC CGACGGACGG TCACGGATAT GGAACAACGC GATCATGGAG CTCGGCGGGG TCGCCTGCGA GAAGACGCCG GCCTGCGACG CCGCCGGCTG CCCGTGGCGC GAGTGGTGTC ACGCCTACGA CACCGGCGAC TTCACTGCGC CGGACGTGCC CACCCAGCCC GATTTCGAGG GGAGCCGTCG GCAGTTCCGC GGGCGGATCG TCAACGTCCT CGGCGAGTAC GACCGCCTCG CCTTGGACGA CCTCGGCCCG CGCGTGCGAG TCGACTACGC GCCCGAGGGC GAACACGGCC GAGAGTGGCT CCGCGGCCTC GTCGAGGACC TCGCGGACGA CGGCCTCGTC AGCGTCGAGG ACGGCGACGA GTCGCTGGTG GTCGGTCTCA GCGAGTGA
|
Protein sequence | MTELPGAVPE NREAVQRALI EWYEADHRSF PWRETDDAYE ILVSEVMSQQ TQLGRVVEAW RAFLDRWPDA EALAATDQSD VVAFWTAHSL GYNNRAKYLH TAANQIIDEW DGAFPETPAE LQELHGVGPY TANAVASFAF NAGDAVVDTN VKRVLHRAFD VPDDDAAFEE VAGALMPDGR SRIWNNAIME LGGVACEKTP ACDAAGCPWR EWCHAYDTGD FTAPDVPTQP DFEGSRRQFR GRIVNVLGEY DRLALDDLGP RVRVDYAPEG EHGREWLRGL VEDLADDGLV SVEDGDESLV VGLSE
|
| |