Gene Information |
Locus tag | Huta_2939 |
Symbol | |
ID | 8385248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 3026899 |
End bp | 3027819 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644974017 |
Product | HhH-GPD family protein |
Protein accession | YP_003131833 |
Protein GI | 257054000 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
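The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, but its values are consistent with both. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3026899, 3027819)
print(length)                  # 921
print(protein_length(length))  # 306
```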
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.223861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGG CCCTCGAGAA TCTACCAGCC GATCAGGGTG CTATCCAGCG TGCGCTTATC GAGTGGTACC AGGACGACCA CCGCGAGTAC CCTTGGCGGG AGACGGATGA CCCCTACGCG ATCCTCGTCT CGGAGGTCAT GAGCCAGCAG ACACAACTCG ATCGTGTCGT CGACGCCTGG GACGACTTCC TCGATCGCTG GCCCACGGTC GCAGACCTCG CAGATGCCGA CCGAGCCGAT GTGGTGGGCT TCTGGTCGGA TCACAGCCTC GGGTACAACA ATCGGGCGAA GTACCTCCAC GAGGCCGCGA CCCAGATCGT CGAGGAGTAC GACGGCGCGT TCCCCGAGTC GCCCGACGAA CTGTCCGAAC TCATGGGCGT CGGTCCCTAC ACCGCCAACG CGGTCGCGAG TTTCGCGTTC AACAACGGCG ACGCTGTCGT CGACACCAAC GTCAAGCGGG TGCTGTATCG GGCCTTTTCG ATCCCCGACG AGGACGCGGC TTTCGAGGAC GCTGCAAGCA CGCTCATGTC GGAGGGAGAG TCCCGGGTCT GGAACAACGC GATCATGGAA CTCGGCGGCG TGGCCTGCGA GAAGACGCCG CGCTGTGACG CGGCGGGCTG TCCCTGGCGA GAGTGGTGTG ACGCCTACGC GAACGGGGAC TTTTCGGCAC CCGACGTGCC CGAACAGTCG ACCTTCGAAG GGAGTCGCCG CCAGATGCGC GGGCGGGTGA TCGCCGCGTT AAAGGAACAC GATCACCTCG CGATCGACAA CCTCGGCCCG AAAGTGCGAG TTGACTACGC GCCGGAGGCT GACGCCGAGG CCGATCGGGA GTGGCTCCGG GACCTCCTCG AAGACCTAGC CGATGATGGG CTTGTCGAGA TCGAGGAAGG GAGCGATCAA CCGATCGCTC GACTCCGATA G
|
Protein sequence | MSEALENLPA DQGAIQRALI EWYQDDHREY PWRETDDPYA ILVSEVMSQQ TQLDRVVDAW DDFLDRWPTV ADLADADRAD VVGFWSDHSL GYNNRAKYLH EAATQIVEEY DGAFPESPDE LSELMGVGPY TANAVASFAF NNGDAVVDTN VKRVLYRAFS IPDEDAAFED AASTLMSEGE SRVWNNAIME LGGVACEKTP RCDAAGCPWR EWCDAYANGD FSAPDVPEQS TFEGSRRQMR GRVIAALKEH DHLAIDNLGP KVRVDYAPEA DAEADREWLR DLLEDLADDG LVEIEEGSDQ PIARLR
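The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is printed in coding orientation (it begins with ATG and ends with TAG, even though the gene is on the minus strand) and that the space-separated blocks of ten are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so a standard codon table is used here. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGTGAGG CCCTCGAGAA TCTACCAGCC"))  # MSEALENLPA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence field once the spaces in the latter are removed, and `gc_content` can be compared against the GC content field.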