Gene Information |
Locus tag | Sare_4708 |
Symbol | |
ID | 5707217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5329788 |
End bp | 5330687 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274106 |
Product | HhH-GPD family protein |
Protein accession | YP_001539452 |
Protein GI | 159040199 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
| |
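The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(5329788, 5330687)
print(length_bp)                  # 900
print(protein_length(length_bp))  # 299
```

Both results match the Gene Length (900 bp) and Protein Length (299 aa) fields in the record.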
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.109375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0140541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC CGACCTTCGC CACCCTGGTC AGCCGGTGGT ACACACACCA TGCCCGGGAT CTGCCGTGGC GGCACCCCGG CGTCGGCGCC TGGGCCATCC TCGTCAGTGA GGTGATGCTC CAGCAGACCC CGGTCGCACG GGTGATACCA GCCTGGACGG CCTGGCTGTC CCGCTGGCCG ACGGCGGCTG ACCTGGCGGC GGAGCCACCG GCGGAGGCGA TCCGCATGTG GGGGCGGCTG GGCTACCCAC GGCGGGCGGT CCGGTTACGC GAGTGCGCGG TGGCGATGGT GGAACGGCAC GGTGGGCAGG TACCGGACCG GTTGGAGCAA CTGTTGGCCC TGCCGGGGGT CGGTACGTAC ACGGCACGGG CCGTGGCCGC GTTCGCGTAC GGACAACGGC ATCCCGTCGT GGACACCAAC GTACGGCGGG TGATCTGCCG GGCGGTCGCG GGTGAACCCG ATGCCGGCCC GGCCACCCGC CCGGCCGATC TTGCCGCCAC CGAGGAGCTA CTCCCGACTG AACCGGCCGC CGCCGCGCTG GCCAGCGCGG CGTTCATGGA GCTGGGGGCG GTGGTCTGCA CAGCTCGGTC GCCGCGTTGC GGGAGTTGCC CGGTCACGTC GATCTGTGCC TGGCGGGCCT CCGGGCAGCC GGCGCCGGCC GGCCCCACCC GACGACCTCA GCGGTACGCC GGCACCGACC GTCAGGTCCG TGGTCTGCTG CTCGGCGTCC TTCGGGAGGC GACCGCCCCC GTGTCCAGAC ACCGTCTGGA CCAGGTGTGG ACCGACAACG TGCAGTGCGT CCGGGCGCTC ACCGGCCTGG TCAAGGATGG CCTCGTGGAG CAGGTCGACG AGACGTCCTT CCGGCTGGCC GGGGACGGCC CGCCAATCCT CGTTCCCTGA
|
Protein sequence | MTEPTFATLV SRWYTHHARD LPWRHPGVGA WAILVSEVML QQTPVARVIP AWTAWLSRWP TAADLAAEPP AEAIRMWGRL GYPRRAVRLR ECAVAMVERH GGQVPDRLEQ LLALPGVGTY TARAVAAFAY GQRHPVVDTN VRRVICRAVA GEPDAGPATR PADLAATEEL LPTEPAAAAL ASAAFMELGA VVCTARSPRC GSCPVTSICA WRASGQPAPA GPTRRPQRYA GTDRQVRGLL LGVLREATAP VSRHRLDQVW TDNVQCVRAL TGLVKDGLVE QVDETSFRLA GDGPPILVP
|
| |
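The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute GC content, and run a basic sanity check on a coding sequence. The helper names are hypothetical. The start-codon check assumes an ATG start, which is a simplification, since translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at end."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Usage on the first two blocks of the gene sequence above (a fragment only).
fragment = clean_sequence("ATGACTGAAC CGACCTTCGC")
print(len(fragment), gc_percent(fragment))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared against the Gene Length field and `gc_percent` against the reported GC content, which the record gives only as a rounded value.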