Gene Information |
Locus tag | Apar_0121 |
Symbol | |
ID | 8412965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 135534 |
End bp | 136535 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645021689 |
Product | HhH-GPD family protein |
Protein accession | YP_003179148 |
Protein GI | 257783931 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | |
|
|
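The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 135534
end_bp = 136535
gene_length_bp = 1002
protein_length_aa = 333

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1002 bp is 334 codons, of which 333 encode amino acids.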
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000756832 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.180452 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAATTTCACA TACAAAAACA AACGAAGACC TGGATACGGT GTCTGACATT TCAACATCTT TATCCGATGA GCTTGCAGTT ACATTTAAAA AGACAGAGCT TACGACAGAG CTACGTGCGT TTGTCGAGTC TGTGGCCAAA AAAGGCCGCG AGCTGTATCG CGATTTGCCT TGGCGTCGCA CGTACGATCC ATATGCCATT TGGATTTCTG AGGTCATGCT TCAGCAGACG CAGGTTAGTC GCGTAGATGG TCGCTGGCAG AGATGGTTGG AACACTTTCC AACGGTTGAT GCGCTGGCTG CTGCCGCGCC TTCAGATGTA CTTGAAGAAT GGCAGGGCCT GGGCTACAAC CGTCGAGCTT TGTCTGTACA TCGAGCTGCT CAAGCAATTT CTGAAGCAGG CGGAGTCTTT CCACAAGATC AAAAGGAGCT CGTAAAGCTT CCAGGCATTG GTCCCGCTAC TGCAGCAGGT ATTCGCGCGT TTGCGTTCAA TCTGCATGGC GTTTATTTGG AGACTAACGT TCGTACGGTT TTCTTGCATG AGCTTTACCC GCAGGCAGAA GGAGTGCCAG ACTCTGAGCT TATTCCTCTT GTTGAGCTGA CGTGCCCTGC GAGTGTTTCT ACCGCAGCGG GCACTGACAC AGCAAACGCT GCTACAACGG AACTCACGCC GCGTAGCTGG TACTACGCCC TTCTCGACTA TGGCGCGTAC CTGAAGAAAA CTATTCCCAA TCCTTCACGA AGGTCTAAAA GCCACGTCAA ACAGTCTCGC TTTGAGGGCT CTCATCGGCA GAAGCGTGCT GAGCTTTTAC GCGTTCTTCT TGCCCACAAA GATGAGGGTG GAGCAGAGTT TGAGACACTT CATCAGGAAC TCTGTCAGAT TGAGGTCCAT GCCGGGCGAG AAACCCTTGA TGAGCAGGTT ACCCTTGGCT TACTTGAAGA ACTTGCGAAG GAGGGCTTCT GTCAGAAAAA TGATGAATAT TGGTTGCCAT AA
|
Protein sequence | MKKKISHTKT NEDLDTVSDI STSLSDELAV TFKKTELTTE LRAFVESVAK KGRELYRDLP WRRTYDPYAI WISEVMLQQT QVSRVDGRWQ RWLEHFPTVD ALAAAAPSDV LEEWQGLGYN RRALSVHRAA QAISEAGGVF PQDQKELVKL PGIGPATAAG IRAFAFNLHG VYLETNVRTV FLHELYPQAE GVPDSELIPL VELTCPASVS TAAGTDTANA ATTELTPRSW YYALLDYGAY LKKTIPNPSR RSKSHVKQSR FEGSHRQKRA ELLRVLLAHK DEGGAEFETL HQELCQIEVH AGRETLDEQV TLGLLEELAK EGFCQKNDEY WLP
|
| |
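The gene and protein sequences above are printed in blocks of ten characters, so the spaces must be stripped before any analysis. The sketch below is one way to do that and to translate the nucleotide sequence, shown on the first 30 and last 12 bases of the gene sequence only. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
# Assumption: translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Excerpts copied from the Gene sequence field (first 30 and last 12 bases).
gene_start = "ATGAAGAAAA AAATTTCACA TACAAAAACA"
gene_end = "TGGTTGCCAT AA"

print(translate(gene_start))  # MKKKISHTKT
print(translate(gene_end))    # WLP
```

The two translated excerpts match the beginning (`MKKKISHTKT`) and end (`WLP`) of the listed protein sequence. Passing the full Gene sequence field to `gc_percent` would give a value to compare with the GC content reported in the table.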