Gene Information |
Locus tag | Mpal_0552 |
Symbol | |
ID | 7271968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 545258 |
End bp | 546115 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643569199 |
Product | 8-oxoguanine DNA glycosylase domain protein |
Protein accession | YP_002465648 |
Protein GI | 219851216 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) |
|
|
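The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(545258, 546115)
print(length, protein_length(length))  # prints: 858 285
```

Both results match the Gene Length (858 bp) and Protein Length (285 aa) fields of the record.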
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.729182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.316005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCACA CGCTGAACCT GTCGTCTGAT CAGTCTTTCT CTCTCTCGCT GACGCTGGGG TGCGGGCAGG CCTTCCGGTG GGAGCAGGAC GAAGCCGGAT GGTGGGAAGG GGTCGTCGGG GATGAGGTGA TCCGGGTCCG CCAGGAAGAT CGGTCGCTGA CCTTCACCGG AACCAGTGAA GAACGCCTGA TCGAGTACTT CGCCCTGGAC ATGGATCTCG CACATGTGCT GGAGACGATC GATCGAGATC CGTTCATCCA TGCGGCGATT GAGGAGTGTG CTGGCCTTCG GATCCTCAGG CAGCCCTCCT GGGAGTGCCT CCTCTCCTAC CTCTGCGCGA CCAATACCAA CATTCCAATG GTGAAGAAAC GGGTCCGTCT CCTCGCCGAG AGCCTGGGGG AACGGATCCC CGGCACTGAC CAGTTCGCCT TTCCTGTACC ATCCGTCTTC AACGAAACCT GTGCCGAACC ACTCGACCAT TGTAGACTGG GGTACCGGAA AGGGTATCTC GCCACGACGG CGTGCCAGCT CGCTGCTGAG GGGGGATGGG AGGGACGGGT CCGGGCTCAA CCATTCGAGG AGGCACGACA GGTACTGACC AGGCTTCCTG GTATCGGACC TAAGGCAGCC GACTGTGTCC TCCTCTTTGG TTTTTCGCGG TACGAGGCGT TCCCGGTCGA TGTCTGGATC CGGCGAATCA TGCAGCAGTT CTACCCTGAA ACTGCTGCAG AGGGATCGTT CACCCCCAAA GAATACGAGC GGATTCGGCG GTTTGCCTGG GAATATTTTG GTGAATATGC CGGCTATGCC CAGGAATACC TCTACGGAGC CCGGATGGGA GCAGCCCAGA TCCCCTGA
|
Protein sequence | MAHTLNLSSD QSFSLSLTLG CGQAFRWEQD EAGWWEGVVG DEVIRVRQED RSLTFTGTSE ERLIEYFALD MDLAHVLETI DRDPFIHAAI EECAGLRILR QPSWECLLSY LCATNTNIPM VKKRVRLLAE SLGERIPGTD QFAFPVPSVF NETCAEPLDH CRLGYRKGYL ATTACQLAAE GGWEGRVRAQ PFEEARQVLT RLPGIGPKAA DCVLLFGFSR YEAFPVDVWI RRIMQQFYPE TAAEGSFTPK EYERIRRFAW EYFGEYAGYA QEYLYGARMG AAQIP
|
| |
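The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that spacing, compute a GC fraction, and translate the coding sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (the tables differ in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first 30 bases of the record are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGGCGCACA CGCTGAACCT GTCGTCTGAT"
print(translate(prefix))  # prints: MAHTLNLSSD
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.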