Gene Information |
Locus tag | Nmar_1540 |
Symbol | |
ID | 5773201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1402714 |
End bp | 1403577 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 641317191 |
Product | HhH-GPD family protein |
Protein accession | YP_001582874 |
Protein GI | 161529048 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) |
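The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, though they agree with the values reported in this record. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmar_1540.
length = gene_length_bp(1402714, 1403577)
print(length)                     # 864
print(protein_length_aa(length))  # 287
```

Both results match the Gene Length (864 bp) and Protein Length (287 aa) fields.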
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000245501 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAGAGAT TCCAATCTAA AATGCAAATA AAGCAACCAA TAGATGTTGA AAATTCAATA AACAGCGGTC AAGTTTTTCT TTGGAGAAAG AATAAAGAAT TTTGGTACGG AGTAAACGGA CAAGACATAC TAGAAGTAAA CAAAAATGGA AAAATAAAGT CATTACAAAA TTACAAAACA GATTTTTTTA GAAATAATGA TAATTTTGAT GAAATTATCA AATCGATTTC AAAAGACAAG ATAGTAAAAA ATGCTGTAAA AAAATATCCA GGGCTTAGAA TCATAAAACA AGATCCATTC CAATGTTTGA TTTCATTTAT CGTATCATCA AACTCTAACA TCCAAAAAAT TAAAACAAAT CTAGAAAACA TATCACAAAA ATTTGGAGAA AGAGTGGAAT ACAAAGATCA AGAGTTTTTT TTATTTCCTA ATGCAAAAAC GCTTTCCAAA GCATCAATAA CTGAAATTAA AAATTGTGGA GTTGGATATC GTGCAAAATT CATCAAAGAA GCCTCAAAAA TTTTTGCTTC TGAAAAAATT ATGATTGATG ATTTGAAATC AAGTGATTAT TTTGATGCAA AAAAGAAGAT TCGCATCATT CCAGGCATAG GAAACAAAGT TGCAGATTGT ATTTTATTAT TTTCTCTTGA TAAGTTAGAA TCATTTCCAT TAGATAGATG GATGATTAGA ATTTTAGAGA AATATTATTC AAAAAAATTT CAAATAGATA CCAAAACAAT TACAGAAAAA CAATATGATA TTCTACATGA AAAGATTGTA GATTATTTTG GGCTATATGC AGGATATGCA CAACAGTTTC TCTTCAAAAT GGAAAGAGAA AATTATCAAA AAAAGTGGCT GTAA
Protein sequence | MKRFQSKMQI KQPIDVENSI NSGQVFLWRK NKEFWYGVNG QDILEVNKNG KIKSLQNYKT DFFRNNDNFD EIIKSISKDK IVKNAVKKYP GLRIIKQDPF QCLISFIVSS NSNIQKIKTN LENISQKFGE RVEYKDQEFF LFPNAKTLSK ASITEIKNCG VGYRAKFIKE ASKIFASEKI MIDDLKSSDY FDAKKKIRII PGIGNKVADC ILLFSLDKLE SFPLDRWMIR ILEKYYSKKF QIDTKTITEK QYDILHEKIV DYFGLYAGYA QQFLFKMERE NYQKKWL
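The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of standard-library Python. This is an illustration under stated assumptions, not the database's own pipeline: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares, differing only in its permitted start codons), and translation simply stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may start translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1 until the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
prefix = "ATGAAGAGAT TCCAATCTAA AATGCAAATA"
print(translate(prefix))  # MKRFQSKMQI
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence fields.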