Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1109 |
Symbol | |
ID | 5773946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1013628 |
End bp | 1014734 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316751 |
Product | luciferase family protein |
Protein accession | YP_001582443 |
Protein GI | 161528617 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTTGA CTGATAAGAA ACTCAAATTT GGTATTCAAA ACGGCCTAAA TGTTGCAAGA GCCGGTTATA CTGAGGACCA AATCTTAACA GCATGTATGC TGGCTGATAA AACCGGCTAT GATTCTATTT TCTACATGGA CCACACAAAT GTCCCACAAT GGAAGAATGC CATTGTTCTA GACCCTTGGG TTATGTTATC TGCAATTGCT GCAGTTACTA ACAATGTAGA ACTTGGAACT TGTGTAACTG ATGCAATCCG AAGACATCCT TCAAACATTG CACTAGCTGC AATTACACTT GATAGAGTTT CTAAAGGTAG AGCCATTCTT GGTATTGGTG CAGGTGAAGC ACAAAATCTA AAAGAATTCT GCATTCCGTT TGAAAAACCA GTATCAAAAT GGGAAGAACA AATTGAAACT ATTCATACAT TATACAAATC AACTCCAGAT AACACTGTAG ATTATGAAGG AAAGTATTAC AAACTTGAAG GTGCATGTTT GCAAGCCCCT CCTATTAGAA AACCACATCC ACCAACTTAC ATGGCTTCTG GTGGAAAGAG AACTTTAGCA TTAACTGGAA AACTTGGTGA TGGTTGGCTA CCAATTGGTT ATACTCCAGA ACTTTTCGAA GATCATGCTG CTCAAATTAA AAAATCAATG GATGAAAACA ACAGAACACA AGAAGAGAAA GACAATTTCC AATATGCACT TGATATCGAT GTATACTTTT CTGAAGATGC AGAAGAATCC TGGGCAAGAA TGAAAGAAGC AGTAAAGGTC AGTCTATTCA AGCCTGAAGT TTTGAGAGTT CATAACTTGA AAGAAATTGA AGGATTTGAT TTCGTAAAAT ACTTTACAGA ATATTCTATG TCTAATCAAG AATGGATTGT AAAGATGAGA GAAGCTGCAA CAAAGATTCC TGAAGCAGTT GCACGTTCTT CAACTGCAGT TGGAACTCCT GACGACATCA TTCCAACATT TGAGAGATTC ATGGATGCTG GTGTTAATCA CTTTGTAATT AGATTCTGGG GTAAGAATTA CTTTGGCTCT ATTGACAAAT TTGCAAGCCA TGTAATGCCT GCCTTGAGAG AAAAAGCCAA ACAATAA
|
Protein sequence | MYLTDKKLKF GIQNGLNVAR AGYTEDQILT ACMLADKTGY DSIFYMDHTN VPQWKNAIVL DPWVMLSAIA AVTNNVELGT CVTDAIRRHP SNIALAAITL DRVSKGRAIL GIGAGEAQNL KEFCIPFEKP VSKWEEQIET IHTLYKSTPD NTVDYEGKYY KLEGACLQAP PIRKPHPPTY MASGGKRTLA LTGKLGDGWL PIGYTPELFE DHAAQIKKSM DENNRTQEEK DNFQYALDID VYFSEDAEES WARMKEAVKV SLFKPEVLRV HNLKEIEGFD FVKYFTEYSM SNQEWIVKMR EAATKIPEAV ARSSTAVGTP DDIIPTFERF MDAGVNHFVI RFWGKNYFGS IDKFASHVMP ALREKAKQ
|
| |