Gene Nmar_1109 details


Gene Information

Locus tag: Nmar_1109
Symbol:
ID: 5773946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1013628
End bp: 1014734
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 37%
IMG OID: 641316751
Product: luciferase family protein
Protein accession: YP_001582443
Protein GI: 161528617
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
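
The start, end, gene length, and protein length reported in this record agree with one another. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1013628
end_bp = 1014734

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1107 368
```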


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACTTGA CTGATAAGAA ACTCAAATTT GGTATTCAAA ACGGCCTAAA TGTTGCAAGA 
GCCGGTTATA CTGAGGACCA AATCTTAACA GCATGTATGC TGGCTGATAA AACCGGCTAT
GATTCTATTT TCTACATGGA CCACACAAAT GTCCCACAAT GGAAGAATGC CATTGTTCTA
GACCCTTGGG TTATGTTATC TGCAATTGCT GCAGTTACTA ACAATGTAGA ACTTGGAACT
TGTGTAACTG ATGCAATCCG AAGACATCCT TCAAACATTG CACTAGCTGC AATTACACTT
GATAGAGTTT CTAAAGGTAG AGCCATTCTT GGTATTGGTG CAGGTGAAGC ACAAAATCTA
AAAGAATTCT GCATTCCGTT TGAAAAACCA GTATCAAAAT GGGAAGAACA AATTGAAACT
ATTCATACAT TATACAAATC AACTCCAGAT AACACTGTAG ATTATGAAGG AAAGTATTAC
AAACTTGAAG GTGCATGTTT GCAAGCCCCT CCTATTAGAA AACCACATCC ACCAACTTAC
ATGGCTTCTG GTGGAAAGAG AACTTTAGCA TTAACTGGAA AACTTGGTGA TGGTTGGCTA
CCAATTGGTT ATACTCCAGA ACTTTTCGAA GATCATGCTG CTCAAATTAA AAAATCAATG
GATGAAAACA ACAGAACACA AGAAGAGAAA GACAATTTCC AATATGCACT TGATATCGAT
GTATACTTTT CTGAAGATGC AGAAGAATCC TGGGCAAGAA TGAAAGAAGC AGTAAAGGTC
AGTCTATTCA AGCCTGAAGT TTTGAGAGTT CATAACTTGA AAGAAATTGA AGGATTTGAT
TTCGTAAAAT ACTTTACAGA ATATTCTATG TCTAATCAAG AATGGATTGT AAAGATGAGA
GAAGCTGCAA CAAAGATTCC TGAAGCAGTT GCACGTTCTT CAACTGCAGT TGGAACTCCT
GACGACATCA TTCCAACATT TGAGAGATTC ATGGATGCTG GTGTTAATCA CTTTGTAATT
AGATTCTGGG GTAAGAATTA CTTTGGCTCT ATTGACAAAT TTGCAAGCCA TGTAATGCCT
GCCTTGAGAG AAAAAGCCAA ACAATAA
 
Protein sequence
MYLTDKKLKF GIQNGLNVAR AGYTEDQILT ACMLADKTGY DSIFYMDHTN VPQWKNAIVL 
DPWVMLSAIA AVTNNVELGT CVTDAIRRHP SNIALAAITL DRVSKGRAIL GIGAGEAQNL
KEFCIPFEKP VSKWEEQIET IHTLYKSTPD NTVDYEGKYY KLEGACLQAP PIRKPHPPTY
MASGGKRTLA LTGKLGDGWL PIGYTPELFE DHAAQIKKSM DENNRTQEEK DNFQYALDID
VYFSEDAEES WARMKEAVKV SLFKPEVLRV HNLKEIEGFD FVKYFTEYSM SNQEWIVKMR
EAATKIPEAV ARSSTAVGTP DDIIPTFERF MDAGVNHFVI RFWGKNYFGS IDKFASHVMP
ALREKAKQ
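
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The Python sketch below is a minimal illustration, not the database's own method. It assumes that sense codons in translation table 11 are read as in the standard code, that the sequence is given 5' to 3' in the coding frame, and that the 10-base groups are separated only by whitespace. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced here.

```python
# Codon table built in TCAG order; sense codons match the standard code,
# which translation table 11 also uses (assumption stated above).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTACTTGA CTGATAAGAA ACTCAAATTT"))  # MYLTDKKLKF
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above, and `gc_percent` on the same input can be compared with the reported GC content.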