Gene Nmar_0249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0249 
Symbol 
ID5773143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp220541 
End bp221920 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content29% 
IMG OID641315871 
Producthistidine kinase 
Protein accessionYP_001581583 
Protein GI161527757 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTAA CTCAAAGGAT TATACTAATA ACAATTTTAC CATTAATAAT TTCTACATCA 
GTTTCAGCGG CAATCATTTC AGAAATAGCA AGTGATCAGT ATTTTGAACT AACAATATCA
AAAGTTCAAT CATTAAGTAA ATTGACTGAA AGTGAAATGC GAAATCCAAT GCATCAATTA
GATTTTGATG CATTAAATGA GATAGTAGAT AACTTGGAGG GAGATGAAAA TATTCAGCAA
GTTTTAGTCC TGTTTCCTGA TGGACGTTTA CTAACGGATG GAACTGATAA CGATTACAAT
TATGGGACAA CTTTTGAGGA TAAATTTATT CAAAATGCAA TAACAAGTAA TGAAGAGGCA
GTAACGATTG ATAATAACAT AATTCGTGTA TCAAATTCAA TAGTTCTTAA TGAGAAAATT
GGGATTTTGG TAATAGATTA TTCAACAAAT AGTATTGAAA AAAGTATTCA AGAAACAATT
ACTGAAATTA TTGTAGTAGC AGGAATAATT ATCGGAATAT CTATTTTTGT TGCAGTATAT
CTTAGTCGTT CAATTAGCAA TCCAATTCTA ATAATTAAAG AAAAAATAGA TAGTATTTCA
AAAGGAGATT TACAAAAACA AGAAATTAAA TCAAAAATTC CCGAAATCAA TGAGCTTTAT
GATGAAATTA TCAACATGGG AGAAAAAATT GAGAAATATC AAAATGAATT AGTAAAAACA
GAGAGACTCA CCACTATCGG GGAAATGTCT GCAAGAATTA CACATGATTT GAGAAATCCG
TTAACTACAA TAAAAAATGC AGTAGCAGTT ATGAAAATGA AAAATCCTGA AAAAATTAAA
GAAAATCAAC AATATTTTGA CATGATACAA GATGGAGTAA CTCGAATGAA CCATCAAATT
GACGAAGTGC TGGCATTTGT TAAAGCAAAA GAACCTGAAA GGAACTTTGT GGAATTTTCT
GAAATATCAA ATAATGTATT AAATACAATC TCAATACCAG AAAATATCAA AATATCAATT
TCAGAAAGTA ATGAAAAAAT TTGGTGTGAT AAAATTCAAT TACAAAATGT TTTGATCAAT
ATGATCTCAA ATTCAGTTCA AGCTATTGGA AAAAATCAAG GTGAGATAAT AATTGATTAT
AAAATAGAGG GAGAATTCGA CAAAATTACA GTAAAAGACA ATGGAACTGG AATACCAGAA
AATTTACTGG ACGCAGTATT TGAGCCACTT TACACTACAA AACAAGACGG CACAGGTCTA
GGGCTAGTTA GTTGTAAAAA TGCTGTTGAG GCTCATAATG GCAAAATTTA CGCTCAAAAT
TTAGATGAAG GAGGGGCAAT TTTTACAATT TTGCTACCTA AAATTAAAGA GACAAAATAA
 
Protein sequence
MKLTQRIILI TILPLIISTS VSAAIISEIA SDQYFELTIS KVQSLSKLTE SEMRNPMHQL 
DFDALNEIVD NLEGDENIQQ VLVLFPDGRL LTDGTDNDYN YGTTFEDKFI QNAITSNEEA
VTIDNNIIRV SNSIVLNEKI GILVIDYSTN SIEKSIQETI TEIIVVAGII IGISIFVAVY
LSRSISNPIL IIKEKIDSIS KGDLQKQEIK SKIPEINELY DEIINMGEKI EKYQNELVKT
ERLTTIGEMS ARITHDLRNP LTTIKNAVAV MKMKNPEKIK ENQQYFDMIQ DGVTRMNHQI
DEVLAFVKAK EPERNFVEFS EISNNVLNTI SIPENIKISI SESNEKIWCD KIQLQNVLIN
MISNSVQAIG KNQGEIIIDY KIEGEFDKIT VKDNGTGIPE NLLDAVFEPL YTTKQDGTGL
GLVSCKNAVE AHNGKIYAQN LDEGGAIFTI LLPKIKETK