Gene Information |
Locus tag | Nmar_0249 |
Symbol | |
ID | 5773143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 220541 |
End bp | 221920 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641315871 |
Product | histidine kinase |
Protein accession | YP_001581583 |
Protein GI | 161527757 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
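The coordinate and length fields in the record above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single untranslated stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 220541
END_BP = 221920


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(n_bases: int, stop_codons: int = 1) -> int:
    """Number of amino acids encoded by a CDS of n_bases.

    Assumes the CDS is a whole number of codons and ends with
    `stop_codons` stop codon(s) that are not translated.
    """
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1380 459
```

Both results agree with the Gene Length (1380 bp) and Protein Length (459 aa) fields listed above.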
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAA CTCAAAGGAT TATACTAATA ACAATTTTAC CATTAATAAT TTCTACATCA GTTTCAGCGG CAATCATTTC AGAAATAGCA AGTGATCAGT ATTTTGAACT AACAATATCA AAAGTTCAAT CATTAAGTAA ATTGACTGAA AGTGAAATGC GAAATCCAAT GCATCAATTA GATTTTGATG CATTAAATGA GATAGTAGAT AACTTGGAGG GAGATGAAAA TATTCAGCAA GTTTTAGTCC TGTTTCCTGA TGGACGTTTA CTAACGGATG GAACTGATAA CGATTACAAT TATGGGACAA CTTTTGAGGA TAAATTTATT CAAAATGCAA TAACAAGTAA TGAAGAGGCA GTAACGATTG ATAATAACAT AATTCGTGTA TCAAATTCAA TAGTTCTTAA TGAGAAAATT GGGATTTTGG TAATAGATTA TTCAACAAAT AGTATTGAAA AAAGTATTCA AGAAACAATT ACTGAAATTA TTGTAGTAGC AGGAATAATT ATCGGAATAT CTATTTTTGT TGCAGTATAT CTTAGTCGTT CAATTAGCAA TCCAATTCTA ATAATTAAAG AAAAAATAGA TAGTATTTCA AAAGGAGATT TACAAAAACA AGAAATTAAA TCAAAAATTC CCGAAATCAA TGAGCTTTAT GATGAAATTA TCAACATGGG AGAAAAAATT GAGAAATATC AAAATGAATT AGTAAAAACA GAGAGACTCA CCACTATCGG GGAAATGTCT GCAAGAATTA CACATGATTT GAGAAATCCG TTAACTACAA TAAAAAATGC AGTAGCAGTT ATGAAAATGA AAAATCCTGA AAAAATTAAA GAAAATCAAC AATATTTTGA CATGATACAA GATGGAGTAA CTCGAATGAA CCATCAAATT GACGAAGTGC TGGCATTTGT TAAAGCAAAA GAACCTGAAA GGAACTTTGT GGAATTTTCT GAAATATCAA ATAATGTATT AAATACAATC TCAATACCAG AAAATATCAA AATATCAATT TCAGAAAGTA ATGAAAAAAT TTGGTGTGAT AAAATTCAAT TACAAAATGT TTTGATCAAT ATGATCTCAA ATTCAGTTCA AGCTATTGGA AAAAATCAAG GTGAGATAAT AATTGATTAT AAAATAGAGG GAGAATTCGA CAAAATTACA GTAAAAGACA ATGGAACTGG AATACCAGAA AATTTACTGG ACGCAGTATT TGAGCCACTT TACACTACAA AACAAGACGG CACAGGTCTA GGGCTAGTTA GTTGTAAAAA TGCTGTTGAG GCTCATAATG GCAAAATTTA CGCTCAAAAT TTAGATGAAG GAGGGGCAAT TTTTACAATT TTGCTACCTA AAATTAAAGA GACAAAATAA
|
Protein sequence | MKLTQRIILI TILPLIISTS VSAAIISEIA SDQYFELTIS KVQSLSKLTE SEMRNPMHQL DFDALNEIVD NLEGDENIQQ VLVLFPDGRL LTDGTDNDYN YGTTFEDKFI QNAITSNEEA VTIDNNIIRV SNSIVLNEKI GILVIDYSTN SIEKSIQETI TEIIVVAGII IGISIFVAVY LSRSISNPIL IIKEKIDSIS KGDLQKQEIK SKIPEINELY DEIINMGEKI EKYQNELVKT ERLTTIGEMS ARITHDLRNP LTTIKNAVAV MKMKNPEKIK ENQQYFDMIQ DGVTRMNHQI DEVLAFVKAK EPERNFVEFS EISNNVLNTI SIPENIKISI SESNEKIWCD KIQLQNVLIN MISNSVQAIG KNQGEIIIDY KIEGEFDKIT VKDNGTGIPE NLLDAVFEPL YTTKQDGTGL GLVSCKNAVE AHNGKIYAQN LDEGGAIFTI LLPKIKETK
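The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be reproduced from the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First two blocks of the gene sequence in this record.
prefix = "ATGAAGTTAA CTCAAAGGAT"
print(gc_fraction(prefix))  # 0.3
print(translate(prefix))    # MKLTQR
```

The translated prefix matches the first six residues of the protein sequence (MKLTQR). Passing the full gene sequence to the same functions should yield the complete protein followed by `*` for the terminal stop codon, along with a GC fraction that can be compared to the GC content field.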