Gene Information |
Locus tag | Nmar_1247 |
Symbol | |
ID | 5773149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1144197 |
End bp | 1145837 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641316891 |
Product | histidine kinase |
Protein accession | YP_001582581 |
Protein GI | 161528755 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
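The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1144197
END_BP = 1145837
GENE_LENGTH_BP = 1641
PROTEIN_LENGTH_AA = 546


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_residues(length_bp: int) -> int:
    """Residues encoded by a CDS of this length, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1641
print(coding_residues(GENE_LENGTH_BP))  # 546
```

Under those assumptions both results agree with the Gene Length and Protein Length fields of the record.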
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTAG GAAAAAATAA AATTTTATGG GGATTATTAG GATTAATCAT AATATCGGTT ATAGTCACAG GATTCTACAT TAATGATTTT ACAAAAACAA ACATTCGTGA ATCACTAATT GAAGAAAAAA CTGAAATCCA AATGATTATG ACCAAAAGTC TTGCAGCATC TGTTAATTCA GAGTTTCAGA GACTTTTGAC AGATATGAAA ACTATTTCAG AATCTGCACA AGTTCAAGAA AGTCTAACAA GTAGTCAAAC ACAAGAATAT CTTTCAAACA GATGGAATAA TATTAATTCA ATTACAAAGA TCTCAGACAT ATTTTTGACA GATGACTCAC TTACAATAGT ATCACAGGTA AATGATGAAA AATTTCGCCT AGTGGGATTA AATTTGCAAA ATATTCAATC TAGTGAAGAA TTCAAGGCTA AACCAGGGTA TTCAGGAGAG ATATTTAGCA CAGATGGGAT TTATCGAGTA CTTGTATCAA GCCCAATTAT TGATTCAGAA TCAGGGGAAT TCAAAGGGAT TGTGTTTGGG ATAGTTGAAC CATCAGAAAT CATCTCAAAA TATATCGGAA TTTATGAAAT TGACATATCA TCGATTACCA TATTTGATGA AAATCAAAAA ATCCTATTTG CTGAAAATAC TGATTTACTT GGAAAAGAAT TTTCAAGTTA TTTTGCACAA AGATATTTTG GGGAAAATGA AATTCAAAAT TCTCATTATG AAAATATTTT CTTAGGAAAT ACAGATTCAT TTGTGTATGA AGCACATAGA TTTGGAGATG TGATAAGCAC AGGAACTCCT GTATCAATAG AAGAAAAAAA CAGATTCTTT TTCTTTGTAA CAACTCCAGT AAATCAAATA GTTGAAGATA TTGAAGACAA TCTATTTGTT GAGGATCTAA AGAATAATTT AATTTTATTT ATAATAACAA TTTTGTTTAT TGTGATTGTC ATTAAACGAG TTAGATCGAT TGAAAATGAA AAACTGTTAG TCATAGGACA GCTTGCATCA AACATTGCAC ATGATATAAG AAATCCTCTT GGAACTATAC GTAGTTCAGT TACAAGAATT GAGAAACAAA ATGAAACAAT AAATGAAACA ATAAATCAAG AAACAGAAAG AATCAAACGC TCAGTTGCAA GAATGAATCA CCAAGTAGAA AGTGTCTTAA ATTATGTAAG AACAACTCCC CTCAATTTAT CTGAAAACTC ACTAAATGAT TTAATTCAAT CATCTATAAA CTCACTTGTC ATCCCAAAAA ATATTGAACT TAACATTCCA AAAGAAGACA TAAAATTTGA ATGTGATTCA GATAAATTCA AAGTTGTTTT TGAGAATCTA CTGCTAAATG CAGTTCAGTC AATTGATTCA AAAGAAGGTA AAATTAGCAT AAATTCAAAC CAAAACGAAA AAGAAATTAC AATATCATTT GAGAATTCGG GTCCAAATAT TACAGAAGAA AACATTTCAA AAATTTTCAG GCCACTCTTT ACCTCAAAAC TTAAAGGAAC AGGACTTGGT CTTTCAAGTT GCCAGAACAT TATCACGCAA CATCAAGGTA CAATTTTGGT GACAAATAAT CCAGTAACAT TTACCATTAA AATCCCAAAA AATTTAAGGA AAGAAAAGTA A
|
Protein sequence | MALGKNKILW GLLGLIIISV IVTGFYINDF TKTNIRESLI EEKTEIQMIM TKSLAASVNS EFQRLLTDMK TISESAQVQE SLTSSQTQEY LSNRWNNINS ITKISDIFLT DDSLTIVSQV NDEKFRLVGL NLQNIQSSEE FKAKPGYSGE IFSTDGIYRV LVSSPIIDSE SGEFKGIVFG IVEPSEIISK YIGIYEIDIS SITIFDENQK ILFAENTDLL GKEFSSYFAQ RYFGENEIQN SHYENIFLGN TDSFVYEAHR FGDVISTGTP VSIEEKNRFF FFVTTPVNQI VEDIEDNLFV EDLKNNLILF IITILFIVIV IKRVRSIENE KLLVIGQLAS NIAHDIRNPL GTIRSSVTRI EKQNETINET INQETERIKR SVARMNHQVE SVLNYVRTTP LNLSENSLND LIQSSINSLV IPKNIELNIP KEDIKFECDS DKFKVVFENL LLNAVQSIDS KEGKISINSN QNEKEITISF ENSGPNITEE NISKIFRPLF TSKLKGTGLG LSSCQNIITQ HQGTILVTNN PVTFTIKIPK NLRKEK
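The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not handle), strips the spaces used to group the sequence in blocks of ten, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
print(translate("ATGGCATTAG GAAAAAATAA AATTTTATGG"))  # MALGKNKILW
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` should give the complete protein if the record is internally consistent.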