Gene Nmar_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1247 
Symbol 
ID5773149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1144197 
End bp1145837 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content29% 
IMG OID641316891 
Producthistidine kinase 
Protein accessionYP_001582581 
Protein GI161528755 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTAG GAAAAAATAA AATTTTATGG GGATTATTAG GATTAATCAT AATATCGGTT 
ATAGTCACAG GATTCTACAT TAATGATTTT ACAAAAACAA ACATTCGTGA ATCACTAATT
GAAGAAAAAA CTGAAATCCA AATGATTATG ACCAAAAGTC TTGCAGCATC TGTTAATTCA
GAGTTTCAGA GACTTTTGAC AGATATGAAA ACTATTTCAG AATCTGCACA AGTTCAAGAA
AGTCTAACAA GTAGTCAAAC ACAAGAATAT CTTTCAAACA GATGGAATAA TATTAATTCA
ATTACAAAGA TCTCAGACAT ATTTTTGACA GATGACTCAC TTACAATAGT ATCACAGGTA
AATGATGAAA AATTTCGCCT AGTGGGATTA AATTTGCAAA ATATTCAATC TAGTGAAGAA
TTCAAGGCTA AACCAGGGTA TTCAGGAGAG ATATTTAGCA CAGATGGGAT TTATCGAGTA
CTTGTATCAA GCCCAATTAT TGATTCAGAA TCAGGGGAAT TCAAAGGGAT TGTGTTTGGG
ATAGTTGAAC CATCAGAAAT CATCTCAAAA TATATCGGAA TTTATGAAAT TGACATATCA
TCGATTACCA TATTTGATGA AAATCAAAAA ATCCTATTTG CTGAAAATAC TGATTTACTT
GGAAAAGAAT TTTCAAGTTA TTTTGCACAA AGATATTTTG GGGAAAATGA AATTCAAAAT
TCTCATTATG AAAATATTTT CTTAGGAAAT ACAGATTCAT TTGTGTATGA AGCACATAGA
TTTGGAGATG TGATAAGCAC AGGAACTCCT GTATCAATAG AAGAAAAAAA CAGATTCTTT
TTCTTTGTAA CAACTCCAGT AAATCAAATA GTTGAAGATA TTGAAGACAA TCTATTTGTT
GAGGATCTAA AGAATAATTT AATTTTATTT ATAATAACAA TTTTGTTTAT TGTGATTGTC
ATTAAACGAG TTAGATCGAT TGAAAATGAA AAACTGTTAG TCATAGGACA GCTTGCATCA
AACATTGCAC ATGATATAAG AAATCCTCTT GGAACTATAC GTAGTTCAGT TACAAGAATT
GAGAAACAAA ATGAAACAAT AAATGAAACA ATAAATCAAG AAACAGAAAG AATCAAACGC
TCAGTTGCAA GAATGAATCA CCAAGTAGAA AGTGTCTTAA ATTATGTAAG AACAACTCCC
CTCAATTTAT CTGAAAACTC ACTAAATGAT TTAATTCAAT CATCTATAAA CTCACTTGTC
ATCCCAAAAA ATATTGAACT TAACATTCCA AAAGAAGACA TAAAATTTGA ATGTGATTCA
GATAAATTCA AAGTTGTTTT TGAGAATCTA CTGCTAAATG CAGTTCAGTC AATTGATTCA
AAAGAAGGTA AAATTAGCAT AAATTCAAAC CAAAACGAAA AAGAAATTAC AATATCATTT
GAGAATTCGG GTCCAAATAT TACAGAAGAA AACATTTCAA AAATTTTCAG GCCACTCTTT
ACCTCAAAAC TTAAAGGAAC AGGACTTGGT CTTTCAAGTT GCCAGAACAT TATCACGCAA
CATCAAGGTA CAATTTTGGT GACAAATAAT CCAGTAACAT TTACCATTAA AATCCCAAAA
AATTTAAGGA AAGAAAAGTA A
 
Protein sequence
MALGKNKILW GLLGLIIISV IVTGFYINDF TKTNIRESLI EEKTEIQMIM TKSLAASVNS 
EFQRLLTDMK TISESAQVQE SLTSSQTQEY LSNRWNNINS ITKISDIFLT DDSLTIVSQV
NDEKFRLVGL NLQNIQSSEE FKAKPGYSGE IFSTDGIYRV LVSSPIIDSE SGEFKGIVFG
IVEPSEIISK YIGIYEIDIS SITIFDENQK ILFAENTDLL GKEFSSYFAQ RYFGENEIQN
SHYENIFLGN TDSFVYEAHR FGDVISTGTP VSIEEKNRFF FFVTTPVNQI VEDIEDNLFV
EDLKNNLILF IITILFIVIV IKRVRSIENE KLLVIGQLAS NIAHDIRNPL GTIRSSVTRI
EKQNETINET INQETERIKR SVARMNHQVE SVLNYVRTTP LNLSENSLND LIQSSINSLV
IPKNIELNIP KEDIKFECDS DKFKVVFENL LLNAVQSIDS KEGKISINSN QNEKEITISF
ENSGPNITEE NISKIFRPLF TSKLKGTGLG LSSCQNIITQ HQGTILVTNN PVTFTIKIPK
NLRKEK