Gene Nmul_A1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1334 
Symbol 
ID3785060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1524099 
End bp1525043 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content53% 
IMG OID637811422 
Producthistidine kinase, dimerisation and phosphoacceptor region 
Protein accessionYP_412029 
Protein GI82702463 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA AAAAGGAAAA TTCAGAACCC CTCCATATGG GGGAAGCGTA CCCTCCGGAT 
GTTGCCGCCA ATTCTCCAGG GATGGTTTTT CAATACATAC AAAAGGGAGA TGGTACTTCA
TCCATGCCCT TTATCAGTGA TCAGTGCGCC AATGTACTCG GGATCTCGGC TGGAGAACTG
AAGGCTAATC CTTCGCTGCT CCAGGATCTG GTGCTTCCGG AAGACCGGGA AAGCCTGGAG
TATTCGAAGG CGCAATCCGC AGCTAATCTG ACAACGTGGA ATTGGGAAGG ACGCCTGTGG
ATCGAATCCT ACGGGGATAT AAAGTGGGTC AGCTTGCGAG CAAGTCCGAG GCGGGAGAAG
GGCGCGTGCG TGGTTTGGGA AGGCATTATC ATCAATATCA CCGAGAGCAG GCGCCGGGAG
ACAGAGCTCA AGGCATCCCA TGAACGGCTC AAGGAAGTAT CAGCCCACGT CATGGCGGCG
AGGGAGCACG AGCGTATACG CATTGCCAGA GAAATACACG ATGATCTGGG GGGGAATCTC
ACTGCCATCA AGATCGATCT CGACTGGCTG GTGAGGCGGA TCGATGCCGG CGCCAGCAGC
GCCGAGAATG CTGTATTGCT TGCGAAAGTG CGTATTGTTT CAGATCTGGT GGATCGCACC
ATACATTCGA TACAACGTAT TTCCCGGGAC CTGCGGCCCG GCATCATGGA TTTTGGCATT
TTTGCGGCAA TAGAATGGGA AGCGGGTGAA TTTGCAAAAC GCTATGGCAT ACCTTGCAAG
GTGTCGTGCA ACGAGCCGGA TATTGAACTC GAATCTGACA TGGCCGTGGC CGTATTTCGA
ATTTTTCAGG AAGCGCTCAC CAATATTGCG AAGCACGCGC ATGCCTCGCA TGTCTGGGTC
GGGCTGGACG TGAGATCATG GATTTTGGCA TTTTTGCGGC AATAG
 
Protein sequence
MKQKKENSEP LHMGEAYPPD VAANSPGMVF QYIQKGDGTS SMPFISDQCA NVLGISAGEL 
KANPSLLQDL VLPEDRESLE YSKAQSAANL TTWNWEGRLW IESYGDIKWV SLRASPRREK
GACVVWEGII INITESRRRE TELKASHERL KEVSAHVMAA REHERIRIAR EIHDDLGGNL
TAIKIDLDWL VRRIDAGASS AENAVLLAKV RIVSDLVDRT IHSIQRISRD LRPGIMDFGI
FAAIEWEAGE FAKRYGIPCK VSCNEPDIEL ESDMAVAVFR IFQEALTNIA KHAHASHVWV
GLDVRSWILA FLRQ