Gene Nmul_A1000 details


Gene Information

Locus tag: Nmul_A1000
Symbol:
ID: 3785830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1160959
End bp: 1162116
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 49%
IMG OID: 637811083
Product: histidine kinase
Protein accession: YP_411695
Protein GI: 82702129
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
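
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1160959
END_BP = 1162116


def gene_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1158 385
```

Under these assumptions the result agrees with the Gene Length (1158 bp) and Protein Length (385 aa) fields.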


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.146298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCTGG CAAGCGACCA AAATATCAAT CCTGAGAAAC TTTCCGCAAC CGCTCGAAGA 
ATGCTCGAGC TTCGAGATGA AGTACTTTCG GAATGGATGA AAAGGGTTCG AAACAGCATC
AAGGAAGCCG AGCATTTACC CAATCCAATA ATCATCAATA CCTTTCCCGC CCTGTACGAT
AACCTTGCCG AAGCTATTAC GCCTGATTAT CCAAGAGCAA CAGGAAATGA GGGTACTACG
GTGGCGGCGG AGCACGGTGG GGAGCGAGCG CGCCTTACAA GCTATAACGC GCACTCGGTA
ATCGCGGAAT ATCAGCAACT GCGGTGGACA ATCTTTGATG TTCTAAAGAT GAATGACGTA
CGCCTCAATG ACCGTGAAAT TTACATTATC AATGCCTCTA TCGATGGATC AATCCGTGAG
GCTGTCAACG CCTTCGCCTT GACCCAGGCA GCGCTCCAGG AAAGATTTGT TGCAACACTT
GCTCACGACC TGAGAAATCC ATTATCGAAT GCCCATCTTG CCGCCCAGTT GATCAAATCC
ACGTCCGATT TGAACAAGAT AAAGGAATTT GCGGAAGGAA TCATGAACAA CCTGAGTCGA
ATGGATGGAA TGATTCGCGA TTTGCTCGAC TCGATAAAAT TCCACATGGG AGAACAATTA
CACCTGCGGC TCAAGGAATT CGACATACAG GAAGTCATGA AGGAAGTACT CGACAGCTTC
ACCGCCATTC ATGGGGCACG CTTCCGTCTG ATCGGCACTT CTATCACAGG ATGGTGGGAC
CGGGAGGCAA TCAAACGGGC GGTGGAAAAT ATTATTGGAA ATGCAGTGAA ATATGGCTCT
GCCGATACGC CTGTTCGAAT CAAGATTGCT TCACAAAACG AGCGCATGCT ACTGTCTGTG
CATAACGAAG GGGAATTCAT TCCACCTGAA CAAATCGAGA GTATATTTCA AATATTCGGA
AGAGCAGAGG CCGCAAAAAA GGGAAACAAG GAAGGCTGGG GTATTGGCTT GCCGTATGTG
CGAAGTGTTG CGGAAACCCA TGGTGGCAGT GTCGCGGTCG ATAGCTCACC TTATCGCGGC
ACAACCTTCA CGATAGATAT TCCGGTGGAT GCAAGACCTT ATCAAGGTGC CTTGCAACCT
TCCCGGAAGC CGGAATGA
 
Protein sequence
MTLASDQNIN PEKLSATARR MLELRDEVLS EWMKRVRNSI KEAEHLPNPI IINTFPALYD 
NLAEAITPDY PRATGNEGTT VAAEHGGERA RLTSYNAHSV IAEYQQLRWT IFDVLKMNDV
RLNDREIYII NASIDGSIRE AVNAFALTQA ALQERFVATL AHDLRNPLSN AHLAAQLIKS
TSDLNKIKEF AEGIMNNLSR MDGMIRDLLD SIKFHMGEQL HLRLKEFDIQ EVMKEVLDSF
TAIHGARFRL IGTSITGWWD REAIKRAVEN IIGNAVKYGS ADTPVRIKIA SQNERMLLSV
HNEGEFIPPE QIESIFQIFG RAEAAKKGNK EGWGIGLPYV RSVAETHGGS VAVDSSPYRG
TTFTIDIPVD ARPYQGALQP SRKPE
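
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record: it assumes the amino-acid assignments of NCBI translation table 11 (which match the standard code for all 64 codons), strips the 10-character block spacing used above, and stops at the first stop codon. The names `clean` and `translate` are hypothetical helpers. The alternative start codons of table 11 are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGACTCTGG CAAGCGACCA AAATATCAAT"))  # prints: MTLASDQNIN
```

The output matches the first block of the protein sequence, and the final codons of the gene (CCG GAA TGA) translate to the terminal residues PE followed by a stop.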