Gene Nmul_A0727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0727 
Symbol 
ID3786073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp847284 
End bp848714 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content53% 
IMG OID637810809 
Productputative signal transduction histidine kinase 
Protein accessionYP_411426 
Protein GI82701860 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGCC GGGATAAAAG GCACTTCAAC TCCGTCCATT CCCCTGTGGA TGAACATGCT 
GACGCAAGTC AGGCGCTTCG CGAGCGCCTG AAAGAAATCA CCTGTCTCTA TGAAATTCGC
CGGGGCATGG GGCCGGAATT ATCGGTGGAG AACGTTTGCC GGCAGATTTT CGAGCACTTG
ATACACGCGA TGCAATTTCC GGAAATTGCT ACCGCCATGA TCGAGCTCGA CGGCAGACGC
TTCATTTCCC AGAATCACGA CGAAGGTGCC ACGCATGAGC TGCAATCGAC GATTAACGTC
AACGCCCATC CTTGTGGCCA GCTACGGGTC TTCTATCCGG AAGATAAACC TTTCCTGGTG
CCGGAAGAAC AGCGGCTCAT CGACGCGATC GCAACTGATC TGGGAAGGTG GTTTGAGCGC
AAACAGATCG ACGAGGCGTT GCGCGAGCGT CTGAAAGAAA TCACTTGCCT CTACGAGATT
CGCCATGGCA TGGGAGTGGA ATTATCGGTG GACAACGTCT GCCAGCAGAT TTTCGAGCAC
CTGATACCCG CGATGCAATT TCCGGAAATT GCTACCGCCA TGATCGAACT CGATGGCAAG
CGCTTCACTT CCAAGAACCA CGGTCAGGGT CTTACGCACG AACTGAAATC GACGATCAGC
GCCAACAACC ATTCCTGCGG CCAGTTGCGT GTCTTCTATC CCGAAGACAA ACCTTTCCTG
GTGCCGGAAG AACAGCGGCT CATCGACGCG GTTGCGACTG ATCTGGGGAG ATGGTTTGAG
CGCAAACATC TCGAGCAAAC CCTGGTTTCC ATAGCGGAAG AACATCAGCG TTCGATCGGC
CAGGATTTAC ACGACAATCT CGGGCAGCAG ATTGCAGCGA TTGGCTATCA GGCCAAAGCG
CTGCAGAAAA AAATATCCTC GTTGGGGAGT ACGGATGCCG CAACCGTCGC TGCTTCCATC
GCGACTCAAG CACAGATCGC CGTGATGCAA TGCAAGCAGC TTGCGCAGGG GCTGCTCCCA
TTTGAACTGG AGACCCATGG CCTGGTGGCC GCACTGCGGG CATTTGCATC CAGAATCGCA
ATCACTTACA AGATTACTTG TGATTTTATA TGCAAAAATG AAGTTCTCAT CAAGGATAAG
GATCTTGCGC TTAATATTTA CCGGATTGCC CAAGAGGCCA CCAATAACGC AATACGCCAC
GGGAGCGCAC AGCATGTGAC AATTTCGCTG GATTCCGAGG AAGAAATGCT CTCTCTGTCG
ATACGCGATG ATGGCGGCGG CTTTGCCGGT TTTAACACAA AAGAGGGAGC GACTCCAGGA
ATGGGTATTA AAATCATGCA ATATCGCGCC CGGCAGCTTG GCGCAATACT GGAATTCGTG
TCGCATCCCG AAGGCGGAGT GGAAGTGCGG CTCGAAATGC GAATGATGTA G
 
Protein sequence
MISRDKRHFN SVHSPVDEHA DASQALRERL KEITCLYEIR RGMGPELSVE NVCRQIFEHL 
IHAMQFPEIA TAMIELDGRR FISQNHDEGA THELQSTINV NAHPCGQLRV FYPEDKPFLV
PEEQRLIDAI ATDLGRWFER KQIDEALRER LKEITCLYEI RHGMGVELSV DNVCQQIFEH
LIPAMQFPEI ATAMIELDGK RFTSKNHGQG LTHELKSTIS ANNHSCGQLR VFYPEDKPFL
VPEEQRLIDA VATDLGRWFE RKHLEQTLVS IAEEHQRSIG QDLHDNLGQQ IAAIGYQAKA
LQKKISSLGS TDAATVAASI ATQAQIAVMQ CKQLAQGLLP FELETHGLVA ALRAFASRIA
ITYKITCDFI CKNEVLIKDK DLALNIYRIA QEATNNAIRH GSAQHVTISL DSEEEMLSLS
IRDDGGGFAG FNTKEGATPG MGIKIMQYRA RQLGAILEFV SHPEGGVEVR LEMRMM