Gene Nmul_A1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1159 
Symbol 
ID3784215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1336779 
End bp1338335 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content54% 
IMG OID637811244 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_411854 
Protein GI82702288 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCATG CAAATAAAAT TGCTTCGATT CCTCAAAGGA ACCAAAGAAG AGAAATGATT 
GCCCCCGTGG AAGCCGGAAG ACAGGATAGC GTAAGCGGTT CATGGGTAAG ACTGCGGCGT
AAGCCCAAAT CCTTTGTCAA ACTCATTCTG CTGGGTTTTG CTCTGGTTGG CTTGCCACTG
ATCGTGGCGC TCATCAATAG CGCTTTTTCC ATTGACCGGT TGGCAAATCA GAGCCGGAAG
GCTGTATATC AGGCGGCGGA GATTGCGCAC AGCAGCCGCG CTCTTGCGGA TGAAATCGCG
AATATGGAGC GGGCGGTGCG CCAGACTCAC ATTCTGGGCG ATACTGCCCT GCTGGAGGGT
TATTTCCGTG GACATGCCGG ATTCGAGAAA ATCGCCGCGG GCCTTGCCGA ACTCTCTCTC
AGCGATGAGC AGAGACGATT GCTGGGACAG CTGAAATCAG AGGAGGCCGC TATTTTCGAG
GAAATTTCAG CCGTTAAGCA ATCACCGGAA GAGTTGCGTA AGCTGATCGG CAATTTTGGT
CATCTGCGGG ATTCCGCACG CTTCTTTTTC TCGCTCGGGT ACGCGCTCAT AGAGCGCGAA
GTGGACGAAA TGCAGGATAT GGCGGGATCA GCGCGCTTTA CCGTTGGCTG GCAACTGCTT
GCACTGATTC CATTTGCCAT ATTACTTGCC TTCTTTTTTT CCTTCCGGAT CGCCCGCCCG
ATTCGCCAGA TTGAAGAGGC TATCCGCAGC ATGGGACAGG GAAAGCTGCA TAAGGCGATT
CGGGTGGATG GTCCTCAGGA CTTGGTTTAT TTTGGCGAAA GACTGGACTG GATGAGACAG
CGCCTGCTAA AGCTCGAAGA ACAGAAGACG CGGTTCCTGC AGCATGTTTC TCACGAACTC
AAGACCCCGC TGACAGCCAT GCGGGAAGGG GCGGATTTGC TCGTGGAAGG TGTGGCGGGC
AAGCTGACGG AAGAGCAGCA GCGCGTTGCC AGCATTCTTC ATAGCAATAG CCTGCAGTTG
CAGCGCCGCA TCGAGGACTT GCTGAGTTAT AGCGCCCTTC AGACCGAGAA AGCCGCACTG
GTGAAACAGG AGGTGGATTT GAAGAAGATT CTGGATGTGG TATTGCACGA TCAGAATCTC
ATCATTATGA ACAAAGGCCT GCGAATGGAT TTGTCTTGCC CGGAAGTGAT GCTGGAGTGC
GAGCCGCAAA AAGTCAAAGT GATTGTGGAC AATCTTTTAT CGAACGCTGT GAAATTCTCC
CCCCGGGGAG GGTGCATCCG CATCTGGACA AGTGAGACAG CAGGGGTTGC GCAACTGGAT
ATAGTGGACG CTGGACCCGG AGTGGATGAT GCCGATCGGG AGAAAGTATT CGAACCGTTT
TATCAAGGGC GGAGAGTGCT TGACAGTCAC GTGAGGGGAA CGGGCTTGGG ATTGTCGATC
GCCCGGGAGT ATGCGCTTGC GCATGGCGGG AACATCGAGC TCGTACCGCT GCCTGACCGC
GGCGCCCACT TTCGGCTCAC TCTCCCCGTT CATGATGTGC CTGAAGGATC AGCATGA
 
Protein sequence
MRHANKIASI PQRNQRREMI APVEAGRQDS VSGSWVRLRR KPKSFVKLIL LGFALVGLPL 
IVALINSAFS IDRLANQSRK AVYQAAEIAH SSRALADEIA NMERAVRQTH ILGDTALLEG
YFRGHAGFEK IAAGLAELSL SDEQRRLLGQ LKSEEAAIFE EISAVKQSPE ELRKLIGNFG
HLRDSARFFF SLGYALIERE VDEMQDMAGS ARFTVGWQLL ALIPFAILLA FFFSFRIARP
IRQIEEAIRS MGQGKLHKAI RVDGPQDLVY FGERLDWMRQ RLLKLEEQKT RFLQHVSHEL
KTPLTAMREG ADLLVEGVAG KLTEEQQRVA SILHSNSLQL QRRIEDLLSY SALQTEKAAL
VKQEVDLKKI LDVVLHDQNL IIMNKGLRMD LSCPEVMLEC EPQKVKVIVD NLLSNAVKFS
PRGGCIRIWT SETAGVAQLD IVDAGPGVDD ADREKVFEPF YQGRRVLDSH VRGTGLGLSI
AREYALAHGG NIELVPLPDR GAHFRLTLPV HDVPEGSA