Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1491 |
Symbol | |
ID | 3785368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1703711 |
End bp | 1705801 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811579 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_412186 |
Protein GI | 82702620 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG3437] Response regulator containing a CheY-like receiver domain and an HD-GYP domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.125145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATA ATGATGAAAG CCTGCCGCCC TCGGTTACCT TGCATACAAA CGCCGTACTT CACGCCGGGC AACAGGTCGA GCATGAGCTT CAGCGGGTCA AAAGGGCCCT TGAGGATAAA ACGAAACAGC TTGATGAGTC ACTATCTATC CTGCGAGCAA CCATAGAATC CACTGCCGAC GGAATACTGG TTACCGACGA GTATGGCCAT ATGCTTCGCT ACAACGAACG ATATCTGCAG ATGTGGCAGA TTAAGCTCCC CGATATCGTT GAATTGCATC AACATAGGCA ATTGCAGAAA CTTAGTTGCA AGTACTTGGC GGACCCGAAG CAATATCTTG AGAGAATAGA GGAAATATAT GATGCCTGGC CGCCCGAAAC ATACGACTTG CTGGAACTTA CCGATGGCCG GATATTTGAA CGGTATTCGA AGATTCAGTA TGCAGAGGAC CTGTGTGTAG GCCGCGTATG GAGCTTTAGA GATATCACGG CCCGTAAACA TGCCGAAGAG GCGCTGCAGG AAAGCACCGA GCGTCTCCAC TTCATGGCTG AGGCCATGCC GCAGAAAATA TTCACTGCGC GCGCCGATGG GGCCGTGGAT TATTTCAATC AGCAATGGAA AGATTACACC GGTCTTGCCC GCGAGAAAAT GCAGGGTTGG GAATGGATCA AGCTCATCCA TCCTGACGAC GCCCCGGAGT GCGTGCGGCG CTGGCAGCAC TCGCTTAACA CAGGGGAGCA TCTGCAGATG GAGTGCCGTT TCCGCCAAGC GAACGGCACT TACCGCTGGC ACCTGCTTCA GGCACAGGCG CGGCGCGATA CGAAAGGGTA TATATCCATG TGGGTCGGTT CGAACACGGA TATCGACACG CTAAAACGCG CAGACGAAGA GAAAAAACAG CTCCTTGAGA ATGAACGGAT TGCGCGAAGC GAAGCCGAAC GCGCGAACCG CATCAAGGAT GATTTCCTGG CAACCCTATC CCATGAACTT AGAACACCGC TTAATGCCAT CCTTGGATGG TCACAGCTCA TCTTGCAGGG GACAATGAAA AACGAAGACA TTCAAAGGGG TCTGGAAACC ATCGAGCGAA ACGCCCGGGC ACAGAATAAG CTTATCGAAG ATCTTCTGGA AATGAGCAGC ATCATTTCTA ACAAAATCCG ATTGGATATG CAGGAATGGG ACCTTGGCGC AATAGCCGAG GCAGCAGTCG AATCTGTTGC ACCTGCAGCC GAGGCTAAAG GAATATGTAT CCGAAGAACA ATCGATGCCA CTGCAGGACG GGTTCTAGGG GATAACCATC GCCTTCAGCA GATAATATGG AACCTTCTTT CCAATGCCGT CAAGTTTACG CCTGACGGCG GTACCGTAGA GGTAATCGTG GAACGCGCCG CATCGCAGCT TAGGGTCATC GTAAAAGATT CCGGGGTGGG TATCAAACCT GAATTCCTGG CGTATGTTTT TGACCGCTTC CGGCAAGCCG ACTCTTCCCT CACCCGGAAT CATGGCGGCC TGGGCTTGGG ACTTGCCATC GTGAAGCAGC TTGTGGGGCT GCATGGTGGA ACGGTTTGCG CAGAGAGCAA GGGGGAAGGT CAGGGCGCCT CATTTTGCGT TACCTTACCG CCTGCCCCGA TCAAGGACGG CATGGACAGA CAACTGCTGC CTGATGCCAA ACCTTCCCAG AAGGATGGCA ATATCCTTCT CTCAGGGATG AGGATTCTCG TCATTGATGA CGAACGGGAC TCGCGTGAGC TCATCCATAA AGTGCTCGCG CAGCACCGGG TTGAAGTCAT TACCGCGGCA AACGCCATGG AAGGACTGTT GATACTGAAA AGCCAGATGC CGGATGTAAT GATCAGTGAT ATTGGCATGC CGGGAAAAGA CGGCTACCAG TTTATTCGCG AAGTGAGAAG ACTTCCTGCA ATCCAGGGAG GAGAGATACC GGCAATCGCA CTGAGCGCAT TTGCGCATCC GGAAGACCGT ACCTGCGCAA TGATGGCGGG TTACCAGATG CATCTCTCCA AACCCGTGGA ATCAAAAGAA CTGATCGCTT CCATCGGAAG CCTGATGGCG CGAGCAAAAA CAGCGAGTTG A
|
Protein sequence | MDNNDESLPP SVTLHTNAVL HAGQQVEHEL QRVKRALEDK TKQLDESLSI LRATIESTAD GILVTDEYGH MLRYNERYLQ MWQIKLPDIV ELHQHRQLQK LSCKYLADPK QYLERIEEIY DAWPPETYDL LELTDGRIFE RYSKIQYAED LCVGRVWSFR DITARKHAEE ALQESTERLH FMAEAMPQKI FTARADGAVD YFNQQWKDYT GLAREKMQGW EWIKLIHPDD APECVRRWQH SLNTGEHLQM ECRFRQANGT YRWHLLQAQA RRDTKGYISM WVGSNTDIDT LKRADEEKKQ LLENERIARS EAERANRIKD DFLATLSHEL RTPLNAILGW SQLILQGTMK NEDIQRGLET IERNARAQNK LIEDLLEMSS IISNKIRLDM QEWDLGAIAE AAVESVAPAA EAKGICIRRT IDATAGRVLG DNHRLQQIIW NLLSNAVKFT PDGGTVEVIV ERAASQLRVI VKDSGVGIKP EFLAYVFDRF RQADSSLTRN HGGLGLGLAI VKQLVGLHGG TVCAESKGEG QGASFCVTLP PAPIKDGMDR QLLPDAKPSQ KDGNILLSGM RILVIDDERD SRELIHKVLA QHRVEVITAA NAMEGLLILK SQMPDVMISD IGMPGKDGYQ FIREVRRLPA IQGGEIPAIA LSAFAHPEDR TCAMMAGYQM HLSKPVESKE LIASIGSLMA RAKTAS
|
| |