Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0335 |
Symbol | |
ID | 3785960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 364668 |
End bp | 367610 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637810411 |
Product | response regulator receiver/GGDEF/EAL domain-containing protein |
Protein accession | YP_411035 |
Protein GI | 82701469 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.376364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCGCATG ATATTTCAAT CCACGATGCC CGGAAAGTCG AGATTCTCAT CGCAGAGGAC AGTCCCACAC AAGCGGAAAA ACTGAAGTAT CTGCTGGAGG AAAACGGCTA CCAAGTTACC ACCGCAGGCG ATGGCAAGCA GGCCCTCTCG AGGGCGCGGC AGCGCAACCC GACATTGATT ATCAGTGATG TGATGATGCC GGAGATGAAT GGGTTTGCGT TATGCAGTGA AATCAAGCGG GACGCACAAC TCAAGAAGAT TCCAGTGATC CTGCTTACCT CGCTGGCGGA TGTCAAGGAT ATCATCCAGG GATTGGAATC GGGGGCCGAT AATTTTATCC GCAAGCCATA CGACGACAAA TATCTTCTTG ATCGTATCGA GTATCTACTG ATGAGCCATG AGCTGCGCAA GAGTCAGAAG ATGCAAATGG GAATGCAGAT TTATCTGGGC GGGCAAAAAC ATTTCATCAC CAGCGAGCGG CAGCAGATCG TGGATCTATT GATTTCAGTT TATGAGGATG CGGTCCATCT CAACGCTGAA CTGAATAGGC GACAACGTGA GCTGACGCAC TCGAATCGCT TGCTCAATGG GCTGTATCGA ATGGCGGAAG GCTTGAACCA GGCAGCAACC GAGCGCGAGG TCAGCGAAAA AGCGCTGGAG TACGCCATGA TGTTCCCGGG AGTCCAGGCG GGGTGGCTTT ATTTATGGAA TACCGACGGC CTGAGTTTGG CAGGCTCACG AAATGTACCT GCATCAGTAA GTGCGGATGG CTGGGACACA TCATGCAATT GCCTGCGCCT TTTGCTCGCC GGGGAGCTCG GCAATACTCC TAAAATTTTA AACTGCGAGC GCCTCTTCTC AAGCGGAGGC AATGGTTCTT GCCACCATGT CACCATTCCG ATGTGGAGTG GCAGCAAGAA CCAGGGCGTG ATGAATCTCC TGGGCTTGGA ACAGGATCAG TTCAGGGAGA ATGAGTTGGA TACATTCTAT GGGATTGGGC AGCAGGTAAG CGTAGCGCTG GAGCGTGCCC GATTGCATGA ACATATGGAA AAATTGGTGG GGGAGCGTAC TGCGGCGTTA ACAGCGGAGA TTGCTCAGCG CAAGGAATAT GAAGCGAGAG TAGTGCGGCT CAATCGTTTG TATAGCGTGT TAAGTGGTAT CAATACAGCC ATTGTGCGTG TCCGTGAAAT ACAGGAGCTT TTCAATGAGG CATGCCGTAT TGCAGTCGTT CACGGCGGAT TCGCTTTCGC TTGGATCGGA ATGCTGCGGG GAAGTACAAG AGTAACTACA CCAGTGGCCA AGATGGGGCA AGGAAATGAT GAGCTGCTCC AACTCAACCT ATCGCTGATG AGTGCGCCGA AAGGCGCTCC GCTGATAGAG GATTTAATCA CGCATCTCAA GCCGTTCATT TGCAATGATC TCGTTGCCGA TAAGCGCATG GAGGGTTTGC ATGATGCACC GCTGGCTCAC GATTATCTTT CCATGGCGAT CTTGCCCCTG GTTCTCGATG GGCAACTCGT CGGCACTCTG ACTCTCTATA CTCCCGAGAC AGGGTTTTTC GACGAGGAGG AAATTGCTTT GCTGGTAGAA ATGGCGGGCG ATATTTCGTT TGCAATGGAT CACCTGAAAA AAGAGGAGCG CATCAATTAT CTGGCTTTCT TCGACGCTGT AACCGAGCTC CCGAACCGGG CGTTATTTCT AGACCGGGTT AATCAGCGGA TCAAGATGGT TTCTTCCGGT CACGAGATGC TTTCCGTAAT TGTTCTGGAC ATCGAACGGT TCAGCAGCAT TAATGAATCG TTCGGTCATG GAGCGGGGGA CAGTCTACTC CGCCAACTTG CAAAGCGGCT GAAGCAGCTG CTTTCAGAAG CAGACATTCT GGCTCATCTT TCCGCCGATT ATTTTGCTAT CGCCAGAAAA CATGAAGAGG AAAGCATCGA CATCGTCCAT ATGCTGGAGA AGCTTCTGTT CGAGGTTCAG AACGAGCCCT TTTTAATTAA TGGACAGGAA CTGTGGGTTT CCGCCAGGGC GGGCGCTTCG TTCTTCCCTA GCGATGGCCT GGATACTGAT ACGCTCTTGA GAAATGCCGG GACAGCGCTA AAGAAGGCCA AGCATTCTTC CGACAAGCAT CTGTTTTATG CTCCCGAATT TCATGCACGG GTTAAAGAAA AACTGAGACT CGAGATCAAG CTGCGTCAGG CGTTGGAGCA GGAACAATTG ATGCTGTACT ACCAACCCAA AGTTGATTTG AAGAGTGGAC AAATAAGCGG ACTCGAGGCG TTGATGCGCT GGCACGATCC GGAAAGCGCT ACGGTACTTT CACCGCTTCA CTTCATTCCC TTGCTGGAAG AGACGGGAAT GATTCTTGAT GCAGGCCGGT GGGCGCTGAG CAAAGCGATA TCGGATGCAA ATAAATGGAA AGCCATGAGT TTGAATCCAC CGAGAATTGC CATCAATGTT TCTCCCGTTC AGTTACGGCA GAAGGATTTT GTCGATATGG TGGCGAATGT GGTCAGCGAC GCGGGAGATC TGGCCGCATT CCAGTTCGAG ATTACGGAAA GTGTAATCAT GCACGATATC AAAGCCAATA TCGAGAAGCT CAATCTCATC CAGGAAATGG GTATAGAGCT TGCGATAGAT GATTTCGGTA CTGGTTATTC TTCGCTCAGT TATATCGCCA AACTGCCCGT CAATGTCCTG AAAATTGATC GCGCTTTCAT CAGTGATATG AATATCAATC CAGACAATCT CAGCATTGTT TCCGGAATCA TATCCCTGGC ACATTCGCTG CATTTGCGTG TTGTTGCCGA GGGAGTGGAA ACGGCTGAGC AAGCGGAGCT TTTGCGAAGC CTCGAATGTG AGGAGATGCA GGGCTATCTC TTTAGTCCCG CCGTCACAGC GACCAAGATA ATAGAGTTTC TACACCAGAA GAAATCTCTT TAA
|
Protein sequence | MAHDISIHDA RKVEILIAED SPTQAEKLKY LLEENGYQVT TAGDGKQALS RARQRNPTLI ISDVMMPEMN GFALCSEIKR DAQLKKIPVI LLTSLADVKD IIQGLESGAD NFIRKPYDDK YLLDRIEYLL MSHELRKSQK MQMGMQIYLG GQKHFITSER QQIVDLLISV YEDAVHLNAE LNRRQRELTH SNRLLNGLYR MAEGLNQAAT EREVSEKALE YAMMFPGVQA GWLYLWNTDG LSLAGSRNVP ASVSADGWDT SCNCLRLLLA GELGNTPKIL NCERLFSSGG NGSCHHVTIP MWSGSKNQGV MNLLGLEQDQ FRENELDTFY GIGQQVSVAL ERARLHEHME KLVGERTAAL TAEIAQRKEY EARVVRLNRL YSVLSGINTA IVRVREIQEL FNEACRIAVV HGGFAFAWIG MLRGSTRVTT PVAKMGQGND ELLQLNLSLM SAPKGAPLIE DLITHLKPFI CNDLVADKRM EGLHDAPLAH DYLSMAILPL VLDGQLVGTL TLYTPETGFF DEEEIALLVE MAGDISFAMD HLKKEERINY LAFFDAVTEL PNRALFLDRV NQRIKMVSSG HEMLSVIVLD IERFSSINES FGHGAGDSLL RQLAKRLKQL LSEADILAHL SADYFAIARK HEEESIDIVH MLEKLLFEVQ NEPFLINGQE LWVSARAGAS FFPSDGLDTD TLLRNAGTAL KKAKHSSDKH LFYAPEFHAR VKEKLRLEIK LRQALEQEQL MLYYQPKVDL KSGQISGLEA LMRWHDPESA TVLSPLHFIP LLEETGMILD AGRWALSKAI SDANKWKAMS LNPPRIAINV SPVQLRQKDF VDMVANVVSD AGDLAAFQFE ITESVIMHDI KANIEKLNLI QEMGIELAID DFGTGYSSLS YIAKLPVNVL KIDRAFISDM NINPDNLSIV SGIISLAHSL HLRVVAEGVE TAEQAELLRS LECEEMQGYL FSPAVTATKI IEFLHQKKSL
|
| |