Gene Nmul_A0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0335 
Symbol 
ID3785960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp364668 
End bp367610 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content50% 
IMG OID637810411 
Productresponse regulator receiver/GGDEF/EAL domain-containing protein 
Protein accessionYP_411035 
Protein GI82701469 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCGCATG ATATTTCAAT CCACGATGCC CGGAAAGTCG AGATTCTCAT CGCAGAGGAC 
AGTCCCACAC AAGCGGAAAA ACTGAAGTAT CTGCTGGAGG AAAACGGCTA CCAAGTTACC
ACCGCAGGCG ATGGCAAGCA GGCCCTCTCG AGGGCGCGGC AGCGCAACCC GACATTGATT
ATCAGTGATG TGATGATGCC GGAGATGAAT GGGTTTGCGT TATGCAGTGA AATCAAGCGG
GACGCACAAC TCAAGAAGAT TCCAGTGATC CTGCTTACCT CGCTGGCGGA TGTCAAGGAT
ATCATCCAGG GATTGGAATC GGGGGCCGAT AATTTTATCC GCAAGCCATA CGACGACAAA
TATCTTCTTG ATCGTATCGA GTATCTACTG ATGAGCCATG AGCTGCGCAA GAGTCAGAAG
ATGCAAATGG GAATGCAGAT TTATCTGGGC GGGCAAAAAC ATTTCATCAC CAGCGAGCGG
CAGCAGATCG TGGATCTATT GATTTCAGTT TATGAGGATG CGGTCCATCT CAACGCTGAA
CTGAATAGGC GACAACGTGA GCTGACGCAC TCGAATCGCT TGCTCAATGG GCTGTATCGA
ATGGCGGAAG GCTTGAACCA GGCAGCAACC GAGCGCGAGG TCAGCGAAAA AGCGCTGGAG
TACGCCATGA TGTTCCCGGG AGTCCAGGCG GGGTGGCTTT ATTTATGGAA TACCGACGGC
CTGAGTTTGG CAGGCTCACG AAATGTACCT GCATCAGTAA GTGCGGATGG CTGGGACACA
TCATGCAATT GCCTGCGCCT TTTGCTCGCC GGGGAGCTCG GCAATACTCC TAAAATTTTA
AACTGCGAGC GCCTCTTCTC AAGCGGAGGC AATGGTTCTT GCCACCATGT CACCATTCCG
ATGTGGAGTG GCAGCAAGAA CCAGGGCGTG ATGAATCTCC TGGGCTTGGA ACAGGATCAG
TTCAGGGAGA ATGAGTTGGA TACATTCTAT GGGATTGGGC AGCAGGTAAG CGTAGCGCTG
GAGCGTGCCC GATTGCATGA ACATATGGAA AAATTGGTGG GGGAGCGTAC TGCGGCGTTA
ACAGCGGAGA TTGCTCAGCG CAAGGAATAT GAAGCGAGAG TAGTGCGGCT CAATCGTTTG
TATAGCGTGT TAAGTGGTAT CAATACAGCC ATTGTGCGTG TCCGTGAAAT ACAGGAGCTT
TTCAATGAGG CATGCCGTAT TGCAGTCGTT CACGGCGGAT TCGCTTTCGC TTGGATCGGA
ATGCTGCGGG GAAGTACAAG AGTAACTACA CCAGTGGCCA AGATGGGGCA AGGAAATGAT
GAGCTGCTCC AACTCAACCT ATCGCTGATG AGTGCGCCGA AAGGCGCTCC GCTGATAGAG
GATTTAATCA CGCATCTCAA GCCGTTCATT TGCAATGATC TCGTTGCCGA TAAGCGCATG
GAGGGTTTGC ATGATGCACC GCTGGCTCAC GATTATCTTT CCATGGCGAT CTTGCCCCTG
GTTCTCGATG GGCAACTCGT CGGCACTCTG ACTCTCTATA CTCCCGAGAC AGGGTTTTTC
GACGAGGAGG AAATTGCTTT GCTGGTAGAA ATGGCGGGCG ATATTTCGTT TGCAATGGAT
CACCTGAAAA AAGAGGAGCG CATCAATTAT CTGGCTTTCT TCGACGCTGT AACCGAGCTC
CCGAACCGGG CGTTATTTCT AGACCGGGTT AATCAGCGGA TCAAGATGGT TTCTTCCGGT
CACGAGATGC TTTCCGTAAT TGTTCTGGAC ATCGAACGGT TCAGCAGCAT TAATGAATCG
TTCGGTCATG GAGCGGGGGA CAGTCTACTC CGCCAACTTG CAAAGCGGCT GAAGCAGCTG
CTTTCAGAAG CAGACATTCT GGCTCATCTT TCCGCCGATT ATTTTGCTAT CGCCAGAAAA
CATGAAGAGG AAAGCATCGA CATCGTCCAT ATGCTGGAGA AGCTTCTGTT CGAGGTTCAG
AACGAGCCCT TTTTAATTAA TGGACAGGAA CTGTGGGTTT CCGCCAGGGC GGGCGCTTCG
TTCTTCCCTA GCGATGGCCT GGATACTGAT ACGCTCTTGA GAAATGCCGG GACAGCGCTA
AAGAAGGCCA AGCATTCTTC CGACAAGCAT CTGTTTTATG CTCCCGAATT TCATGCACGG
GTTAAAGAAA AACTGAGACT CGAGATCAAG CTGCGTCAGG CGTTGGAGCA GGAACAATTG
ATGCTGTACT ACCAACCCAA AGTTGATTTG AAGAGTGGAC AAATAAGCGG ACTCGAGGCG
TTGATGCGCT GGCACGATCC GGAAAGCGCT ACGGTACTTT CACCGCTTCA CTTCATTCCC
TTGCTGGAAG AGACGGGAAT GATTCTTGAT GCAGGCCGGT GGGCGCTGAG CAAAGCGATA
TCGGATGCAA ATAAATGGAA AGCCATGAGT TTGAATCCAC CGAGAATTGC CATCAATGTT
TCTCCCGTTC AGTTACGGCA GAAGGATTTT GTCGATATGG TGGCGAATGT GGTCAGCGAC
GCGGGAGATC TGGCCGCATT CCAGTTCGAG ATTACGGAAA GTGTAATCAT GCACGATATC
AAAGCCAATA TCGAGAAGCT CAATCTCATC CAGGAAATGG GTATAGAGCT TGCGATAGAT
GATTTCGGTA CTGGTTATTC TTCGCTCAGT TATATCGCCA AACTGCCCGT CAATGTCCTG
AAAATTGATC GCGCTTTCAT CAGTGATATG AATATCAATC CAGACAATCT CAGCATTGTT
TCCGGAATCA TATCCCTGGC ACATTCGCTG CATTTGCGTG TTGTTGCCGA GGGAGTGGAA
ACGGCTGAGC AAGCGGAGCT TTTGCGAAGC CTCGAATGTG AGGAGATGCA GGGCTATCTC
TTTAGTCCCG CCGTCACAGC GACCAAGATA ATAGAGTTTC TACACCAGAA GAAATCTCTT
TAA
 
Protein sequence
MAHDISIHDA RKVEILIAED SPTQAEKLKY LLEENGYQVT TAGDGKQALS RARQRNPTLI 
ISDVMMPEMN GFALCSEIKR DAQLKKIPVI LLTSLADVKD IIQGLESGAD NFIRKPYDDK
YLLDRIEYLL MSHELRKSQK MQMGMQIYLG GQKHFITSER QQIVDLLISV YEDAVHLNAE
LNRRQRELTH SNRLLNGLYR MAEGLNQAAT EREVSEKALE YAMMFPGVQA GWLYLWNTDG
LSLAGSRNVP ASVSADGWDT SCNCLRLLLA GELGNTPKIL NCERLFSSGG NGSCHHVTIP
MWSGSKNQGV MNLLGLEQDQ FRENELDTFY GIGQQVSVAL ERARLHEHME KLVGERTAAL
TAEIAQRKEY EARVVRLNRL YSVLSGINTA IVRVREIQEL FNEACRIAVV HGGFAFAWIG
MLRGSTRVTT PVAKMGQGND ELLQLNLSLM SAPKGAPLIE DLITHLKPFI CNDLVADKRM
EGLHDAPLAH DYLSMAILPL VLDGQLVGTL TLYTPETGFF DEEEIALLVE MAGDISFAMD
HLKKEERINY LAFFDAVTEL PNRALFLDRV NQRIKMVSSG HEMLSVIVLD IERFSSINES
FGHGAGDSLL RQLAKRLKQL LSEADILAHL SADYFAIARK HEEESIDIVH MLEKLLFEVQ
NEPFLINGQE LWVSARAGAS FFPSDGLDTD TLLRNAGTAL KKAKHSSDKH LFYAPEFHAR
VKEKLRLEIK LRQALEQEQL MLYYQPKVDL KSGQISGLEA LMRWHDPESA TVLSPLHFIP
LLEETGMILD AGRWALSKAI SDANKWKAMS LNPPRIAINV SPVQLRQKDF VDMVANVVSD
AGDLAAFQFE ITESVIMHDI KANIEKLNLI QEMGIELAID DFGTGYSSLS YIAKLPVNVL
KIDRAFISDM NINPDNLSIV SGIISLAHSL HLRVVAEGVE TAEQAELLRS LECEEMQGYL
FSPAVTATKI IEFLHQKKSL