Gene Nmul_A0202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0202 
Symbol 
ID3785875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp210885 
End bp214049 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content52% 
IMG OID637810273 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_410902 
Protein GI82701336 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACAG AGATCCACAA TCCCCATATC CCTGGCGGGC GTCCGCCCTC CCGACCGGCT 
ATCGCCGCTC TCACTTTCGT GATTTTTATG TCGCTTACCC TAGCGAGCTG GTCTGTCGTA
CAGGATCTGC AGGAACGGCA TGAAAATACT CGATTTGATA AGCGGGTAGC AGAAGTCATT
CACAAGATCG AGCATCACTT TTCAGGGTAT GAACAGGTGT TGAAAGGGGC GCTGGGACAC
CTGCTTGCCT CCCCCTCCGT TTCCCGGAAC GAATGGCGCA CATATGTGAA TGCACTGCGA
ATAAACGACC GGTATCCGGG CATTCACGGT ATCGGTTTCG CCAAATATAT CCCGCGGTCA
GGGCTTGCCG CCCATACTGA AGAAGTACGC GCGGAAGGTT TTCCGTCCTA CAAAGTCTGG
CCTGGAACCC CGCGCCAGGA GTATACCTCC ATCATATATC TCGAACCCTT TACGGGATCA
AATCTGCGTG CGTTCGGATA TGACATGTTT TCAAATGCTG TTCGACGCGC CGCCATGTCC
CGTGCGCGCG ATACGGGAGA AGTCGCGCTT ACCCGCAAAG TGAAGCTCGT GCAGGAAACC
GGTGAGGATA TTCAGGCAGG CGTCCTTATG TATCTCCCCT ACTATGGTGC TGCAAATCTG
CCGAAAACGG TGGAAGAAAG GCGCGCATCA TTGGGTGGCT ATGTATATGC TCCGTTCAGA
ATGGTTGACT TCGTACGCGC CACCCTTGGG AGCGAACTTG ATATCCTCGA TCTGAGAATT
TTCGATGGAA AAGCAATGGC AGAGGATTCT CTTCTTTTCG ACAGCGTAAA ACATCGATCT
GGGCCGTCGC CCGTACCTAA ATTTAAACGG GTCATACCTA TATCCTTATA TGGTCAAACA
TGGGCGCTCG AAGCATCCTC CCGTCCTGCA TTTGAAAAAG CAATAAAGAA TTACGAGGCA
CTCCTCGTTC TGCTGGGCGG ATTCGTGGTC AGCATGTTGA CTGCCATGGT TTCCTTTGTC
CTTTCGGGGA ACAAAGAGAA GGCTGCGGCA CTGGGACATG TAAATAAAAA ACTGCTGTTG
GCCATGGAGG AGCAGCAGGC AACAACACGC GAGCTTTCAA ATGCGAAGCT TCGCACAGAG
AGGATTTTGG AAAGTATTAC TGACGCGTTT TATACCCTGG ATCGGGAATG GCGTTTCACC
TATGTGAATA AGGAAGCCGA GAATCTGCTA CAGCGTAATC GCGAGGATCT TCTGGGAAGG
GTGTGCTGGG AAGAATTCAA AGAGACAGCA GGAACAACCT TTGATCGCGA ATACCATCGG
GCACTCATGG AAAACAGGAC AGTGACGTTT GAGGAATTCT ACGTTCCACT GGATGAATGG
TTCGAGGTAC ATGCTTATCC ATCTGAAGAA GGCTTGACTG TTTATTTCCG CGATATTACC
GAACGCAAGG AGTCTGAGCA GGCGCGGCAG GAAGCCCACG TTCGTATCCG CCAGCAAGCT
TCCCTTCTCG ACAAGGCAAC GGATGCGATC ATTGTTTTCG GAATGGATAA TTGTATTGAA
TTCTGGAACC AAGGCGCGGA ACGGCTCTAT GGATGGACGT CCGAAGAGGT GATGGGAAGA
GAGGTTGAGA TGTTATATGA CAATGTTGCC GTTTTCGATG AAGCAAACCG GGCGCTGCTC
AGTTCCGGCG AATGGAGAGG AGAACTTGCG CAGCGACGCA GGGACCAGAG TATGCTGAAC
GTGGAGGCCC ATTGGACGCT GGTCAGAAAT GACGACGGAG AGCCTCAGGC TATCTTCGCG
ATCAATACAG ATATTACCCA GCGCAAGAGC GCCGAGAACG AAATCCTGCA TCTCGCCTTA
TACGATTCTT TGACGGGTTT GCCCAATAGG CGGCTCTTGC TGGATCGCCT CGGGCATGCA
TTAGCGGTGA GTGCTCGCAA TCAGCGCATG GGCGCGTTAC TGTTTATCGA CCTCGATAAC
TTCAAGCTGC TGAATGATAC GCTGGGGCAT GACATGGGTG ACCTGCTGCT TCGGCAGGTC
GCGCCGCGCC TGTCCTCTTG CGTACGCGAA AGCGATACGG TAGCTCGCTT GGGCGGAGAT
GAGTTTGTGG TGATATTGAT GGGTGACTTT GGCGAAGATC ATGATGAGGC TCTTACCCAG
ATAAGCACCA TTTGCGAAAG GATACGCAGC GCCTTCATTC AGCCGTTTAA TCTCGATCCA
TACATCCATC ACATTACACC CAGCATCGGT ATTGCACTGT TTAACGATCA GTCCCAAACA
ACAAACGAAC TCCTGAAGCG GGCAGACCTC GCTATGTATC AGGCAAAAGC ATCGGGCCGC
AATGCCATGC GTTTTTTCGA TCCGGACATG CAAGCAGCAA TGAATGCCCG AGCCATTCTC
GAGTCGGAGT TATACAAAAG CTGGGAGAGA AACGAGTTCA TTCTCCATTA TCAGCTGCAG
GTGGACAGCC GGGGGATTAT CGGTGCTGAA GCGCTGGTGC GATGGCAGCA CCCGCGCCGG
GGTCTTTTGC CCCCCTCCGA ATTCATTCCA CAGGCAGAGG AAACCGGTCT GATTCTTCCT
TTGGGCGAGT GGGTGCTGGA AACCGCATGC AACCAACTGG CAAGCTGGGC ACTTCAACCG
GAGACAGTAC AGCTCAATCT CTCTGTGAAT GTCAGTTCCC TCCAGTTTTG CCAGCCCGAT
TTTGTCGAGC AAGTAATTTC AATACTCGAC CGCACAGGCG CCAATCCACA AAGACTCAAG
CTTGAGCTTA CCGAAAGCAT TCTGGTCCAT GATATGGACG ATACCATTGC AAAAATGATG
ACACTCAAAG CCCGAGGGGT GGGCTTTGCG CTGGATGACT TTGGCATCGG CTATTCCTCA
CTCTACTATC TGAAACGTCT GCCGCTGGAT TGGGTAAAAA TTGATCGGTC ATTTGTGAGA
GACGTGCTGA CCGATAATAA TGATGCAACG ATCGTTCGAA CGATCCTGCT CCTTGCTAAA
AGCATGGGGT TGGCGGTAAT TGCCGAAGGA GTGGAAACCG GAGCACAAAA AGATTTTCTT
GCTAGCCACG GTTGTACTGC TTATCAGGGG TACCTGTTCA GCCGGCCTCT GCCTTTAGAG
CAGTTTGAGC GGTTTGTGCA GCCGGCTGCG GGAGGGGCTG TGTAA
 
Protein sequence
MDTEIHNPHI PGGRPPSRPA IAALTFVIFM SLTLASWSVV QDLQERHENT RFDKRVAEVI 
HKIEHHFSGY EQVLKGALGH LLASPSVSRN EWRTYVNALR INDRYPGIHG IGFAKYIPRS
GLAAHTEEVR AEGFPSYKVW PGTPRQEYTS IIYLEPFTGS NLRAFGYDMF SNAVRRAAMS
RARDTGEVAL TRKVKLVQET GEDIQAGVLM YLPYYGAANL PKTVEERRAS LGGYVYAPFR
MVDFVRATLG SELDILDLRI FDGKAMAEDS LLFDSVKHRS GPSPVPKFKR VIPISLYGQT
WALEASSRPA FEKAIKNYEA LLVLLGGFVV SMLTAMVSFV LSGNKEKAAA LGHVNKKLLL
AMEEQQATTR ELSNAKLRTE RILESITDAF YTLDREWRFT YVNKEAENLL QRNREDLLGR
VCWEEFKETA GTTFDREYHR ALMENRTVTF EEFYVPLDEW FEVHAYPSEE GLTVYFRDIT
ERKESEQARQ EAHVRIRQQA SLLDKATDAI IVFGMDNCIE FWNQGAERLY GWTSEEVMGR
EVEMLYDNVA VFDEANRALL SSGEWRGELA QRRRDQSMLN VEAHWTLVRN DDGEPQAIFA
INTDITQRKS AENEILHLAL YDSLTGLPNR RLLLDRLGHA LAVSARNQRM GALLFIDLDN
FKLLNDTLGH DMGDLLLRQV APRLSSCVRE SDTVARLGGD EFVVILMGDF GEDHDEALTQ
ISTICERIRS AFIQPFNLDP YIHHITPSIG IALFNDQSQT TNELLKRADL AMYQAKASGR
NAMRFFDPDM QAAMNARAIL ESELYKSWER NEFILHYQLQ VDSRGIIGAE ALVRWQHPRR
GLLPPSEFIP QAEETGLILP LGEWVLETAC NQLASWALQP ETVQLNLSVN VSSLQFCQPD
FVEQVISILD RTGANPQRLK LELTESILVH DMDDTIAKMM TLKARGVGFA LDDFGIGYSS
LYYLKRLPLD WVKIDRSFVR DVLTDNNDAT IVRTILLLAK SMGLAVIAEG VETGAQKDFL
ASHGCTAYQG YLFSRPLPLE QFERFVQPAA GGAV