Gene Nmul_A0328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0328 
Symbol 
ID3784005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp353681 
End bp356536 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content52% 
IMG OID637810404 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_411028 
Protein GI82701462 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAA CGGTAATACC AACAGATTTT GGCAAGCTGG TTCTTGACGA AATGCCCGGC 
GGCGCAATTG TTACCACTGG CGACGGCGTT GTCGTTTATT GGAACAAGGG CGCGCAATCG
ATATTTGGCT ATACGGTGGC GGAAGCCTTG CAGAGGAGGC TTACTGAACT CGTCGGCGCC
CCTGATCATC GGCCCAAGAT CAACCAGAGC CTGAGAAATA CGCACGAGAC AAGGGCTTCA
GTGGACGAGA TCTTGTGTCG TAAAAAAGAC GGCTCGCTGC TCTATGTAAG CATGTCATGC
AAAGTCCTGT CGCAGGATCA GCGCCAGGAC GGCTACGTCC TGATCACAAG TACAAATATC
ACGCATATCA AAGCATTGCG CGATGCCAAG CTGATCAATA CCAAATTCGG CAATTTGCTC
GATTCAATAC CAGATGGAAT AGTCATAGTG AACTCGACAG GACGGATAGT CCTGGTCAAC
ACGCAAACGG AGAAACTGTT CGGATATAGC GCGCACGAAC TTTGCGGCCA GCCCGTGGAA
ATGCTGCTGC CCCCTCGCAA GCGCGGCCTC CATTCGGGAG AACGCGCAGC TTACTTTACC
CACCCGCATA GCCGAACGCT CGTCGCTGAT TGGGAGCTAT ACGGGTTTCG CAAGGACGGC
ACCGAATTTC CAGTGGAAAT CAGCTTGAGT CCCATTGAAA CGGATGAAGG CAGGTTTAGC
ATAAGTGCAA TCCGCGATAT CAGCGATCGA AAAAAAGCGG AGCAGAAATT TCGCGGACTG
CTGGAAGCGG CACCCGATGC GATCGTTATT GTGAATCGGA ATGGCGAAAT CGTGCTGGTG
AATTCTCAAA CCGAAAAATT GTTTGACTAC CAGCGAGAAC AGTTGCTCGG TAAAAAGATG
GAGATGCTTA TTCCACCCCG TTACCGCTCA AAACATCCCG GGTTTCGGGA AGATTTTTTC
CGTGCACCCC GCACGCGTCC CATGGGCATC GGACTCGAGC TCTATGGTCT GCGCCGCGAT
GGCACTGAAT TTCCTGTTGA AATCAGTCTT AGTCCCCTTG AAACCGAAGA AGGAATCCTG
GTCTCCAGCG CCATCCGCGA TATCACCGAG CGCAAGCGTA CCAAGGAAAT CCAAACCCAA
CTCAGACGGG ATCTCACTGA GCGTGAAATA GCGGAAAAGG CGCTTTTTGA AGAAAAGGAA
CTCCTGCGCG TTACATTGAG CTGTATTGGT GACGCCGTTA TCACAACGGA TACCGAAGGA
AGCGTCACCT ATCTGAATCC CGTTGCGGAA GCCATGGCCG GGTGTAGTTC CGATGACGCA
AAGGGTTTGC CTCTTCAAGA CGTATTTCAA ATCATCCATG CAGAAACCAA CGAGCTGGCA
TCCAGTCCGG CCGAAAGGGT TTTAAGGAAC AAAGAGACCG TAAGCCTTAA TTCGCATGCG
CTGCTCATAC GGCGAGACGG CCAAACCTTT CCGATCGAAG ACTCGGCCGC GCCCATTCGG
GACCAAAATG GTTCAATCAT CGGCGTCGTG CTGGTATTCC GCGATGTCAG TCATGCACAG
AAGATGGCAA TGGAAATGCG CTATCAGGCG ACGCACGACG CCTTGACAGG CCTGATCAAC
AGGCATGAAT TCGAGCGGCG CCTGAAACAG GTCCTGGAGC GCGATAAAGG AGCGGACGAG
CATACTCTGC TTTATCTTGA TCTCGATCAA TTCAAGATCG TAAACGACAC TTGCGGTCAC
CTTGCAGGAG ACGAGTTGCT GAAACAACTC ACCAGCCTCC TGCAAGCAAA GTTGCGGAAG
GATGATACTT TGGCGCGCCT GGGAGGAGAT GAATTCGGCT TATTGCTTGA AAGATGTCCA
AGGGGATCGG CATTTCGCGT GGCTGAAGTG CTGCGGCAAA CCGTGCAGGA GTTCCGTTTC
GTCTGGGAAG AGCGGATATT CTCGCTCGGA ATAAGCATCG GGCTGGCTAC TTTCTCCGGT
GGCGAGCAAA CTTTCTCGGA CGTTTTACGC ATGGCGGACA CGGCTTGCTT TCTTGCCAAG
GATAAGGGGC GTAACCGTGT TCAACTCTAT GCTTCCGATG ACAAGAACCT CGAGAAACGC
CGTGGCGAGA TGGGATGGGT GGAGCGCCTG CACAAAGCTC TCGACAAGCA GCGGTTCGTA
CTGTATTCAC AGAAAATATT GCCCCTTTCC GCCCCTTCAA GCACTAGCCA CTACGAGATA
CTGCTGCGAA TGAAAGGAGA AAATGGCGAG CTGGTGCCTC CCATGGCCTT CATCCCGGCG
GCGGAGCGCT ACGGCCTCAT GCCTCAACTC GACAGATGGG TAATCACCAA TGCCTTTGCC
CAATACGCTT CACATCCCTC ACGGGGCATA AAAGACACCT GCACTATCAA TCTCTCAGGC
GCATCTATTT GCGATGAGCA TTTATACGAT TTCGTGGTGG ATCAATTCAA GCAATCTCAG
ATCGATCCTG CCGGGATCTG TTTCGAGATC ACGGAAACTG TGGCAATCGC AAACCTTACT
CAGGCTGCCA CATTGATTCG CAAGCTAAAG GAGCTCGGCT GCCGCTTCTC CCTCGACGAC
TTCGGCAGCG GAATGTCCTC CTTTACTTAC CTCAAGCACT TGCTTGTAGA CTATTTAAAA
ATCGATGGAG CATTCATCAA GGGCATGCTC AGTGATCCTA TTGACCATGC TATGGTTGAG
GCGATTAATC ATATAGGCCA TGTAATGAAA ATCCAAACGA TAGCGGAATG GGTAGAGGAA
GAGTCTTTTC TTGAGGCGCT GCGAAAGATA GGTGTTGACT ATGCCCAAGG GTATGCGATT
GAAGAGCCGC GCCCAGCCAT AACGCTTTGC CATTGA
 
Protein sequence
MESTVIPTDF GKLVLDEMPG GAIVTTGDGV VVYWNKGAQS IFGYTVAEAL QRRLTELVGA 
PDHRPKINQS LRNTHETRAS VDEILCRKKD GSLLYVSMSC KVLSQDQRQD GYVLITSTNI
THIKALRDAK LINTKFGNLL DSIPDGIVIV NSTGRIVLVN TQTEKLFGYS AHELCGQPVE
MLLPPRKRGL HSGERAAYFT HPHSRTLVAD WELYGFRKDG TEFPVEISLS PIETDEGRFS
ISAIRDISDR KKAEQKFRGL LEAAPDAIVI VNRNGEIVLV NSQTEKLFDY QREQLLGKKM
EMLIPPRYRS KHPGFREDFF RAPRTRPMGI GLELYGLRRD GTEFPVEISL SPLETEEGIL
VSSAIRDITE RKRTKEIQTQ LRRDLTEREI AEKALFEEKE LLRVTLSCIG DAVITTDTEG
SVTYLNPVAE AMAGCSSDDA KGLPLQDVFQ IIHAETNELA SSPAERVLRN KETVSLNSHA
LLIRRDGQTF PIEDSAAPIR DQNGSIIGVV LVFRDVSHAQ KMAMEMRYQA THDALTGLIN
RHEFERRLKQ VLERDKGADE HTLLYLDLDQ FKIVNDTCGH LAGDELLKQL TSLLQAKLRK
DDTLARLGGD EFGLLLERCP RGSAFRVAEV LRQTVQEFRF VWEERIFSLG ISIGLATFSG
GEQTFSDVLR MADTACFLAK DKGRNRVQLY ASDDKNLEKR RGEMGWVERL HKALDKQRFV
LYSQKILPLS APSSTSHYEI LLRMKGENGE LVPPMAFIPA AERYGLMPQL DRWVITNAFA
QYASHPSRGI KDTCTINLSG ASICDEHLYD FVVDQFKQSQ IDPAGICFEI TETVAIANLT
QAATLIRKLK ELGCRFSLDD FGSGMSSFTY LKHLLVDYLK IDGAFIKGML SDPIDHAMVE
AINHIGHVMK IQTIAEWVEE ESFLEALRKI GVDYAQGYAI EEPRPAITLC H