Gene Nmul_A1957 details


Gene Information

Locus tag: Nmul_A1957
Symbol:
ID: 3785135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2248290
End bp: 2251217
Gene Length: 2928 bp
Protein Length: 975 aa
Translation table: 11
GC content: 51%
IMG OID: 637812045
Product: diguanylate cyclase/phosphodiesterase
Protein accession: YP_412644
Protein GI: 82703078
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain
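
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not translated. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 2248290
end_bp = 2251217
gene_length_bp = 2928
protein_length_aa = 975

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that adds no residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 2928 bp, which is 976 codons, giving 975 residues plus a stop.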


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCAGG ATAAGTATTT TCATAATCAG GCTATGCAGA GAAATGGAGC GATAGCACCT 
GAAAAGCCCC AAAAATTTCT GGGACTGAAA TGGAAAGTTT TGTTGCTGAG CAGCCTTATA
CTGATTGCCA TCGTAGTCTC ATTCACTGGG ATTACCTATC TGAGCCTGAT GGATGACTTC
GAGAGCCAGA GAAATGCCCA GCACCAGCGC TATGCCAAGG AAGTTGAAGG ACTGATCGAC
CAGGTTTCGA TAAATTTGCA TCAACTCGCC GGATTGATTC CTTTCCTGGA AGGAATGGAT
AAAAGTCTGC TCTCAGGTAA CAAGGAATAC GTCACCCAAG CATTTGATCC ATATTGGTCG
CCTTTGCAAC TCAATAAAGG TATCGAATTG CTGCGTTTTT ACGATAGCTC AAACCAGCAA
TTGGCAGGAT GGGGAACTTC CCAGCCTAAT ACTTACGACG CATTGATGTC AGCCTGGGTG
CACGAGGTGA ATGCCCAGGA AAAGCCGATG AGCCATCTCA GCTGCCCCAC CAGCTGCATG
CAGTTTGCCG TGGCACCCTT GCTCGTGGAA GGTAACAACG TCGGCGTGAT CGTCATCGGA
ACGCCCCTGG TCGATGTGAT TCTGGGTTTC AAGGATATCT CGGGTGCCGA TATCGCATTG
CTGGTGAGGG AAAAAGGCGA TTTGCCCGAA AGCAACAAGG TGAAAATTGC AAACTGGGAT
GTCACGCTTG CGGCACACAG CAGGGAAATG AACATTACTG TTCTGGATGA AGTTGCAGCG
AATTATCCGG ACCTGGAGAG TCTGGAGGAG GGCATCCTGG TTTCGTGGCA CGATAGACAC
CTCCAGATAA AGCCGCTGTT TCTGGAGAGA ATGGCCGTGT CGGAAGATGG TGCTCGCTTC
ATGGTTGTAA CTGACATCAC CTCCACGATC CGCACCATTC ACAGCTCGAC CCAGCAAAAC
ATGATAATCG GCGTGGTCGG GCTCATATTC TCTGAAATAC TGCTGTTCAT CATTCTCACC
AAGCCACTGT CGAGGCTCAA GCACATCGTT TTTACTCTCC CTCTTCTGGC CCGCAGCAGT
TTCAAGAATT TTCGCCTGGA GCTTCGCTCT GCCGGCCAGA AACGATGGAT GAAAGATGAA
ATTGACCTGC TGGATGAAAC AGCGGTGGCC CTGTCACATC AACTGGAGAA GCTCGAGGAT
CAGGTGGCGG ACAGGACCCG TATACTGGCC AGCAAGATGG ACGAGCTAAG CAAGGAGAGG
GATTTCATTA ACAATCTGCT GGACATTGCG CAAGTAATCG TAATCACGCA AAAGGCGGAT
GGCGAGATTC TCACACTCAA TGCCTATGGT GAAACGCTTA TTCAGTATAC GGAAAAAGAA
CTGCAGGGAA CGCCCTTTCT CCATCTTCTG GCACTCGATG GAAATCTGCA CGACCTTCCC
GTTCACCTGG AAGAGGTGCG GTGCCAACGA AGAGACCAGC TTCGGCACGA AGCGAATGTC
GTATGCAAGG ATGCTTCCAT TCGTAATATT CTGTGGCTGC ATTCACGCCT CACCCGGCAC
AGCGAGGATG ACCCCGCAAT GTTATCGGTG GGTCTCGACA TGACGGAACA CAAGCGTGCT
GAAGGGCGTC TTGCCTGGCT GGCGGATCAT GATCCGCTCA CGGATCTTTT CAATCGCCGC
CGCTTCCAGG AAGAACTGGA ACAGATGCTG AATCTTGCCG CGCGTTACGG GTACTCGGGA
GCCTTGCTCT TTTTTGACCT TGATCAGTTC AAATACATCA ACGATACCAG CGGGCATCAG
GCGGGGGATG CCTTGCTTAA AATGGTCGCG CGTCTGCTGC TTGGCAATAT TCGCAGCGTC
GACATACTTG GCCGCCTGGG AGGCGATGAA TTCGCAGTGA TTTTGCCCCA AACGACAGCC
GAAGGGGCGA TAGAAGTGGC AAAAAATACG CTTGCGAGCC TGAATCAGGG AAAGATTACG
ATAAATGGTC GCACTCATAA AGCGTCAGCC AGCGTCGGTA TCGCACTTTT TCCGGAGCAC
GGCAGCAATG TCCATGATCT CCTGGCCGCA GCCGATCTCG CCATGTATCA GGCAAAGGAA
GCCGGACGAG GAGGATGGCA TTTGTTTTCT GACGAAGAGA AAACACGTGA ACGCATGCAT
ACTCTTGTCT ATTGGAAGGA GAAGATTGAG TATGCTCTTT CACACGAACG TTTCCTGTTC
TATTTCCAGC CCATCATGCA TGTTCGACGC AGGACCATCG ATCATTACGA GGTGCTGCTT
CGCATGATCG ACAATGATGG AACTGTTCTT GCACCCCAGT TTTTCATCCC CGCCGCGGAA
CAGACAGGTC TTATTCATGC CATCGACCAT ATGGTTTTAC GTAAATCCAT TGCGCAATCA
GCTGAAATAC AACGCGCCGG TCAGTGCATC CGTTTTTCCA TAAACTTGTC GGCGCATGCA
TTTCACGATC CGGAACTGCT GCCGATACTG AAAGATGCAT TTGCCGAGTA TGGCGCGGAT
CCATCGAATT TCATGTTTGA AATAACCGAG ACAGCAGCGC TTGAGGATTT GCCCGCGGCG
CGGGAACTCA TGGAGATGAT TAAAAAGCTG GGCTGCAGTT TCACGCTGGA TGATTTCGGT
GTCGGTTTCT CCTCCTTCTA TTACATCCGG CAACTGCCGA TCGATGTTGT AAAGATCGAT
GGCTCCTTCA TACGAAATCT GGCAGACAGC CCCGATGACC AAATACTGGT GCAGGCTTTG
TGCGATGTGG CAAGGGGATT CGGAAAGAAG ACAACGGCGG AGTTCGTGGA AAATGCAGCG
ACCTTTTCAA TCCTTGAGAA AATGCAGATC GACTATGCCC AGGGATTTTT GATTGGAACG
CCTTCTCCCG CTTATGACAC ATCGTTCAGC GATTTCGCGA AGATGTGA
 
Protein sequence
MLQDKYFHNQ AMQRNGAIAP EKPQKFLGLK WKVLLLSSLI LIAIVVSFTG ITYLSLMDDF 
ESQRNAQHQR YAKEVEGLID QVSINLHQLA GLIPFLEGMD KSLLSGNKEY VTQAFDPYWS
PLQLNKGIEL LRFYDSSNQQ LAGWGTSQPN TYDALMSAWV HEVNAQEKPM SHLSCPTSCM
QFAVAPLLVE GNNVGVIVIG TPLVDVILGF KDISGADIAL LVREKGDLPE SNKVKIANWD
VTLAAHSREM NITVLDEVAA NYPDLESLEE GILVSWHDRH LQIKPLFLER MAVSEDGARF
MVVTDITSTI RTIHSSTQQN MIIGVVGLIF SEILLFIILT KPLSRLKHIV FTLPLLARSS
FKNFRLELRS AGQKRWMKDE IDLLDETAVA LSHQLEKLED QVADRTRILA SKMDELSKER
DFINNLLDIA QVIVITQKAD GEILTLNAYG ETLIQYTEKE LQGTPFLHLL ALDGNLHDLP
VHLEEVRCQR RDQLRHEANV VCKDASIRNI LWLHSRLTRH SEDDPAMLSV GLDMTEHKRA
EGRLAWLADH DPLTDLFNRR RFQEELEQML NLAARYGYSG ALLFFDLDQF KYINDTSGHQ
AGDALLKMVA RLLLGNIRSV DILGRLGGDE FAVILPQTTA EGAIEVAKNT LASLNQGKIT
INGRTHKASA SVGIALFPEH GSNVHDLLAA ADLAMYQAKE AGRGGWHLFS DEEKTRERMH
TLVYWKEKIE YALSHERFLF YFQPIMHVRR RTIDHYEVLL RMIDNDGTVL APQFFIPAAE
QTGLIHAIDH MVLRKSIAQS AEIQRAGQCI RFSINLSAHA FHDPELLPIL KDAFAEYGAD
PSNFMFEITE TAALEDLPAA RELMEMIKKL GCSFTLDDFG VGFSSFYYIR QLPIDVVKID
GSFIRNLADS PDDQILVQAL CDVARGFGKK TTAEFVENAA TFSILEKMQI DYAQGFLIGT
PSPAYDTSFS DFAKM
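
The gene and protein sequences are printed in blocks of 10 characters, so the whitespace has to be removed before they can be compared. The sketch below is one possible way to do that and to translate an excerpt of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which start codons it permits, and this sketch ignores that). The helper names `join_blocks` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
# Both start on a codon boundary (offsets 0 and 2880).
first_line = "ATGCTGCAGG ATAAGTATTT TCATAATCAG GCTATGCAGA GAAATGGAGC GATAGCACCT"
last_line = "CCTTCTCCCG CTTATGACAC ATCGTTCAGC GATTTCGCGA AGATGTGA"

print(translate(join_blocks(first_line)))  # MLQDKYFHNQAMQRNGAIAP
print(translate(join_blocks(last_line)))   # PSPAYDTSFSDFAKM*
```

The two outputs correspond to the first 20 and the last 15 residues of the protein sequence above, with the trailing `*` standing for the TGA stop codon.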