Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1957 |
Symbol | |
ID | 3785135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2248290 |
End bp | 2251217 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637812045 |
Product | diguanylate cyclase/phosphodiesterase |
Protein accession | YP_412644 |
Protein GI | 82703078 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCAGG ATAAGTATTT TCATAATCAG GCTATGCAGA GAAATGGAGC GATAGCACCT GAAAAGCCCC AAAAATTTCT GGGACTGAAA TGGAAAGTTT TGTTGCTGAG CAGCCTTATA CTGATTGCCA TCGTAGTCTC ATTCACTGGG ATTACCTATC TGAGCCTGAT GGATGACTTC GAGAGCCAGA GAAATGCCCA GCACCAGCGC TATGCCAAGG AAGTTGAAGG ACTGATCGAC CAGGTTTCGA TAAATTTGCA TCAACTCGCC GGATTGATTC CTTTCCTGGA AGGAATGGAT AAAAGTCTGC TCTCAGGTAA CAAGGAATAC GTCACCCAAG CATTTGATCC ATATTGGTCG CCTTTGCAAC TCAATAAAGG TATCGAATTG CTGCGTTTTT ACGATAGCTC AAACCAGCAA TTGGCAGGAT GGGGAACTTC CCAGCCTAAT ACTTACGACG CATTGATGTC AGCCTGGGTG CACGAGGTGA ATGCCCAGGA AAAGCCGATG AGCCATCTCA GCTGCCCCAC CAGCTGCATG CAGTTTGCCG TGGCACCCTT GCTCGTGGAA GGTAACAACG TCGGCGTGAT CGTCATCGGA ACGCCCCTGG TCGATGTGAT TCTGGGTTTC AAGGATATCT CGGGTGCCGA TATCGCATTG CTGGTGAGGG AAAAAGGCGA TTTGCCCGAA AGCAACAAGG TGAAAATTGC AAACTGGGAT GTCACGCTTG CGGCACACAG CAGGGAAATG AACATTACTG TTCTGGATGA AGTTGCAGCG AATTATCCGG ACCTGGAGAG TCTGGAGGAG GGCATCCTGG TTTCGTGGCA CGATAGACAC CTCCAGATAA AGCCGCTGTT TCTGGAGAGA ATGGCCGTGT CGGAAGATGG TGCTCGCTTC ATGGTTGTAA CTGACATCAC CTCCACGATC CGCACCATTC ACAGCTCGAC CCAGCAAAAC ATGATAATCG GCGTGGTCGG GCTCATATTC TCTGAAATAC TGCTGTTCAT CATTCTCACC AAGCCACTGT CGAGGCTCAA GCACATCGTT TTTACTCTCC CTCTTCTGGC CCGCAGCAGT TTCAAGAATT TTCGCCTGGA GCTTCGCTCT GCCGGCCAGA AACGATGGAT GAAAGATGAA ATTGACCTGC TGGATGAAAC AGCGGTGGCC CTGTCACATC AACTGGAGAA GCTCGAGGAT CAGGTGGCGG ACAGGACCCG TATACTGGCC AGCAAGATGG ACGAGCTAAG CAAGGAGAGG GATTTCATTA ACAATCTGCT GGACATTGCG CAAGTAATCG TAATCACGCA AAAGGCGGAT GGCGAGATTC TCACACTCAA TGCCTATGGT GAAACGCTTA TTCAGTATAC GGAAAAAGAA CTGCAGGGAA CGCCCTTTCT CCATCTTCTG GCACTCGATG GAAATCTGCA CGACCTTCCC GTTCACCTGG AAGAGGTGCG GTGCCAACGA AGAGACCAGC TTCGGCACGA AGCGAATGTC GTATGCAAGG ATGCTTCCAT TCGTAATATT CTGTGGCTGC ATTCACGCCT CACCCGGCAC AGCGAGGATG ACCCCGCAAT GTTATCGGTG GGTCTCGACA TGACGGAACA CAAGCGTGCT GAAGGGCGTC TTGCCTGGCT GGCGGATCAT GATCCGCTCA CGGATCTTTT CAATCGCCGC CGCTTCCAGG AAGAACTGGA ACAGATGCTG AATCTTGCCG CGCGTTACGG GTACTCGGGA GCCTTGCTCT TTTTTGACCT TGATCAGTTC AAATACATCA ACGATACCAG CGGGCATCAG GCGGGGGATG CCTTGCTTAA AATGGTCGCG CGTCTGCTGC TTGGCAATAT TCGCAGCGTC GACATACTTG GCCGCCTGGG AGGCGATGAA TTCGCAGTGA TTTTGCCCCA AACGACAGCC GAAGGGGCGA TAGAAGTGGC AAAAAATACG CTTGCGAGCC TGAATCAGGG AAAGATTACG ATAAATGGTC GCACTCATAA AGCGTCAGCC AGCGTCGGTA TCGCACTTTT TCCGGAGCAC GGCAGCAATG TCCATGATCT CCTGGCCGCA GCCGATCTCG CCATGTATCA GGCAAAGGAA GCCGGACGAG GAGGATGGCA TTTGTTTTCT GACGAAGAGA AAACACGTGA ACGCATGCAT ACTCTTGTCT ATTGGAAGGA GAAGATTGAG TATGCTCTTT CACACGAACG TTTCCTGTTC TATTTCCAGC CCATCATGCA TGTTCGACGC AGGACCATCG ATCATTACGA GGTGCTGCTT CGCATGATCG ACAATGATGG AACTGTTCTT GCACCCCAGT TTTTCATCCC CGCCGCGGAA CAGACAGGTC TTATTCATGC CATCGACCAT ATGGTTTTAC GTAAATCCAT TGCGCAATCA GCTGAAATAC AACGCGCCGG TCAGTGCATC CGTTTTTCCA TAAACTTGTC GGCGCATGCA TTTCACGATC CGGAACTGCT GCCGATACTG AAAGATGCAT TTGCCGAGTA TGGCGCGGAT CCATCGAATT TCATGTTTGA AATAACCGAG ACAGCAGCGC TTGAGGATTT GCCCGCGGCG CGGGAACTCA TGGAGATGAT TAAAAAGCTG GGCTGCAGTT TCACGCTGGA TGATTTCGGT GTCGGTTTCT CCTCCTTCTA TTACATCCGG CAACTGCCGA TCGATGTTGT AAAGATCGAT GGCTCCTTCA TACGAAATCT GGCAGACAGC CCCGATGACC AAATACTGGT GCAGGCTTTG TGCGATGTGG CAAGGGGATT CGGAAAGAAG ACAACGGCGG AGTTCGTGGA AAATGCAGCG ACCTTTTCAA TCCTTGAGAA AATGCAGATC GACTATGCCC AGGGATTTTT GATTGGAACG CCTTCTCCCG CTTATGACAC ATCGTTCAGC GATTTCGCGA AGATGTGA
|
Protein sequence | MLQDKYFHNQ AMQRNGAIAP EKPQKFLGLK WKVLLLSSLI LIAIVVSFTG ITYLSLMDDF ESQRNAQHQR YAKEVEGLID QVSINLHQLA GLIPFLEGMD KSLLSGNKEY VTQAFDPYWS PLQLNKGIEL LRFYDSSNQQ LAGWGTSQPN TYDALMSAWV HEVNAQEKPM SHLSCPTSCM QFAVAPLLVE GNNVGVIVIG TPLVDVILGF KDISGADIAL LVREKGDLPE SNKVKIANWD VTLAAHSREM NITVLDEVAA NYPDLESLEE GILVSWHDRH LQIKPLFLER MAVSEDGARF MVVTDITSTI RTIHSSTQQN MIIGVVGLIF SEILLFIILT KPLSRLKHIV FTLPLLARSS FKNFRLELRS AGQKRWMKDE IDLLDETAVA LSHQLEKLED QVADRTRILA SKMDELSKER DFINNLLDIA QVIVITQKAD GEILTLNAYG ETLIQYTEKE LQGTPFLHLL ALDGNLHDLP VHLEEVRCQR RDQLRHEANV VCKDASIRNI LWLHSRLTRH SEDDPAMLSV GLDMTEHKRA EGRLAWLADH DPLTDLFNRR RFQEELEQML NLAARYGYSG ALLFFDLDQF KYINDTSGHQ AGDALLKMVA RLLLGNIRSV DILGRLGGDE FAVILPQTTA EGAIEVAKNT LASLNQGKIT INGRTHKASA SVGIALFPEH GSNVHDLLAA ADLAMYQAKE AGRGGWHLFS DEEKTRERMH TLVYWKEKIE YALSHERFLF YFQPIMHVRR RTIDHYEVLL RMIDNDGTVL APQFFIPAAE QTGLIHAIDH MVLRKSIAQS AEIQRAGQCI RFSINLSAHA FHDPELLPIL KDAFAEYGAD PSNFMFEITE TAALEDLPAA RELMEMIKKL GCSFTLDDFG VGFSSFYYIR QLPIDVVKID GSFIRNLADS PDDQILVQAL CDVARGFGKK TTAEFVENAA TFSILEKMQI DYAQGFLIGT PSPAYDTSFS DFAKM
|
| |