Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2039 |
Symbol | |
ID | 3786729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2333764 |
End bp | 2335062 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637812127 |
Product | diguanylate cyclase |
Protein accession | YP_412725 |
Protein GI | 82703159 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000319282 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGCGC CCTCACGCAA TTTATTACTC TGCAATAAAG GAAAGACCGA ATTGAACCAT TCGTTCTCCA TGCTGCAAGC CATCGTAGAA TCGACGCCCT ATGGGCTTCT GGTGACCAAC AAGCACGGGC ACTTGCTTTG CTATAACCAG CCCTATATGG ATATGTGGCG CATTCCCCCT GAGGTCATGG TCAATGCGAG ACACCAGATA ATTTTCGGGC ATTACTCCAG TCAATTAAGA GATCCACAAC AATTCCTTCA CTCAACCGAG GTAATTTATG GCACGTGGTT GCCCGAAAGT TTCGACATCC TCGAATTCCT CGATGGGCGG GCATTCGAAC GCCATACAAA AGCCACGACC TTGGAAGGGC CAAACATGGT GCGTGTTTGG AGCTTCAAGG ACATCACTGA ACGCAGGCAG GCGGAGGCCT ATAAAGCGCA GTTGGCCGCA ATGGTCGAGT CCTCGGACGA TGCAATCATT GTCAAGGACC TGAACGCCAT TATCACAAGC TGGAATGCCG GGGCGGAACG GATTTTCGGG TACCAGGCAA GCGAAATAAC AGGCTCTTCC ATACTGGCCT TAATTCCCCC GGAGCGTCAT GAAGAAGAGA AGGGGATCAT GAGCCTCGTC AAGAGCGGGA ACCGCGTGGA CCATTTTGAA ACCATGTGGT GGGGAAAGGG CAAGAAACCG ATTGATGTCT CGGTCACGAC ATCGGCAGTG AAGGATAGTG ACGGCAATAT TGTGGGCATA TCACAGATTG CGCGGGATAT CACCAAGCGC AAGGAATCAC AGAAACGCAT CGAGTATCTT GCCCATTACG ATCCGCTAAC CGGGCTGCCC AATCGCGCGT TGCTCGCAGA CAGAATGAAA ACTGCCATCG AGAATGCCAA GCGTTATTCC TTCCAGTTGG CAGTCCTGTT TGTGGACCTT GACCACTTCA AGCTAATCAA TGATTCGCTT GGTCATGAAA TCGGTGACAA GCTGCTCAAG ATCGTAGCCG AGCGCATGCG ATCCAGGCTG CGTCAAACCG ATACCGTCAG CCGGCTGGGA GGTGACGAAT TTATTATCCT GCTGAGCCGG ATTAATGCAG CATCCGATGC CGCTTGCGTC GCCGAGAAAA TTATTGCAGC ACTATCCCAG CCCTACCACC TTGAACAGCA TGAGTTGGGG CTGGGCGCGA GCATCGGGAT CAGCATTTAT CCGGACAGCG GCAAGGATAC CAGCAGCTTG TTGCGCAGTG CCGATGAAGC GATGTACTCT GCCAAAGGAC AGGGTAGGAA CCGCTATCAT GGTCCGTAG
|
Protein sequence | MIAPSRNLLL CNKGKTELNH SFSMLQAIVE STPYGLLVTN KHGHLLCYNQ PYMDMWRIPP EVMVNARHQI IFGHYSSQLR DPQQFLHSTE VIYGTWLPES FDILEFLDGR AFERHTKATT LEGPNMVRVW SFKDITERRQ AEAYKAQLAA MVESSDDAII VKDLNAIITS WNAGAERIFG YQASEITGSS ILALIPPERH EEEKGIMSLV KSGNRVDHFE TMWWGKGKKP IDVSVTTSAV KDSDGNIVGI SQIARDITKR KESQKRIEYL AHYDPLTGLP NRALLADRMK TAIENAKRYS FQLAVLFVDL DHFKLINDSL GHEIGDKLLK IVAERMRSRL RQTDTVSRLG GDEFIILLSR INAASDAACV AEKIIAALSQ PYHLEQHELG LGASIGISIY PDSGKDTSSL LRSADEAMYS AKGQGRNRYH GP
|
| |