Gene Nmul_A1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1709 
Symbol 
ID3784808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1949725 
End bp1953054 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content53% 
IMG OID637811796 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_412399 
Protein GI82702833 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0904197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGTT CTCCCTGGCT TATCGTCTTG CTTTACGTCG CTTTATCGAC CCTCTGGATG 
GCCGTTGCGG GCTATCTGAT ATCACTCATG CTTGAGGATC CGGCTTTGCG GAGCCGCGCC
TATCTGGCAA AGGAACTGGT TCTTATTGCC ATTAGCAGTA TCCTATTCTA TGCATTGCTT
AAACTGGGCA AGGGCGCTAC TACTGCCCGC GAAGCCGATG TGACAGCGGC AGACGTCGCA
GCTTCGGCTG CGGCAGGTTT CCGGCTGAAC CGGTTGATGT TGGCGTTCTT CTCGCTGGCG
ATGATGGCAC CCATTATCAG CATAATGATT ATAAAGATGT ATGGTCCGGA GATAGAGCAA
GGGGCCTATG CTGATCTGCA AACCATCGTG GATCTTAAGG CCGAGCAGAT CGAACTCTGG
CTTGCTGAAC GGCATAACGA CGCGGAGGCA TTGCTGGCGA ACCAGGCGCT TATCGAGCAA
GTAGTCGATC TCAAACGAAG AAGAAACGCG CATGAACTGG AGCTTATCCG TAATCGCCTG
GAGGCAGTGC GGCAAGCCTA CAGCTACGAG TCAGTGATAT TGCTTGATGT CGAAAGCCGG
CCTCTGCTCG TGTTAGGCGA AAAGCATGAA TTGCCACCCA TTACCCGCGA ATTACTATCT
ACCGCGTTAC GTTCGCGCCA GATACAGAGT ACCGATTTCT TTCTGGATGA AAACGGAAAA
CCTCTGCTGG ATATTGCAGT GCCTCTGGTA GCGGAGACAA CAAATAAGGA ACCTGGTGCC
ATTGTGCTCT TACGAGCGGA TCTGGAGCAA TTCCTTCTTC CACTCGTCGA GAAGTGGCCC
CGTATCAGTT GCAGCGGAGA GATACTGCTT ATCACCCAGA AGAATGAAAC CGTCAATTAT
CTTAATAAAT TGCACCGCTT TCAGCGGACA CATGGCACAC ATGGATATCA TGCGCTCGCC
AGGGACGAAC TTGCTAGCGC CATTGCCACC CGGGACGAAA AACAGGGGAC GGTACATGGA
ATCGATTACC GGGGAGAGCA GGTGCTCGCG GCCTATCGAC CCCTTACCGG GACCGGTTGG
CGTTTGCTTG CAAAAATCGA CCAGAATGAA GTGCTGGCGC AGCTATGGAC TCTTGTATTC
TGGATGAGTG CCGTCATTCT CATTGCGGTT ACTGCGGTAA GCGTCGTGCT GCTGCTGTTA
TGGCGTCAAC AGCAACGCGC CCACCAGCTG GCACTGATCA TTCACACCGC GGATCAGGAT
CGCCTTCTCA AGTATTTCTA TGACCTGCCA TTCATCGGCA TGGCGATTAC CTCCCCCGAT
ACCAAACGCT GGTTGCGCTT CAATAATCAG TTTTGCGAGA TGATGGGATA CTCAACTGAG
GAGTTGGCAA AAAAAAGCTG GATTGAGATA ACTCATCCGG ATGATGTCGC TAAAAGTACT
GCCGAGCGCG AGCGCATCCT GAAAGGGGAG TCTGAAGGCT ACGCGATGAA CAAGCGCTTC
ATCCGTAAGG ATGGCTCGAT AATCTTCGCC AATGTGGATG TCAAATGTGT CCGCAGGGAT
GATGGCACGG TGCATTATTT TGTTGCCATG ATCCGGGATA TTACCGAGCA AGAACGCCGG
AAAACCGAGA TTCTGGCAGC GCGACGCCAG CTCCAGGCCA CGCTCGACGC CATTCCCGAT
TTACTGTTCG AGCTTGATGC AGACGGTTGT GTACACGCGT GGCATTCGGT CCGGCGTACC
GAGTTTCCGA CTGTGTCCGG TGAAAGCCTT GTAGGCAAAA AGGTCGCGGA TTTTCTCCCC
ACCGGAGCAG TAGATATTAT CCTGTCGGCA TTGGCGGAGG CACAGGAGAA AGGGCTCTCC
TCGGGGAAAC AATTGGAACT TCGCCTTCCC GGCCGGGAAG ACAAATGGTG GTTCGAGCTT
TCCGTCTCGC GCAAGCATGT CGATTCCGCA GCGGGCCTTC GTTTTATCGT TCTTGCGCGT
GATATTACGG AACGCAAGGC GTCAGAACAA AGAATACTGC ATCTGGCGCA TTACGATTCG
CTCACCGGCC TGCCAAACCG CGCTCTGCTG GCGGACCGGA TGAGAGTGGC CATCAATCGT
GCAGCGCGGC AAGCCAAACG CCTCGCTGTA CTGTTCGTGG ACCTCGATCG TTTCAAGGCA
ATCAATGATT CGCTTGGTCA TGACGTTGGC GACCACCTTC TCAAGGTTGT GGCCGAGCGG
ATGCAAACCT CGATACGCAG CGTAGATACG GTAAGCCGGG TGGGAGGTGA CGAATTTGTT
GTGCTGCTGA ACGAAATTGA AACCGCGGAG GATGCAGCTC GTGTGGCGCA AAAAATCATA
GATGGTCTTT CACAACCGTA CCAGATCGAA AAACATGAAC TGCTGCTGAC CGGAAGCATT
GGCATTTGCA TCTATCCCGA TAACGGCAAG GAACCGAATA TCCTCCTGCG CAATGCGGAT
GCGTCCATGT ATACAGCAAA GGAAGCTGGA CACAACCGTT ACCAGTTCTA CTCAGAAGAC
ATGACAGCGC GGGCGATCGA GCGACTCAGT CTGGAACATG ATCTCCGGGG GGCGGTCGAA
CGCGGTGAAA TGTTCCTGGT TTATCAGCCC CAGATCGAGC TTGGGACAAG CCGCGTTATC
GGGGTGGAGG TGCTGATGCG CTGGCGTCAT CCCGCCCGTG GATTGATTTC CCCTGTTCGC
TTTATTCCCG TTGCCGAGGA TACGGGACTG ATTCTCTCGA TCGGTGAATG GGGTTTGCGC
GAATCGTGCA GGCAGGCGCA GCTATGGTAC GAACGTGGAC TGCTGAATGC GTGTATCTCG
GTCAATGTTT CGGCAGTGCA ATTCCGTCAG ACTGATTTTG TCGGAATCAT TGAAAATGCA
CTTCAGGAGT CCGGTCTGGC GCCCACTAAC CTGGAACTGG AACTTACTGA AAGCGCAGTG
ATGCAAGGGG CGGAACCTGC ACTGAACAAG CTGCGCGAAC TGGATGCGCT TGGGGTGAAA
GTCGCCATCG ACGATTTTGG CACAGGTTAC TCAAGTCTCG CCTATTTACG GCAGTTTACG
GTTGACCGCC TGAAAATCGA TCAATCATTC GTACGGGACG TGCCGGGGAA CAATGATGCC
GAAGCCATTG CAGCGGCAAT CGTGGCAATG GGTCTCAACC TGGGCTTCCG TATCATTGCC
GAAGGCGTAG AAACGGAAGC GCAAGCGGAA TTTCTCCAAA GTGTCTTATG CAAGGAAGGC
CAGGGTTATC TTTTTGCGTG GCCTATGACT GCTATTGAGT TCGAGGCATG GATAGCCGGA
TGGCAAAACC GCGCAGGAAG CACGTCCTAG
 
Protein sequence
MSRSPWLIVL LYVALSTLWM AVAGYLISLM LEDPALRSRA YLAKELVLIA ISSILFYALL 
KLGKGATTAR EADVTAADVA ASAAAGFRLN RLMLAFFSLA MMAPIISIMI IKMYGPEIEQ
GAYADLQTIV DLKAEQIELW LAERHNDAEA LLANQALIEQ VVDLKRRRNA HELELIRNRL
EAVRQAYSYE SVILLDVESR PLLVLGEKHE LPPITRELLS TALRSRQIQS TDFFLDENGK
PLLDIAVPLV AETTNKEPGA IVLLRADLEQ FLLPLVEKWP RISCSGEILL ITQKNETVNY
LNKLHRFQRT HGTHGYHALA RDELASAIAT RDEKQGTVHG IDYRGEQVLA AYRPLTGTGW
RLLAKIDQNE VLAQLWTLVF WMSAVILIAV TAVSVVLLLL WRQQQRAHQL ALIIHTADQD
RLLKYFYDLP FIGMAITSPD TKRWLRFNNQ FCEMMGYSTE ELAKKSWIEI THPDDVAKST
AERERILKGE SEGYAMNKRF IRKDGSIIFA NVDVKCVRRD DGTVHYFVAM IRDITEQERR
KTEILAARRQ LQATLDAIPD LLFELDADGC VHAWHSVRRT EFPTVSGESL VGKKVADFLP
TGAVDIILSA LAEAQEKGLS SGKQLELRLP GREDKWWFEL SVSRKHVDSA AGLRFIVLAR
DITERKASEQ RILHLAHYDS LTGLPNRALL ADRMRVAINR AARQAKRLAV LFVDLDRFKA
INDSLGHDVG DHLLKVVAER MQTSIRSVDT VSRVGGDEFV VLLNEIETAE DAARVAQKII
DGLSQPYQIE KHELLLTGSI GICIYPDNGK EPNILLRNAD ASMYTAKEAG HNRYQFYSED
MTARAIERLS LEHDLRGAVE RGEMFLVYQP QIELGTSRVI GVEVLMRWRH PARGLISPVR
FIPVAEDTGL ILSIGEWGLR ESCRQAQLWY ERGLLNACIS VNVSAVQFRQ TDFVGIIENA
LQESGLAPTN LELELTESAV MQGAEPALNK LRELDALGVK VAIDDFGTGY SSLAYLRQFT
VDRLKIDQSF VRDVPGNNDA EAIAAAIVAM GLNLGFRIIA EGVETEAQAE FLQSVLCKEG
QGYLFAWPMT AIEFEAWIAG WQNRAGSTS