Gene Nmul_A1605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1605 
Symbol 
ID3784837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1840420 
End bp1842486 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content54% 
IMG OID637811694 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_412298 
Protein GI82702732 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0013251 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA CGCTGGAACT GAGCCAATCA CTCTCCTTGC TGGAGACAGT CTTAAAGGCG 
ACGGCTGACG GCCTCCTTGT GGCGGACAAG CTGGGCAAGG TGCTTTGCTA CAACCAGCTT
TACGTGAATA TGTGGCACAT TCCAGGTGAA CTCCTGGCTC ATGCGAGACA CCAGGCAATC
CTCGACTATT GCGCCGGGCA ATTAAGAGAT CCCGAGCAAT TCCTGCACTC GACCGAAGAA
ATTTATGGCA CTTGGACACC GGAAAGCTTT GATATCTTCG AATTTTGTGA TGGGCGGGTA
TTTGAGCGGT ATACAAAAAC CAAAACCCTT GAAGGGTCAA ACATGGTTCG CGTTTGGAGC
TTCAGGGATA TTACTGAACG AAGACAGGGG GAGAGCTACA AAGCGCAGTT AGCGGCAATC
CTCGACTCCT CGAACGATGC GCTCATTATC AAGGATCTCA ATGGCATCAT TAGCGGGTGG
AACACCGGTG CGGAACGGAT TTTTGGTTAC CGTGCGAGCG AAATCATAGG TAGTCCGATC
TCCCGACTAA TTCCACCCGA TCGCCTGGAG GAAGAGGACA GAATCATGAA CCTCGTCAAG
AGCGGAAAAC AAACAAATCA TCTGGAAACC GTGCGATGGG GGAAGGGCAA GAAACCGATT
GATGTCTCCG TCACGATATC CCCGGTGAAG GATAACGCAG GCCACATTGT CGGTGCATAC
AAGATAGCGC ACGATATCAC TCAGCGCAAG GAATCGCTAA AACGTATCGA GTATCTCGCC
CATTACGATT TGCTGACCGG GCTGCCCAAC CGCGCGCTGT TCACCGACCG CATGAGGATC
GCGATCGAGA ACGCCAGCCG CTATGCCTTC CGGCTGGCAT TACTATTCGT GGACCTCGAT
CGCTTCAAGC TGATCAACGA TTCGCTTGGT CATGCAATCG GCGACAAGCT GCTCAGGGCA
GTTGCAGAGC GCATGCAGTC CACGGTTCGC CAGGCGGATA CAGTCAGTCG GCTGGGGGGG
GATGAGTTCG TTGTCCTGCT GGGCCGGATT CATACAGCGA CGGAGGCGGC GCGAGTTGCC
GAGAAGCTTA TTGCAGTCTT GTCTCAGCCC TATCAAATAG AACAGCATGA ACTCTTGCTG
ACCGCAAGCG TCGGGATCAG CATTTATCCG GATAGCGGTA AGGACGTCAA CAGCTTGATG
CGCAATGCCG ACGTTTCGAT GTATTCGGCC AAGGGGCAGG GCAGGAACCG CTACCAGTTC
TATTCTGAAG ATTTGACTTC CGGCGCAGAC GAGCGGCTCA GGCTGGAATA CGACCTGCGC
AGCGCTCTTG CGCGTGACGA AATTTTCCCG GTCTACCAGC CACAGCTGGA GCTTGCTACC
GGACGCGTGG TAGGCGTGGA GGCGCTCATG CGCTGGAAGC ACCCCGAGCG AGGGCTAGTC
TCCCCTGCAA GCTTTATTCC CGTAGCCGAG GACAGCGGTT TGATTCTGTC TCTGGGGGAG
CACATCCTGA GGGAATCCTG CCTGCAGGCA CGCCAATGGC ATGAGCAGCA GGGATTCGAG
GGAACAATCG CGGTAAACGT TTCAGCAGTA CAATTCCGTC AGAATGATTT CACGGATGTT
GTACTGCGCG CACTTGCAGA CAGCGGTCTT TCGCCAGAAT GCCTGGAACT CGAGTTGACC
GAAAGTGTGG TGATGCATGG AGTGGAATCG GCTACACAGA AAATGTGCTT CCTGGAGTCC
CGGGGTATAA AACTGGCTAT TGACGATTTC GGTACCGGCT ATTCCAGTTT GTCCTATTTG
CGGCAATTTG CCATTGACCG GCTAAAAATA GATCGAAGCT TCGTTCGTGA TCTCCCCCAG
GACACTGATG CCAAAGCTAT TATTCGCGCC ATTGTGGGAA TGGGGCGCAG CCTTGGTTTA
CGCGTGATAG TTGAAGGTGT GGAGACGGAA GGGCAAGCGG AGTATTTGCG AAGCGTTCAA
TGCGATGAAA GCCAAGGCTA TCTGTATGCA AGGCCCATGA GGCCAGATGA TTTTGAGGCT
TGGGTAAAAA CTAGGAATCC GGTTTAG
 
Protein sequence
MKETLELSQS LSLLETVLKA TADGLLVADK LGKVLCYNQL YVNMWHIPGE LLAHARHQAI 
LDYCAGQLRD PEQFLHSTEE IYGTWTPESF DIFEFCDGRV FERYTKTKTL EGSNMVRVWS
FRDITERRQG ESYKAQLAAI LDSSNDALII KDLNGIISGW NTGAERIFGY RASEIIGSPI
SRLIPPDRLE EEDRIMNLVK SGKQTNHLET VRWGKGKKPI DVSVTISPVK DNAGHIVGAY
KIAHDITQRK ESLKRIEYLA HYDLLTGLPN RALFTDRMRI AIENASRYAF RLALLFVDLD
RFKLINDSLG HAIGDKLLRA VAERMQSTVR QADTVSRLGG DEFVVLLGRI HTATEAARVA
EKLIAVLSQP YQIEQHELLL TASVGISIYP DSGKDVNSLM RNADVSMYSA KGQGRNRYQF
YSEDLTSGAD ERLRLEYDLR SALARDEIFP VYQPQLELAT GRVVGVEALM RWKHPERGLV
SPASFIPVAE DSGLILSLGE HILRESCLQA RQWHEQQGFE GTIAVNVSAV QFRQNDFTDV
VLRALADSGL SPECLELELT ESVVMHGVES ATQKMCFLES RGIKLAIDDF GTGYSSLSYL
RQFAIDRLKI DRSFVRDLPQ DTDAKAIIRA IVGMGRSLGL RVIVEGVETE GQAEYLRSVQ
CDESQGYLYA RPMRPDDFEA WVKTRNPV