Gene Namu_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4044 
Symbol 
ID8449663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4458084 
End bp4460192 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content71% 
IMG OID645043089 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_003203325 
Protein GI258654169 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0246164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.385301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCATCA GCGACGAGGC CATCCGGGCC TTTGCCGCCG AGTGGGCGCG AACCATCGCC 
GGCACCAGTT ACGTGCCACT CGGCCTGGTC GAGCTTGAGA GCACGCTGAC CGAGCTGACC
GGCGAGTTGC GGTGCGTGCT GGCCAGTGAG CCGTTCGATC CGGCCCCCGC CGCCCGGGTC
GGGGCTGCGC TGGTCGATCG CCACTTCACC CACCCGGACA GCATCGCCTG CACCGTCGGC
CTGGTCTCGC GTTCGTGGGG TGGGGTCAAC GATCCGGACC TGCGGGATCG GGTCGCCGGC
CTGGCCGGCG CGCTGGCCGC GGGCTACGCG GAACGACTCC AGGAACGGAC GTTCGCCGAG
CAGGACGCCA TCCGCCGGGC CGCGATGATC GCCCAGGACG AGGCCGAGCG GGCCGCCCGT
GAGCTCCAGG CCCGGTTCCG GGCCGTGTTC TCCGGCGCGG CCGTCGGCAT CGGCGTCGGC
GACGTCGACG GCCGGATCCT GGACGTCAAC CCGGCCCTGC AGAAGATGCT CGGGTACACC
GTCGAGGAGA TGCGCCGGCG CAACGTCGGC GAGTTCATCC ACCCGGCCGA CACCGCCGAC
GTGTGGCAGC TCTACCAGCA GTTGATCACC GGCACGATCG ACACCTTCCG CACCGCCAAG
CGGTTCGTTC GTCAGGATGG CGACACCGTG TGGACGCAGC TCGCGGTGTC GCTGATCCGG
GACAACCGCG GGCGACCGGA ATTTCAGATC GCCGTCATGG AGGACGTCAC CGATCTGCGC
CGCCTGCAGA TCCGCCTGGA GCACCAGGCC CACCACGACG CGTTGACCGG GCTGGCCAAC
CGGGCCCTGT TCCAGCAGCG TCTGGGGGAG CTCGCCGCGA CCACGGACCC GACCGAGCGG
ATCGGCCTGT GCCTGCTGGA CCTGGACGGG TTCAAGGCGA TCAACGACAG TGTCGGGCAC
GGTGTCGGCG ACCGGGTGCT GGTGGAGATC GCGGCCCGGC TCGGCCAGAC CGTCGACGGC
GACCGGCAGC TGGTCGCCCG CCTCGGCGGC GACGAGTTCG TCATCCTGGT CGCGGGCACG
TCGGACGCCC AGGACATGGT CGCCGTGGCC GAGGCGGCCC TCGCGGCGGT GGCCCGACCG
GTCTCGGTGG CGGGCGGCGT CTTCAACGTC ACCGCCAGCG CCGGGCTGGT CGAGCGGACG
GCCGCCGGCG CCGATCCCGC CGAGCTGGTT CGCGCGGCCG ACATCACCCT GTACTGGGCC
AAGGCCGAGG GCAAGGCTCG ATGGGCCATC TTCGATCCCG AGCGCAGCCT GCGCGAGGTC
GACCAGTACA CGCTCGCCCA GTTGCTGCCC ACCGCGCTGG ACCGGGGCGA GTTCCGCCTG
CACTATCAGC CCCTGATCGC CCTGGCCGAC GGCCGGTTCA CCGGTTTCGA GGCGCTGCTG
CGGTGGGAGC ACCCGCGGCT GGGTCGGCTC CGGCCGGATC GGTTCATTGC CGCCGCCGAA
GAGTCGGGCA TCATCGTGCC GATCGGCCAG TGGGTCCTGC AGGAGTCGTG CCGGCAGGCT
CGCCGGTGGT ACGAGATCAC CGGCATCCGG CCCTGTGTCA GCGTCAACGT GGCCCTGCGC
CAGCTGTCGG ACGGCGGCCT GCTCGACGAC GTGCTGCGGG TGATCGAGGC CGAACAGCTC
ACGCCCGACC AGCTGCAGCT CGAGCTCACC GAACGTGCCG TCATCGGATC GGACCACGAA
CCGCTCGCCG AGCTGCAGGC TCTGGCCGCG CTCGGGGTGG GGATCGCGAT CGACGATTTC
GGCACCGGCT ACTCGAACAT GACCTACCTG CGCCGGTTGC CGATCACGGC GCTCAAACTC
GACCAGTCGT TCGTCAGCGA GCTGCGGCCC GACGCCGCGG ACCAGACGGA CGCCAAGATC
GTCCGGTCGA TCCTGACCCT GGCGCACGAC CTGGGCCTCA CCGTCACCGC CGAGGGGGTC
GAAACCGCCG CCCAGGCCCG GGCCCTGCGC GAACTCGGGT GCGATCAGGC GCAGGGGATG
TACTTCGGCG CGCCTGTCCC CGCCGACGCA CTCCTGCGCA CGCCCGATGC CCCCGGCGAA
ACGCGATGA
 
Protein sequence
MSISDEAIRA FAAEWARTIA GTSYVPLGLV ELESTLTELT GELRCVLASE PFDPAPAARV 
GAALVDRHFT HPDSIACTVG LVSRSWGGVN DPDLRDRVAG LAGALAAGYA ERLQERTFAE
QDAIRRAAMI AQDEAERAAR ELQARFRAVF SGAAVGIGVG DVDGRILDVN PALQKMLGYT
VEEMRRRNVG EFIHPADTAD VWQLYQQLIT GTIDTFRTAK RFVRQDGDTV WTQLAVSLIR
DNRGRPEFQI AVMEDVTDLR RLQIRLEHQA HHDALTGLAN RALFQQRLGE LAATTDPTER
IGLCLLDLDG FKAINDSVGH GVGDRVLVEI AARLGQTVDG DRQLVARLGG DEFVILVAGT
SDAQDMVAVA EAALAAVARP VSVAGGVFNV TASAGLVERT AAGADPAELV RAADITLYWA
KAEGKARWAI FDPERSLREV DQYTLAQLLP TALDRGEFRL HYQPLIALAD GRFTGFEALL
RWEHPRLGRL RPDRFIAAAE ESGIIVPIGQ WVLQESCRQA RRWYEITGIR PCVSVNVALR
QLSDGGLLDD VLRVIEAEQL TPDQLQLELT ERAVIGSDHE PLAELQALAA LGVGIAIDDF
GTGYSNMTYL RRLPITALKL DQSFVSELRP DAADQTDAKI VRSILTLAHD LGLTVTAEGV
ETAAQARALR ELGCDQAQGM YFGAPVPADA LLRTPDAPGE TR