Gene Namu_5214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5214 
Symbol 
ID8450845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5812669 
End bp5815788 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content71% 
IMG OID645044245 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_003204469 
Protein GI258655313 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGACAG AAGGGCAGCT TCACACGGTC GGGGGGTACG GGATGAGGCC AGTCTTTCGG 
CCGGACCAGC GACGGTGTTC GGCGCCGTCC TCCGGTCCGG TCCGGGTGCT GGTCGCCCTG
GGGCTGATCG GGTGGGTGTT GGTGAGCGCG GTGTCCGACC CGCGCCTGGG CCGGGTCGCG
GTCGGACTGG CGGCGACGGT GGCCATCGTC GGCGGGGTCA TCGCCGGGCT GCGCGGCGGG
CCGGCCCGGG GTTGGTGGTG GGCCTCCGGG GCGATCGCGC TCTGGACCGT CGGGGACATG
ATCTGGTGGG CCGATTCGGC GGCCGACCAG GCACCGGGCA TTGTCGCCAC CGTCGGCCTC
TTCGGGTACG CCGGCCCGGT CCTGGTCGCG GCCTGGGTGC TCGACGGCGC GGCCGAGCGG
CCGATGACCT TCATCCGGCA GGTGCTGGAC GGCCTCATCG TCGGCGGCTC GGTCCTGTTC
GCCGCGTGGG TGCTGCTGAT GCATCTCGCG GGCGGCCGAA CCGGCTCCGA GCAGGGGCGA
TGGGCGGCCC TACTGACCCA GACGGCACTG GCCGCGGCCC TGGCCACCAC CCTGGTGATG
ATCGGGTTGA CCTACGCCGA GGGGGCCCGG AGGCCCTGGC TGCTGGCGGC CGGCGGCCTG
GGCCTGCTCG CGCTGACCAG CGCGGTGGCG TTGACCAACC CGGCGACCTG GCCGCTGGCC
GCCGGGTCGG TGGCCGCGCT GCTGCTGCTC GCCGCGGCCA CCTGGCGTGA TCCCGGGCCG
CAGGGGCGAC GACCCCGGGC CGTCCTGACC CCGGCCCAGG AGCTCATTCC CAACGCGGCC
GTCGTGGTGA TCCTGGTGGC CGCCGTCGTC GGCGACATCG TCCGCCGCCC GCCACTGATC
GTCGCCACCC TGGTCCTGTT GTCCATGGTG GCCGGCCGGC GGCTGCTGAT CAGCCAGGAA
AAGCACGCAC ACGCGACGAC CCTGGATCGA GACCTGACCC GGAGCATGGC CGAACTCGAC
GACGTCCAGT CGCTGCTGCA GCGAGCCTTC GAGGACTCGT TGACCGGCGG ATTGTGGACC
TCTATCGACG GTCGAATCCA CCGGGCGAAC CCTGCCTACT GCCGAATGCT CGGCCGCCCG
GAGTCGGACC TGGTCGGCAC TCGCTTGCCC ACGTGGGTCG ACCCGGCGAG TGCCGCACGG
GCTGAGGAGA TGTTCACCCG AGTGGGCACC ATCGACGGCG CCGGTCTGCC GGTCGAGCTT
CGATACCGCA AGCCCGACGG CAAGCCGCTG TGGGCCCAAC ACACGGCGGC GGTGTTCCGC
ACCGTCGAAG GCGCCCCGAT CCAGATGGGG GTGCAGGTCA TCGACGTGAC CGAGGCCCGG
ATGGCGGCCC GCCGGCAGGC CCATCAGGCC CGCTTCCTGG ACGCGATCCT GGAAAACATC
GGTTCCGCGG TCGTGGCCTG CGACGCCCAG GGGCAGGTGA ACCTGATCAA CTTGTGCACC
CGGGAGATGT TCAATCTGCC GCCCGCTCCC TGGATGCCGC CGCCCGGCAT CGAGCAGCCG
CCGGTGTACC ACGCCGATGG GATCACCCCG ATTCCACCCG ACGAGCGGCC GCTGAGCCGG
TCGCTGCGCG GCGAGCTGGT CCGCGACGTG GAGACCGTGT TCGGCGAACC GGGCGCGGAT
CGCCGCATCG TGCTGTCCAG CGCCCACCGC CTCCAGGACG AGGACGGCGC CGTGCTGGGC
GCCGTGCTGG TGATGCACGA CGTGACCGAG CGCCGGCGGG TCGAGTCCGC GCTGATCCGG
CAGGCCCTGC ACGACCCGCT CACCGACCTG GCCAATCGCG CCCTGCTGGG CGACCGGCTG
GATCAGGCGA TCGCCCGCCA GCAGCGCCAG CCCGAGCCGT TCGCCCTGCT GCTGCTGGAC
CTGGACGGAT TCAAACTGGT CAACGACAGC CTGGGTCATC AGGCCGGGGA CCAGGTGCTG
ATCACGGTGG CGACCCGGCT GCGCACGTCC CTGTGGGCCG AGGACACGGT CGCCCGGTTG
GGCGGCGACG AGTTCGCGGT CCTGCTGGAG CGCACCTGCG ACAGCCAGGC GCTGGCCATC
GCCGAACAGC TCCTGGGAGT GCTGCGGCGA CCGGTCCAGG TGCACGCCCA CACCATCACC
CCGGACGCGT CGGTGGGGGT GGCCCTGTCC ACCGGCGACG ACACCGCCGA GTCGATGCTG
CGCAACGCCG ACCTGGCCAT GTACGCGGCC AAGGACGCCG GCAAGGGGGT GGTGCAGGTC
TACCGCGCCT CGATGCACGA GAGCGTGCTG CAGCGGCTGA TCATGGACGC CGAGCTCCGG
CTGGCCATCG ACGAGCAGCA GTTCAGCGTG CACTACCAGC CGATCATCTC GCTGATCTCC
GGGCAGCTGC GCGGGTTCGA GGCGCTGCTG CGCTGGCAGC ACCCGGTCCG CGGCGATATC
CCGCCCAGCA GCTTCATCCC GCTGGCCGAG TCCTCCGGAC TGATCGTGCC GCTGGGCGGG
TGGGTGCTGC GGGAGGCCTG CAGCCAGGCG GCGCGGTGGC GGCGGCTGTT CCCCCCGACA
CGATCCCTGA CCATGTCGGT CAACCTGTCG GTGCGCCAGA TCCAGGAGCC CACTCTGGTG
GCGACCGTGC TCGAGGCCCT CACCGATGCC GGGCTCGAGC CCGGGGATCT ACAGCTGGAG
ATCACCGAGA GCACGCTGGA TCAGCGCAAC CTGATCCTGG GCGTGCTGCA CGAGTTGCAC
GACAAGGGGA TCAAGCTGGC CATCGACGAC TTCGGCACCG GGTACTCGTC ACTGAGCCGG
CTGCACACGC TGCCGGTGGA CCGGGTCAAG ATCGACCGGT CGTTCATCGA GTTGCTGGCC
GGTTCGAATC CGGCACCGCT GGTCGCCGCG ACCGTGGCCA TGGCGCACAG CCTGGGCATG
CAGACGACCG CCGAGGGCAT CGAGTCGGCC GACCAACTGC CCATGCTCCG GATGTACGGG
TGCGACGACG GGCAGGGCTA CTACTTGGGC CGGCCGATGT CCGCCGAGGC CGCGACCGCG
CTGATCCACA CCCAGCTCCG GCGGGAGAGC GCGGACTCGT GGCCGCCGGC CGCCCGTTGA
 
Protein sequence
MGTEGQLHTV GGYGMRPVFR PDQRRCSAPS SGPVRVLVAL GLIGWVLVSA VSDPRLGRVA 
VGLAATVAIV GGVIAGLRGG PARGWWWASG AIALWTVGDM IWWADSAADQ APGIVATVGL
FGYAGPVLVA AWVLDGAAER PMTFIRQVLD GLIVGGSVLF AAWVLLMHLA GGRTGSEQGR
WAALLTQTAL AAALATTLVM IGLTYAEGAR RPWLLAAGGL GLLALTSAVA LTNPATWPLA
AGSVAALLLL AAATWRDPGP QGRRPRAVLT PAQELIPNAA VVVILVAAVV GDIVRRPPLI
VATLVLLSMV AGRRLLISQE KHAHATTLDR DLTRSMAELD DVQSLLQRAF EDSLTGGLWT
SIDGRIHRAN PAYCRMLGRP ESDLVGTRLP TWVDPASAAR AEEMFTRVGT IDGAGLPVEL
RYRKPDGKPL WAQHTAAVFR TVEGAPIQMG VQVIDVTEAR MAARRQAHQA RFLDAILENI
GSAVVACDAQ GQVNLINLCT REMFNLPPAP WMPPPGIEQP PVYHADGITP IPPDERPLSR
SLRGELVRDV ETVFGEPGAD RRIVLSSAHR LQDEDGAVLG AVLVMHDVTE RRRVESALIR
QALHDPLTDL ANRALLGDRL DQAIARQQRQ PEPFALLLLD LDGFKLVNDS LGHQAGDQVL
ITVATRLRTS LWAEDTVARL GGDEFAVLLE RTCDSQALAI AEQLLGVLRR PVQVHAHTIT
PDASVGVALS TGDDTAESML RNADLAMYAA KDAGKGVVQV YRASMHESVL QRLIMDAELR
LAIDEQQFSV HYQPIISLIS GQLRGFEALL RWQHPVRGDI PPSSFIPLAE SSGLIVPLGG
WVLREACSQA ARWRRLFPPT RSLTMSVNLS VRQIQEPTLV ATVLEALTDA GLEPGDLQLE
ITESTLDQRN LILGVLHELH DKGIKLAIDD FGTGYSSLSR LHTLPVDRVK IDRSFIELLA
GSNPAPLVAA TVAMAHSLGM QTTAEGIESA DQLPMLRMYG CDDGQGYYLG RPMSAEAATA
LIHTQLRRES ADSWPPAAR