Gene Daro_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3052 
Symbol 
ID3566227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3296565 
End bp3298136 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content57% 
IMG OID637681523 
ProductGGDEF 
Protein accessionYP_286252 
Protein GI71908665 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.927179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0159246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCAC TTCGCTACCG CCCCACCCTG CTTGCGCTGA TGAGCGCCTG GCTGGGAGTC 
ATTGCATTCG CAACCCACCT CCTGCTCTCG GCCGAAATCG GGCGCTGGGA GCGTAATTTC
GATTCGGAAA TTCTGCGCCT GAGCAGCGAA GTCAAAAACA AGCTGGACAT GAATGAAGCC
GTCCTGTCGG GCTTTGCTGC CTTTCTGCAA GCGGTCGAAC GCAACGACAT GGCCTCGGCC
ACCCGGTATG CGGCTTCAGC AATGGCCTCC TACCCGCACA TCTACATGCT GGAGGTGGCG
CGCCAGGTCC CACTCGCCGA GCAAGCTGAA TTCCAGTCCA CCTTGCGCCG GGAATGGCTG
CCCAGTTTTA CGCTGAAGGA TTTTCCGACA ATCACCCAGC GGCCCTCCCA AAAGCTGCGC
ATGCAGAACT ACACCTGGCC AATCCTGTTC ATGTACCCAC CCTTACCTGA AGCAGAGGAG
ATTTTTGGTG TGCGACTGGA AACGGTCGAC TACCTTGGCC ATTCCTTGGC GCTGGCCCAC
AAAAGTTCAC GACCGGTCGC TTCTCCGATT TTCAAGATGT TCGAGGGAGG AAGCGCCTAC
ATATTGCTCA AGGAAGTTGA CCGCTCTGAA CAAGCGTCCA GCTCGGCCGG GCAAAGCTTG
TTCGGGAATA CCATGACTGC ACTACTGCTT ATCAAGACCG AGTCCATGCT CCCCGGCAAA
GGCCTCGCGC ACGATGTGGC GACGCTTGCT TACCTGGCAT CGATGACTTC CGATACCAAT
CCTGGGAGCA CGTTGTTCGC CAGACATGCT GAAGCCACTG GAACATTGGA CGAATTCTTC
TTGCCACGCT TCAACCGCCA AATACTGATC GATAACGCCT CGCAACCAAC GCTCATACAG
TTCGAACAAC AGTTACGTTG GCGCGATCTA CTGACGAGAG AGCTTGTCAC CATCTTTGCA
CTGCTGGCCT GCGCCCTGGT CAGCATTCCC ACGCTCACCC TTCGTCACTA CAAGGCACTG
GAACGCGGCG TCCGTGAACA TGAACGATCG GCCTACCTGG CAACGCACGA TCTGCTGACC
GGGCTACCGA ACCGTTTCCT GTTCCTTGAT CGCTTTGAGC AGGCCGTCCA GCAACATGTG
CGAAACGCCA ATTCATTTGC GTTACTCCTA ATCGATCTTG ATCACTTCAA GGAGATCAAC
GACAGCTACG GCCACGAGGT CGGCGACGAG GTATTGGTCG AAACCGCAAA ACGCATGACG
GCCGAGATTC GGGCGTGCGA TACGGTGGCC CGCCATGGTG GTGATGAATT CGTGATCCTT
CTGGCCAATA CGCTTAACGT CGACGATGCA CAAACAGTTG GCGAGAAGCT GCTGGCAGCA
ATTTCGACAC CGATGAAGAC CAGCGCGGGG CAGTTGCGAC TCACCGGCAG CATCGGTGTC
GCCATATACC CAGAGCATGG CACGAGCCTC GATGCAATAC GTCGAGCCGC CGACCAGGCC
ATGTATCAAG CGAAGAAAAT GGGGCGGAAC ATGGTGTCAA CGCCGGGCGG CGACCCGGCA
ACACTGTCGT AG
 
Protein sequence
MPSLRYRPTL LALMSAWLGV IAFATHLLLS AEIGRWERNF DSEILRLSSE VKNKLDMNEA 
VLSGFAAFLQ AVERNDMASA TRYAASAMAS YPHIYMLEVA RQVPLAEQAE FQSTLRREWL
PSFTLKDFPT ITQRPSQKLR MQNYTWPILF MYPPLPEAEE IFGVRLETVD YLGHSLALAH
KSSRPVASPI FKMFEGGSAY ILLKEVDRSE QASSSAGQSL FGNTMTALLL IKTESMLPGK
GLAHDVATLA YLASMTSDTN PGSTLFARHA EATGTLDEFF LPRFNRQILI DNASQPTLIQ
FEQQLRWRDL LTRELVTIFA LLACALVSIP TLTLRHYKAL ERGVREHERS AYLATHDLLT
GLPNRFLFLD RFEQAVQQHV RNANSFALLL IDLDHFKEIN DSYGHEVGDE VLVETAKRMT
AEIRACDTVA RHGGDEFVIL LANTLNVDDA QTVGEKLLAA ISTPMKTSAG QLRLTGSIGV
AIYPEHGTSL DAIRRAADQA MYQAKKMGRN MVSTPGGDPA TLS