Gene Daro_1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1976 
Symbol 
ID3570225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2124344 
End bp2127235 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content58% 
IMG OID637680447 
ProductPAS:GGDEF 
Protein accessionYP_285192 
Protein GI71907605 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.616763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTGAGT CCGCTGGCCG GGATGGCCAG CGTTTACTTG GCGCTCTGCA GGGTAGTGGG 
CATTCCGTAG TGACATGTCA GGTTTCGCAG CGTGCGGATT TGAAGCGTGC ATTGCGCGGT
GAGGTGTGGG ATGTGGCGCT CCTCAGTTGC TCACTTGCAG AGTTGCCTGC TGCAGAAGCG
ATTGTGTTGT TTCGTGAACA CGCCCCGCGT ATGCCGGTGA TCCTGACCAT CGAGCGAGGG
GCGCAGGAGT TTCCGTTCGA ACTGCTGGAA AATGGCGCCT GTGACTTTGT CTTCAAGAGC
AATCCGGCTC GCCTGCTTGC GGTGATCGAG CGCGAATGCG CCAACGTCAA ACTGTCGCGT
GAAGAGCAGC AGGTTCTGGT CAGCCCCGAG CGTATTGAAC AGATCGATCA GGGGGTTGCC
CGCTTTTTCC AGTTGGCCAG CAATACCCCC GAATGTTATT GGCTGACCGA TGCGGCTACC
CAGCGCGTGA CCTATGTCAG TGCGGGGTAC GAAAAGATCT TGGGGCGCCA TGTCGAAGCG
CTTTATGCGG ACTCTCACGA CTGGCTGAAC TATGTTCATC CTGAAGACCA AGATCGAGTG
CTTGCCGCCG TGCGCACTTA TCGACTGGGT GGGCTGGACG CCAAATTTCG CGTTCGGCGC
CCCGGGGATG TCCAGTGCTG GCTGCATGCC CGAAATTTCC CGGTCCGTGA CGAAGAGGGA
AATATCGTCA GCGTTGGCGG CGTGGCCACG GACATCACCA GCTTGCTGGC CGATCAATGG
AAATCTCCGT ATTTCGCCCA TTTCGATGCG CTGACGGCGC TTCCCAATCA GTTGATGTTT
TACGATCAGG TGAAGCGGAT GATTGCGCTG TCCAAGCGCA AGGATCTGCC GCTCAGCCTG
ATGGTGGTCG ATATCGATCG TTTTCGTCAG CTCAACCAGA CGCTGGGTCA TGCCTCCGGA
GACGAGTTGC TGCGTCAGGT GGCTGGCCGC CTTTCCGGCT CGCTTCGTGA GTCGGATATT
CTGGGGCGTC TGGGGACTGA TGTCTTCGGC ATTCTTTTGC CGGATGTGGC CGATACCCAC
CAGGCGAGCA TCGTTGCTCG GCGGATCATC GACACGATGA TCATGCCGGT CCGTGTCGCC
GGCCAGGATG TTTTTGCAAC GGCGGGCGTC GGTATCGTTT TCTACCCGCA GGACGGCAAT
GATGTGCACG AACTGGTAAC CAATGCCGGG ATTGCCGGGC GTCACGCTAA GAACTTGGGA
CGCAACAGCT ACCAATACTA TTTTCCTGGC ATGCATGAAG GCGCCCGGGA CCGGATGTTC
CTGGAAATCG ACTTGCGTCA CGCCACCTTG CGCAATGAGT TTGTGCTGTA TTACCAGCCC
AAGGCCAGTT GCGCCGACGG CCGGATTACC GGTGTCGAGG CCTTGCTTCG CTGGCAGCAT
CCAGAGCGGG GGATCGTGTC GCCTGATCAG TTCATTCCCT TGCTTGAGGA AACCGGACTG
ATCGTTCAGG TTGGTCGCTG GGTCCTTGAG GAGGCCTGTC GGCAGGCTGT CGAATGGCAA
GCCGCCGGGT TGAATATCCC CAGCGTCTCG GTGAACCTTT CGGCCCGGCA GCTGCAGGCT
GAGACATTGC TGACGGACGT TGCGGCAACC CTGGAGAAAA CGGGACTCAA TGCCGCCTGC
CTCGATCTTG AAATCACCGA GAGCATGCTG ATGGATAACG CCGACATGGC GATCCAAACG
TTATCGGCCC TGAAAAAAAT GGGCGTCACC ATTTCGCTTG ATGATTTTGG TACGGGCTAT
TCGAGCCTGG CTTACCTGAA ACGCTTCCCG CTCGATGCCA TCAAGGTCGA TCGCTCCTTC
GTGCAGGATA TCGCTGCCGA TTCGGATGAC GCCTCGATCA CCCGGGCCGT GATTACCATG
GCTCACCACC TGAAGCTCAA GGTCGTGGCC GAAGGTGTCG AAACGCCCGA GCAACTGGCG
TTGCTCATTT CACATCAGTG CGACGTCATT CAGGGCTATT TCTTCTCGCG GCCTATGCCA
GCATCAGGAA TGACTGAGTT GTTGGTCTCT GACCGGCGCC TGCCTGACAA TATGTTGCGT
TCCGGCACTC GCAAGCCTAT GGCCTTGTTC GTCGCTGTCG ATGGCTTTGA ACAAGTTATT
TCAACGCTGA TCCGGGCCGG ACACCAGGTT TGTACTGCGC CGGACATGCC GGGGGCGTTG
CAGTGGCTTT CCGGTAATGT GGTTGATGTG CTGGTCTGCG GTGCTCCTCG CAAGGGTTTC
AACGCAGAGG AGCTGATCCG GCAGGCGGCT GGAATGCAGC CACGCTGCGA GCGCATGCTG
CTGGCCGACA GCCGCCAGTG GAATCGCAAG GCAGTGGCCG ACCTGAGCAG TTCAGGGCTG
GTCCATCGGG TCATTCATCT GCCGGTCGAG GCCGATGCTT TCCAATTGGT CGTCGAGGAA
TCATTGAGTC GGCGACATAT TTCGGATGAA TACAGCCGCC TGTCGCATGA AGTCGAGGTG
GTCGAACGGC AATTGGTGAG TATCGAGGAA GATCGCCGTC GCCTGCTCGA AGAAAATCAG
GTGCTGCAGG TTCAGGAGCG TCAGGGCTAC CGCATTTTGC AGGAGGTGCT TGCCGAATTG
CCGTTATCGG TCATCGGGAT CGATGAAAAC GGCCAGATCG TGCTGGCCAA TGATGCCGCG
CTTCAGGCAT TTGCATCTCG CGGCCTGTTT CTCGGTGCCG AGCTCGAACG TGTGCTGCCA
GAGGTGGCAT CGCTGAGCGA CAACGAAACA CTGAGTATCA ACGGTAACTT TTACCACTGT
CGCTGGCGTC AGCTCAGTCT CGAAGAATCC AGGGTTGGAC GTCTCCTGTT GCTGGAGGCG
CTTGAACAAT GA
 
Protein sequence
MIESAGRDGQ RLLGALQGSG HSVVTCQVSQ RADLKRALRG EVWDVALLSC SLAELPAAEA 
IVLFREHAPR MPVILTIERG AQEFPFELLE NGACDFVFKS NPARLLAVIE RECANVKLSR
EEQQVLVSPE RIEQIDQGVA RFFQLASNTP ECYWLTDAAT QRVTYVSAGY EKILGRHVEA
LYADSHDWLN YVHPEDQDRV LAAVRTYRLG GLDAKFRVRR PGDVQCWLHA RNFPVRDEEG
NIVSVGGVAT DITSLLADQW KSPYFAHFDA LTALPNQLMF YDQVKRMIAL SKRKDLPLSL
MVVDIDRFRQ LNQTLGHASG DELLRQVAGR LSGSLRESDI LGRLGTDVFG ILLPDVADTH
QASIVARRII DTMIMPVRVA GQDVFATAGV GIVFYPQDGN DVHELVTNAG IAGRHAKNLG
RNSYQYYFPG MHEGARDRMF LEIDLRHATL RNEFVLYYQP KASCADGRIT GVEALLRWQH
PERGIVSPDQ FIPLLEETGL IVQVGRWVLE EACRQAVEWQ AAGLNIPSVS VNLSARQLQA
ETLLTDVAAT LEKTGLNAAC LDLEITESML MDNADMAIQT LSALKKMGVT ISLDDFGTGY
SSLAYLKRFP LDAIKVDRSF VQDIAADSDD ASITRAVITM AHHLKLKVVA EGVETPEQLA
LLISHQCDVI QGYFFSRPMP ASGMTELLVS DRRLPDNMLR SGTRKPMALF VAVDGFEQVI
STLIRAGHQV CTAPDMPGAL QWLSGNVVDV LVCGAPRKGF NAEELIRQAA GMQPRCERML
LADSRQWNRK AVADLSSSGL VHRVIHLPVE ADAFQLVVEE SLSRRHISDE YSRLSHEVEV
VERQLVSIEE DRRRLLEENQ VLQVQERQGY RILQEVLAEL PLSVIGIDEN GQIVLANDAA
LQAFASRGLF LGAELERVLP EVASLSDNET LSINGNFYHC RWRQLSLEES RVGRLLLLEA
LEQ