Gene Daro_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3549 
Symbol 
ID3567627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3799074 
End bp3802022 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content59% 
IMG OID637682022 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_286748 
Protein GI71909161 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.288363 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTCG TATTCAATAC CCTGTCGGTT GCCGAAGCCT TGGTATTCCG TCGAACGCTG 
CTGGCAATCG TCGTGGCGAT CAATTGCCTG GCGATTTGCG TCGGCGCCTA CACCTTGTAC
GACAGTCGGC AGCTCTACGA TCGCCAAGCC GAAGCGGAAA GCCGGAATCT GGCGCGTGCC
CTCGATGAAA ACCTTGCCGT CAGCCTGGGG CGTGTCGAGG TGACACTGGG TAGCGTGGTC
GATCGTCTCG AAGAAGAGTT GAACCGACAT GCCACGCTCC AGGAGGTGTC TCTGGCGTCG
TTCATCAAAC GTGCGGAGAG CCATCTTGGC GCCCACGTCA GGGTTCGGGT GTCGGATGAG
AGCGGCATGG TGATTCTGGG CGGGGATGTG GTGCCGGGAA CCACCACATG GGGAGGGCGG
AGTTTTTTCA GGCAACTCAA GGAGCATCCG GAAACCGGTA CCCTGATCAA CGACCCGGTG
CTCGGCTACG TCACCAAGGT CCACCTGGTT CCGGTCGCGG CACGCTATCG CTATCCGGAT
GGCCGATTTG CCGGCATCGT CTCGGTCGCG GTTCCCGTTT CCTACTTGTC CGATCAGCTG
GCCAAGCTCG ACTATGGCCC CCATGGCCTG GCTGTCCTGC GCGATGCCAA ACTCAACCTG
ATCACCCGCT ATCCTGCATT GAACAAGCCG GAAGGTGAGT TGGGGGCGCC AATTTACAAC
AAGACCCTGA TTGCCGGAAT TGAGGACGGC AAGCGCAGTT TCTTCTATCA CACCCAGACG
ACACCGGATG GCGTTTCGCG GCTGATTACC TATCAACGGC TGTCGTCGAT GCCATTCCAC
CTGATCGTCG GCCTGGCGCA TGACGACTAT CTGCTGCCCT GGTATGAGCG ATTGCAGCAA
ACCGTCATCG CCCTTGTCAT TTTTGCGTTC GCCACCTTTG CCCTGAGCTT GTTGCTCCTG
GGCATGCTGT CCAGCTTGCG GCGGCAAGGC GAGCATGCCG TTGCGCTACT CAAGAATGCG
AGTGACGGTG TCCATATTCT CGACCGGCAT GGCGTCATCG TCGAAGCCAA CGACGCTTTC
TGCGCCCTGC TGGGTTATTC CCGGCAGGAG GTCATCGGCA TGGCCATCGG GCAATGGGAT
TCGTCAGTCA AGGATGTGGA TGTCGAGGCG ATGCTGGCCC AGCGCTTTCA GTCGCCCGTG
CCCGATCAGT TTGAAACACG CCACCGCCGG AAGGACGGCA GCGAGTTTCC GGCAGAGGTC
AGTTTCCAGC CGATGGTCAT CGAGGGCGTC GAGTTGCTCT ACGCCTCATC GCGGGATATC
AGTGACCGGA AGCGCGCAGA ATTGGTGCTG GCCCGTGAAC GTTCGATGTA CCGGACCTTG
ATCGACACGC TTCCCGACCT GATCTGGCTG AAGGATGCCG ATGGTGTCTA TCTCAGCTGC
AACCGCCGCT TCGAGCAGTT CTTCGGGACC AGCGAACAGG AAATTGTCGG CAAGACGGAC
TATGATTTCG TCAGCAAGGA ATTGGCTGAT TTCTTCCGGG AACATGATCG CAAGGCCATG
GAAAAGGATG GACCATCAAT TAACGAGGAG GAGGTGCCCT TCGCCTCGGA TGGTCACCGG
GAGCTGCTCG AAACGACGAA GATGTCGATG CGCGATGCCA GCGGGAAGCT GATCGGTGTG
CTGGGGATCG GGCATGACAT CACGGCCAAG CGACGTTCGG AGCTGGAACT GGATGGACAC
CGCCAGCACC TGCAGGAACT GGTCGACTCG CAGACCGCCG ATCTGATGGT GGCCAAGGAG
GTCGCCGAAA CAGCCAGTGT GGCCAAGAGC GCCTTCCTGG CCAACATGAG CCACGAAATT
CGCACGCCGA TGAATGCCAT CACCGGCATG GTGCATATCC TGCGCAACAT GGGGGTGACC
CCTCCGCAGT CTGAAAAGCT CGATATCATC GAAAACGCCA GCAAGCACCT GCTGGGAATC
ATCAACGATG TGCTCGACCT GTCGAAGATC GAGGCCGGAA AATTTACGCT GGAGGATGTG
CCGCTCAGTA TCCGCGCCCT GTTCGGCAAT GTAGCCTCGA TGCTCGGCCC GAAAGTGCGG
GAAAAGGGGC TGGTTTTGAA CATCGAAGCC GATGATGTGC CACGCAACTT GCGCGGTGAT
CCGACGCGAC TCCAACAGGC GCTGCTCAAT TTCGCCGGCA ACGCGCTGAA ATTCACCGAG
CGGGGGCATA TCACCCTGCG CGCCCGGATT CTGGAGGAGA CAGAGCTATC AGCCAACTTG
TGTATCGAGG TCGAAGATAC GGGAGTTGGC ATTGCGCCGG AAGCCTTGCC CAAACTGTTC
GGTGCGTTCG AGCAGGCCGA TAATTCGATG ACCCGCAAAT ACGGCGGGAC GGGGCTGGGC
CTGGCCATTA CCAAAAAGAT AGCGGAAGTG ATGGGCGGCA CGGCCGGCGT TAACAGCACG
CCGGGGGGAG GAAGTACCTT CTGGTTCACC GTGATCCTCG GTAAATCAGA TGAAATCGCC
GAGGTGCCCA GCCGTACCGC CATGACGGAG GCCGATGCGA TGCTGAGGCG CGATTGCGCC
CACCGGCGGA TTCTCTTGGC CGAAGACGAG CCGATCAATC GGGAAATTGC CCAAATGCTG
CTCGAAGCGG TCGATCTGGA CGTCGAAATG GCCGAGAACG GGGAGATTGC GGTGCGCATG
GCCCTGGCCA ACTGCTATGA CCTGATCCTG ATGGATATGC AGATGCCGGT GCTCGATGGT
TTGGCGGCCA CGCGGCGCAT CCGTGAACTG CCGGCCTGCG CAGCGGTGCC GATTCTTGCG
ATGACAGCCA ATGCCTTTGC CGAGGACAAG GCTCAATGCC TGGCGGCGGG GATGAACGAT
TTCATCGCCA AGCCGGTCGT TCCGGAGGTT CTTTACGATA CCTTGCTGAA GTGGCTGCGT
CGTCTGTAA
 
Protein sequence
MGVVFNTLSV AEALVFRRTL LAIVVAINCL AICVGAYTLY DSRQLYDRQA EAESRNLARA 
LDENLAVSLG RVEVTLGSVV DRLEEELNRH ATLQEVSLAS FIKRAESHLG AHVRVRVSDE
SGMVILGGDV VPGTTTWGGR SFFRQLKEHP ETGTLINDPV LGYVTKVHLV PVAARYRYPD
GRFAGIVSVA VPVSYLSDQL AKLDYGPHGL AVLRDAKLNL ITRYPALNKP EGELGAPIYN
KTLIAGIEDG KRSFFYHTQT TPDGVSRLIT YQRLSSMPFH LIVGLAHDDY LLPWYERLQQ
TVIALVIFAF ATFALSLLLL GMLSSLRRQG EHAVALLKNA SDGVHILDRH GVIVEANDAF
CALLGYSRQE VIGMAIGQWD SSVKDVDVEA MLAQRFQSPV PDQFETRHRR KDGSEFPAEV
SFQPMVIEGV ELLYASSRDI SDRKRAELVL ARERSMYRTL IDTLPDLIWL KDADGVYLSC
NRRFEQFFGT SEQEIVGKTD YDFVSKELAD FFREHDRKAM EKDGPSINEE EVPFASDGHR
ELLETTKMSM RDASGKLIGV LGIGHDITAK RRSELELDGH RQHLQELVDS QTADLMVAKE
VAETASVAKS AFLANMSHEI RTPMNAITGM VHILRNMGVT PPQSEKLDII ENASKHLLGI
INDVLDLSKI EAGKFTLEDV PLSIRALFGN VASMLGPKVR EKGLVLNIEA DDVPRNLRGD
PTRLQQALLN FAGNALKFTE RGHITLRARI LEETELSANL CIEVEDTGVG IAPEALPKLF
GAFEQADNSM TRKYGGTGLG LAITKKIAEV MGGTAGVNST PGGGSTFWFT VILGKSDEIA
EVPSRTAMTE ADAMLRRDCA HRRILLAEDE PINREIAQML LEAVDLDVEM AENGEIAVRM
ALANCYDLIL MDMQMPVLDG LAATRRIREL PACAAVPILA MTANAFAEDK AQCLAAGMND
FIAKPVVPEV LYDTLLKWLR RL