Gene Daro_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1934 
Symbol 
ID3567863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2080560 
End bp2083607 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content54% 
IMG OID637680405 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_285150 
Protein GI71907563 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0784] FOG: CheY-like receiver 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.452261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.114848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGA ATCCGAAGTC GGCCGTATCC ATAGAGAAAC GCCAAAGCAT TCTTCGATTG 
CTATGGGGTG GCGCTTTCCT GATCAACCTC TTCGTTGTCA GTATGGTCAT CCTCGTGCTC
GAAAGAAACC GGGCTCAGGA GATCAGTCAG GCGGAGATTC TGACTGAAAA TTACTCGAAG
ATTCTGGAGG AAAGTCTTGC CGGATTTATC AGCAAGATCG ATATCACGCT GCTGACGGTC
GCCGAAGAGG TGGAGCGGCA GATGGCCAGC GGCGGAATTA ACCAGAAGGC GCTGGAGAAA
TATATCGCGC TTCAGGATGC TCATGTTCCG GAAGCGCTCG GACTGCGCGT GGTGGATGCT
CAGGGCATCA TCCGCTACGC GGTTAATGAC ATCAAGGTTC GCAATGCCAG CATTGCCGAC
CGCCCACAGT TCATTCGCCT GCGGGATGAT CCAAATGCCG GGCTGGTTTT TTCAAAGCCG
GTCATGGGCC GCGCGGCAGA GAAATGGATG ATCACCTTGG GGCGCCGTAT CAACCATCCC
GATGGTTCTT TTGCCGGTGA TGTCCATGTG GCAGTTGCCG TTGATCGTTT CATTGCCATG
TTCGCCAATA TAGATCTGGG CGAAAAAGGA AATGTCGGTC TGTGGGACCA GAGTACATTG
ATTGCACGTT ATACCAGGTC CGATACTCGT GGTGCATCGG TGGGTTTTGC TAACCCTTCT
GCGGACTTGC ATGCCCTTAT TAATTCGGGC AAACGTGCCG CCAGTTACCA TACTCGTTCC
GGGGTGGACG GGATTACTCG GACCTTCTAC TTCCGGCAGC TTGGTCACTA CCCCCTATAT
CTGGTCGTTG GATTGGCCGA AGAGGATTAC CTGGCCAAAT GGTGGAAGGA TTCCCTGAGT
ATTGTCTTGC TGGCCGGTTT GTTCGTGCTG GCCAGCCTGG TTTCATCATT TCTGATTGGG
CGCGCCTGGA AAAGGCGGGA AGCGGATCGG GAGGCGCTGC TGCGCCAAGA TGCGGAATAT
ACCGCCAAGC TTGAGCAATT GAATCGCCAG ACTGATGCTG CATGGCGGCA GAGTGAACTG
ATTCTCTCGT CGGCAGCCGA AGGTATATGT GGTGTCGATC TGGAGGGCCG GGTGATTTTC
GTTAATCCGG CGGCGCGCAA GATGTTTGGT TGGAATGAGA ATGAAGGCAT CGGGCTCAAT
CTGCATGATC TGACGCATCA TCATGATATC GATGGGAAAC CTTTTCATAA CGAAGACTGC
CCGGTTTTCA AGACGCTTCG CGACGGTGAG CGGCGGCATG TGAGTGACGG GTTGTATTGG
CGAAAGGATG GCTCATCCTT CCCCGTCGAA TTTACGGTTT CATCCATTGA GCGGGATGGA
AAAGTGACCG GGGCGGTCAA CGTATTTCGG GACATCACCG AGCGCACACG GATCGAAGCC
GAGTTGGAGC GGCATCATCG TCATCTGGCA GAGCTCGTTC AGCAGCGCAC TTCGGAACTG
ATGCAAACCG AGGCCAGGGC CAGCCATATT TTGGAATCCA GCGCCGATGG GTTATACGGG
ATCGATTGCA ACGGCATCAT CACCTTTATC AATCCAGCGG CCTGCGCGAT TCTTGGCTAT
GGCGCCGAGC AGGTCATCGG CCTACCTGCG CACTCCTTGT TTCACCACAG CAAGCAGGAT
GGTTCTCCTT ATTCTGCCGC TGATTGCCCC AGCTACAATG CTTTGCGACT GGGCCAGAAG
GTTCGTGTCG ATGATGAGGT TTACTGGCAT GCCGACGGTC ACGCCGTTCC GGTGATGTAT
GCCACCCATC CCATGCTTCA GAACGGTCAA ATTACTGGCG CGGTCACCAG TTTCATTGAC
GTTAGCGTTC AGCGGGCTGC TGCGCAGGCT CGGGAGCAGG CGCTGATTGC CGCAGAGAGT
CTGGCCAAGA TAAAGAGTGA GTTTCTTGCC AATATGAGCC ATGAAATTCG AACGCCGCTC
AATGGCGTCC TGGGCTTTGC CCAGATTGGC TATCGCAACG CCGAGAATAG CGAGAAAGCA
CGAGATTCGT TTGCCAAGAT ACAGCTTTCC GGCAACCGTC TGCTTGGTGT GATCAATGAC
ATTCTCGACT TTTCCAAGAT CGAGGCTGGG AAGCTACGGA TCGAGCAGAC GGAGGTCGTT
CTTTCGGAGG TTGTCGAACA TGCGCTCGAC TTGCTCAGGG AGCGTGCCCG CGCCAAGCAG
ATCGAACTGC AGGTCGAACT CGCGCCCGAT CTGCCGCTGA CCTGTATCAG CGACCCGTTA
CGCATGGGGC AGGTACTTCT CAACATACTT TCCAATGCGG TCAAATTCAC TGAGGTCGGC
AGCGTTACCC TGTCAGTCAA TTGTCGAGAT GGCATGTTGC TCTTCAAGGT TGCGGACACC
GGGATCGGCA TGAGTGCCGA GCAGATTGGC ACTTTGTTCA ATCCTTTTCA TCAGGTGGAT
GCTTCGGCCA GCCGAAAGTT TGGTGGTACC GGCCTTGGAC TGGCGATCAG CAAACGCATT
CTGGAGTTGA TGAACGGAAA AGTCAGCATC GACAGTCTCC CCGGTGTTGG TACCAGCGTC
GAATTTTGTC TGCCCTACGT GAAGCCAGAA CCGTCTGTGG ATAGGCAAGC CGCCTCGCAA
GGAGAAGTGG ATAAGGTGCG GAAGCCTCTG GCGGGTATTT CAGTGCTGGT TGCCGATGAT
GAGGCGATCA ATCGCCTGGT GCTTGAAGAA ATTTTGATTG AATATGGTGC GGGTGTCGTT
CTGGTCAGCA ATGGCTTGGA GGCAGTGGAG CGAGTGATCC ACGATGGTCA GGATGCCTAT
GATGTTGTGC TGATGGATCT ACAGATGCCT GAGATGGACG GTTTTGAAGC GACGCGTCGG
ATACACGAAC TGCTACCCGA GCTGCCCATT ATTGCCCAGA CTGCTCACGC CTTCAGTGAA
GAGCGGCAAA AATGCTTTGC TACCGGAATG GTGGACCACA TTGCCAAACC GATAGAGCCC
GAGGCGCTGG GCAAAATTAT TCTGCAGCAC GTTTTGTCCA AACCATAA
 
Protein sequence
MTGNPKSAVS IEKRQSILRL LWGGAFLINL FVVSMVILVL ERNRAQEISQ AEILTENYSK 
ILEESLAGFI SKIDITLLTV AEEVERQMAS GGINQKALEK YIALQDAHVP EALGLRVVDA
QGIIRYAVND IKVRNASIAD RPQFIRLRDD PNAGLVFSKP VMGRAAEKWM ITLGRRINHP
DGSFAGDVHV AVAVDRFIAM FANIDLGEKG NVGLWDQSTL IARYTRSDTR GASVGFANPS
ADLHALINSG KRAASYHTRS GVDGITRTFY FRQLGHYPLY LVVGLAEEDY LAKWWKDSLS
IVLLAGLFVL ASLVSSFLIG RAWKRREADR EALLRQDAEY TAKLEQLNRQ TDAAWRQSEL
ILSSAAEGIC GVDLEGRVIF VNPAARKMFG WNENEGIGLN LHDLTHHHDI DGKPFHNEDC
PVFKTLRDGE RRHVSDGLYW RKDGSSFPVE FTVSSIERDG KVTGAVNVFR DITERTRIEA
ELERHHRHLA ELVQQRTSEL MQTEARASHI LESSADGLYG IDCNGIITFI NPAACAILGY
GAEQVIGLPA HSLFHHSKQD GSPYSAADCP SYNALRLGQK VRVDDEVYWH ADGHAVPVMY
ATHPMLQNGQ ITGAVTSFID VSVQRAAAQA REQALIAAES LAKIKSEFLA NMSHEIRTPL
NGVLGFAQIG YRNAENSEKA RDSFAKIQLS GNRLLGVIND ILDFSKIEAG KLRIEQTEVV
LSEVVEHALD LLRERARAKQ IELQVELAPD LPLTCISDPL RMGQVLLNIL SNAVKFTEVG
SVTLSVNCRD GMLLFKVADT GIGMSAEQIG TLFNPFHQVD ASASRKFGGT GLGLAISKRI
LELMNGKVSI DSLPGVGTSV EFCLPYVKPE PSVDRQAASQ GEVDKVRKPL AGISVLVADD
EAINRLVLEE ILIEYGAGVV LVSNGLEAVE RVIHDGQDAY DVVLMDLQMP EMDGFEATRR
IHELLPELPI IAQTAHAFSE ERQKCFATGM VDHIAKPIEP EALGKIILQH VLSKP