Gene Daro_4142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4142 
Symbol 
ID3566642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4441175 
End bp4443925 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content57% 
IMG OID637682614 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_287338 
Protein GI71909751 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAA CCCTTTTGCG CCAGCTCAAA CGCAGTATCG GTGTCGGTAG CGAAGCCGAA 
CTGGCAGACT TGCTCGCGTC GATACGGACT GCTGGCACGA CGCTAGAGCC GGCTTTCCGG
GGGATTCTTG AAGGCCTGGG CGACCTCCTG AGCAGGGTCG ATGCCAGTTA CGAACAGTAC
GAACGTGATC TTGAACTGCG CACGCGCAGC CTTGAGATTT CGTCGGCCGA ACTTTCAGCC
GCCAACGAAA AACTCCGCCA GTCCCTTGCC GGGCGGGATA ACGCACTGCG CTCGCTCCGA
AAAACCATCC GTGACCATCT GCCGGAAGCC AATGAAGATG CCGAGACACT GGCCGAAGAG
GATATCGCGG TACTTTCCAG GCGCGTCGCG GCCATGGTCG CCGAAAGCGA ACATGGCTTA
CGGGAACTGG CCAACCAGAA GTTTGCTCTC GACCAGCATG CCATCGTCAG CATCACTGAT
ACGGCCGGCA TTATCACCTA TGCCAATGAC CGCTTCTGCG CCATCAGTGG CTACGCACGG
GAAGAACTGA TCGGGCAGAA CCACAGGATC GTCAACTCCG GCATCCATCC GCCTGAGTTG
TTTCGCGATA TGTGGCACAC CATCGTACTG GGCAAGGTTT GGCACGGTGA AGTTTGCAAC
CGCGCACGCG ATGGGCACCT TTATTGGGTC AATGCGACGG TCGTGCCGCT GCTCGATGCC
AAGGGTGAGC CTGAGCGATA TATCGCCATT CGTACCGAGA TCACCGACCG CAAGCGCATG
GAGGCGCAAT TGTCGGAACA ATTGCACCTG GTTGAAGAAC TGATCGAAAA CATTCCCCTG
CCGATCTACC TGAAAGACGG CAGCGGGCGC TATATCAGAC TCAACCGCGC TTTCGAACAG
TTCTTTGAGG TTCGCCGCGA AGCCTTCATC GGGCGCACGC TGCACGATCT TTTGCCAGCA
GAAGACGCTC GTCAGCATGC AGAAATGGAT GCTGATCTTT TTGCCGCCAA GGGCACGCAA
ACCTACGAAG CCACAGTCCA TAGCCAGGAC GGCATCGCTC ACGAAACAAT TTACCGCAAA
GCGGTATTGA CCCGCCGCGA CGGCAGCGTT TCGGGATTAC TCGGCATCAT TGTCGACATC
ACTGACCGGA AACGAGCAGA GATCGAGGTG CTGCGGGCCA AGGAAGCGGC GGAAGCGGCC
AACCGGGCAA AGAGTGATTT TCTGGCTAAC ATGAGCCACG AAATCAGGAC ACCGATGAAC
GGGGTCATCG GCATGACGGA TCTCGCCCTC GAAACCGCAT TGACCGAGGA ACAACGTGAA
TACCTAAACA TCGCAAAATC CTCGGCCGAA TCCTTGCTCA CCGTCATCAA CGACATCCTC
GATTTTTCGA AGATTGAAGC CGGGAAGCTA CAGGTCGAAG ACATCCCCTT TGACCTGCAT
CGTCTGATTG CCGACACACT GAAACCGCTG GCGCTGCGGG CCCATGAGAA AGACATCGAG
ATACTCAGTG ACGTAGTGCA TGACGTTCCC CGATTTGTTA AGGGCGACCC CAGCCGGATC
CGGCAGGTAC TCGTTAACCT GGTCGGCAAT GCCATCAAAT TCACCGAACA GGGCGAGATT
GCCCTCCAAG CTGACTTGAT GCAGCTTCAG AACGGCCACG CTGTCATCCA CTTCGCAGTG
CGCGATACGG GGATTGGCAT CTCGGCCGAC AAGCAGATGC TGATTTTCGA TGCCTTTGCC
CAGGAAGACT CGTCGACCAC CCGGAAATAC GGAGGCACCG GCCTGGGCCT CTCGATTTGT
CGCCGCCTGG TCGAGTTGAT GGGCGGTACG TTGTGGTTAC ATAGCGAGCC GGGCAAGGGG
AGTACCTTTC ATTTCTCAGT CCAGTTGCAG ATCTCGGAAA ACGCTATTCC GCTCGTGATT
CACCCTGTCG ACGTGACGGG CCGTCACATG CTTATCGTTG ATGACAACGC CACTAACCGG
CGCATTCTCC GCGGCATGCT GGCCACCTTG CATGTCACTT GCCAGGAAGC CGACTCGGGA
AAGACAGCAC TAGCGCTGAT GCGTGAGCAA GGCAGCCAGT TCGATTGCAT CCTGCTCGAT
GCCCAGATGC CTGAGATGGA TGGTTATGAG CTTGCCCGCC ATCTGCACGC CGAGCATCCC
GCTTTGCCCC CAATGTTGAT GCTCTCCTCG GGTGCCCTGC GGGGTGATGC GCTACGTTGT
CAGGAGGCCG GCATTGCCGG ATTCTTCGCC AAGCCGATCT CATCAGACGA ATTGCTCTCC
GCCCTTGGCC GCCTGTTCGA TAACAGCCCA AAGGAATCCT CACCGGAACC CAGCCCGCTA
CTGACACGCC ACTCCCTGCG CGAGGCGCAA CGCGTCCTGA ACATCCTGCT GGTCGAAGAT
CATCCGACCA ACCAGAAGTT GGCGCTCGGC CTCCTCAACA AATGGGGGCA CCATGGAACA
TTGGCCCAAA ATGGACAGGA AGCCCTCGAT ATCCTTGCCA GCCAATCCTT CGACATCATC
CTGATGGACA TGCAAATGCC CGTCATGAGT GGTGTCGAGG CGACCCAGCG CATTCGGGCG
CGCGAAGCGG CCATGCAACT ACCGCGCACC CCCATCATCG CGATGACTGC CGCAGCCATG
CAAGACGACC GGGATGCTTG CCTGGCTGCC GGAATGGATG ACTATCTGGC AAAACCAATC
AAGGTCAAGG AGTTGCAAGC ACTGCTGCTT GCCTACACTT CCGCCCCATA A
 
Protein sequence
MQKTLLRQLK RSIGVGSEAE LADLLASIRT AGTTLEPAFR GILEGLGDLL SRVDASYEQY 
ERDLELRTRS LEISSAELSA ANEKLRQSLA GRDNALRSLR KTIRDHLPEA NEDAETLAEE
DIAVLSRRVA AMVAESEHGL RELANQKFAL DQHAIVSITD TAGIITYAND RFCAISGYAR
EELIGQNHRI VNSGIHPPEL FRDMWHTIVL GKVWHGEVCN RARDGHLYWV NATVVPLLDA
KGEPERYIAI RTEITDRKRM EAQLSEQLHL VEELIENIPL PIYLKDGSGR YIRLNRAFEQ
FFEVRREAFI GRTLHDLLPA EDARQHAEMD ADLFAAKGTQ TYEATVHSQD GIAHETIYRK
AVLTRRDGSV SGLLGIIVDI TDRKRAEIEV LRAKEAAEAA NRAKSDFLAN MSHEIRTPMN
GVIGMTDLAL ETALTEEQRE YLNIAKSSAE SLLTVINDIL DFSKIEAGKL QVEDIPFDLH
RLIADTLKPL ALRAHEKDIE ILSDVVHDVP RFVKGDPSRI RQVLVNLVGN AIKFTEQGEI
ALQADLMQLQ NGHAVIHFAV RDTGIGISAD KQMLIFDAFA QEDSSTTRKY GGTGLGLSIC
RRLVELMGGT LWLHSEPGKG STFHFSVQLQ ISENAIPLVI HPVDVTGRHM LIVDDNATNR
RILRGMLATL HVTCQEADSG KTALALMREQ GSQFDCILLD AQMPEMDGYE LARHLHAEHP
ALPPMLMLSS GALRGDALRC QEAGIAGFFA KPISSDELLS ALGRLFDNSP KESSPEPSPL
LTRHSLREAQ RVLNILLVED HPTNQKLALG LLNKWGHHGT LAQNGQEALD ILASQSFDII
LMDMQMPVMS GVEATQRIRA REAAMQLPRT PIIAMTAAAM QDDRDACLAA GMDDYLAKPI
KVKELQALLL AYTSAP