Gene Daro_0833 details


Gene Information

Locus tag: Daro_0833
Symbol:
ID: 3569648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 904633
End bp: 906573
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 58%
IMG OID: 637679289
Product: GAF:ATP-binding region, ATPase-like:histidine kinase, HAMP region:histidine kinase, dimerization and phosphoacceptor region
Protein accession: YP_284059
Protein GI: 71906472
COG category: [T] Signal transduction mechanisms
COG ID: [COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific
TIGRFAM ID:
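
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(904633, 906573)
print(length)                  # 1941
print(protein_length(length))  # 646
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields above.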


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0228489
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGCCA CCTCAGGCAA ACTCTCGCGA AAAATCATTG GCGTACTGGT CGTGTTTTTT 
CTCGTTGCGA CCTCTGCCAT TGGTTTGACC CTGCTCATTT CCTGGCAACT GGAAGGGGTG
GCTGCGGCGA TCAATGACGC GGGCAGCCAG CGCATGCGGA CCTACCGTTT AGGCCACTTG
ATGGCCCGCG GCCTCGAAGC AGAAACGCAG GTTGCCGCGT TGACTGCCGC CTTGAGCGAA
GAAGTCGTGC GTTTCGACAA GGTGCAGCGT GACCTGCAAT TGGGCGATCC GCTCAGGCCG
CTGTCTTCGC CCCGGAATTT CGAAGTGCAG GACAGGCTGC ATGACGTCGA ACAGTCGTGG
CGGAGGATCA TTCGGCCGCT GGTGGAAAGT TATCTGGCCG GTGATCACCA GGCGCGCTCG
GAGGTGCTTG ATCGCTTCGA TCTCGAACTG GAACCGTCCG TCATGGGAAT TAACGAATTG
GTGTTGGCCA TGGAGCGCAG CTATGCCTAC GACACCAATC TGTTGCGCTA TGTCCAGGTG
GCGCTGGTGT TGCTGGCGAT CTTTGGTACG GTCATCCTGA TCCGGTTTTT TGAGCAGCTG
GTTATCCGGC CGGTTAGCCT GCTGCATGTC GGCATGCGGC GCATGACCGG AAACGATTTG
ACGGTGCGCG TCCCGGTGAC CAGCGACGAT GAACTGGGCG GGCTTGCCGC AGGTTTCAAC
CAGATGGCCG AGCATCTGGA GGATGCTTAC GGCTCACTGG AACAGCGTAT CGAGGCCGAA
ACACGACGTC TCGCCCAGCG CAATCACGAA CTCGGTATTT TGTATGCGGC GACTTCGTTT
CTCAGTGAAC CCGCACCACT GGAGGCCCTG TGCGAGGGTT TTCTTGATCA CATCAAGAAC
GCGCTGGGGG CTGACGCCGG TGCGGTGCGA CTGTATGTGC CGCAGACGGA CAAGCTCTAC
CTGATGACCC ACGAAGGGCT TTCAGACGAG TTTGTCGCCA ACGAAGGCCA GCTGAATTGC
GGCGAATGCC TGTGCGGCGA GGTGTTTCAG AGCAGCCGGC CGGCCGCCTT TGCAACCGCC
AATCCACCGG AGGGCATGCG CCTCCGGAGC TGCATTCGTG AAGGATTTGC GACGGCAACG
GCGTTCAGCA TCCTCTACGA CAAGCAGCGA CTTGGCGTCT ACAACCTCTA TTTCCGCCGT
TCCCAAGCCT TGTCCGAGCA GGAAATCCAC TTGCTTGAAA CGCTTGGCCA TCACCTCGGT
GTAGCGATTG AGAACCAGCG CCTGAAGTCC CGCGAAAAGG AGTTGGCCGT TTCCGAGGAG
CGCAACCTGC TGGCTCAGGA ACTGCACGAT TCGATTGCCC AAGGCCTGGC TTTCCTGAAT
ATCCAGGTCC AGCTGCTGCA GGATTCGCTG CGCAAGGGCA AGGCTGATGA GGCCATGCAG
ACGGCCGGGC AATTGCGCGA AGGTGTTCAG GAGAGTTATG ACGATGTTCG CGAGTTGCTG
GTGCATTTCC GGACGCGCGT CCATCAGTCC GATCTCGATT CGGCGATCAA TGCCGCGCTC
GAAAAGTTCG AAGGGCAAAC CGGCATCCAG ACCGAGTTTG AACGGATTGG AGCGACAACG
CCGTTGCCGC CCACGGACGA AATCCAGATC ATGCATATCG TGCAAGAGTC CTTGTCCAAT
ATTCGCAAGC ATGCCAAGGC AAAGCGTGTG CGGGTCACTG TGCGTCAGGA GCTTGGTACC
AGCAAAGTCA TGGTTGAAGA TGACGGTATC GGCTTCGACC CGCAAAACGA TCCGAATTGC
CTGTCCGATC GCCATGTCGG TCTGAAAATC ATGCGGGAGC GAGCCCATCG CATCGGTGGT
GAGTGCCGAA TAACATCAAA TCCAGGCCAA GGCTCGTGCG TTATATTGAG CCTGCCGAAG
GAAAATAGGG GGAGTGTCTG A
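
The gene sequence is laid out in blocks of 10 bases, 60 per row, so the spacing has to be removed before any computation. As a minimal sketch (helper names are hypothetical), a GC fraction such as the one reported in the GC content field could be recomputed like this:

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks; normalize to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTTCGCCA CCTCAGGCAA ACTCTCGCGA AAAATCATTG GCGTACTGGT CGTGTTTTTT"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this one row only
```

Passing the full sequence block through `clean_sequence` gives the whole-gene figure; the example uses a single row only to stay short.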
 
Protein sequence
MFATSGKLSR KIIGVLVVFF LVATSAIGLT LLISWQLEGV AAAINDAGSQ RMRTYRLGHL 
MARGLEAETQ VAALTAALSE EVVRFDKVQR DLQLGDPLRP LSSPRNFEVQ DRLHDVEQSW
RRIIRPLVES YLAGDHQARS EVLDRFDLEL EPSVMGINEL VLAMERSYAY DTNLLRYVQV
ALVLLAIFGT VILIRFFEQL VIRPVSLLHV GMRRMTGNDL TVRVPVTSDD ELGGLAAGFN
QMAEHLEDAY GSLEQRIEAE TRRLAQRNHE LGILYAATSF LSEPAPLEAL CEGFLDHIKN
ALGADAGAVR LYVPQTDKLY LMTHEGLSDE FVANEGQLNC GECLCGEVFQ SSRPAAFATA
NPPEGMRLRS CIREGFATAT AFSILYDKQR LGVYNLYFRR SQALSEQEIH LLETLGHHLG
VAIENQRLKS REKELAVSEE RNLLAQELHD SIAQGLAFLN IQVQLLQDSL RKGKADEAMQ
TAGQLREGVQ ESYDDVRELL VHFRTRVHQS DLDSAINAAL EKFEGQTGIQ TEFERIGATT
PLPPTDEIQI MHIVQESLSN IRKHAKAKRV RVTVRQELGT SKVMVEDDGI GFDPQNDPNC
LSDRHVGLKI MRERAHRIGG ECRITSNPGQ GSCVILSLPK ENRGSV
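
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a simplified illustration, not the database's own procedure. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard genetic code; table 11 also permits alternative start codons, which this sketch does not handle and which do not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons in TCAG order (first base varies slowest), paired with residues.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTTCGCCA CCTCAGGCAA ACTCTCGCGA"))  # MFATSGKLSR
print(translate("GAAAATAGGG GGAGTGTCTG A"))           # ENRGSV
```

Both fragments match the corresponding ends of the protein sequence above. The last row can be translated on its own because the 1920 bases before it are a whole number of codons.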