Gene Daro_3813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3813 
Symbol 
ID3567969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4095470 
End bp4098361 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content64% 
IMG OID637682287 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_287011 
Protein GI71909424 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000539796 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0501343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCGG AAAACGCAAT GCCCAACAAG GAAATGCAAG CCTCCGTCAA AGGAGTGACA 
CTGGTGCCTG ACGATGCCAC GGAGGTTCGT CGGCAGAAAC TGGCGCGCAT CATTCTCGAT
GCGATGTACC AGTTCCTCGG CCTGCTCGAT GTCGACGGCA CGGTGCTCGA GATCAACCGG
GCCGCGCTGG AAGGGGCCGG TATCTGCCTC GATGAGGTGA TCGGCAAGCC GTTCTGGGAG
GCACGCTGGT GGGCCATCTC CGAAGAGGCC CGCAATCGCG TGCGGAGCAT GGTCGAGCAG
GCCAGGAATG GCGAATTCGT CCGTTGCGAC ATCGAGATTT TTGGCGATTT GCAGGGCAAG
AAAAGCATCT TCGTGGACTT TTCCCTGACG CCGATCCGCG ACGATGCCGG GCGGGTCGCC
TTTTTGCTGC CGGAAGGGCG GAACATCACC GAAAAGATCG CCATCGAGGC CGAACTGACG
CGCAAGAACG GCGAACTGCA GCTGGCACTG GAGAAGCTGC GGGAAATCGA TGGCTTCAAG
ACCAAGTTCT TCGCCAACGT CAGCCACGAA CTGCGGACGC CGCTGGCGCT GATCCTCGGG
CCGGTCGATC AAATGCTGCG CGAGTCCGAG CAACTGGGGG AGCGGGAACG CTTCCGTCTG
ACCACGATCA AGCGCAACGC CCAATCCCTG CACCAGCAGG TGAATGACTT GCTCGATCTG
GCCCGTATCG ATGCCCAGCA AATGCCGCTG GCCTACGTCT GCGTCAATGT CGTCGCCCTG
CTCCGGGAGG TCGCGGCCGG GTTTGCCGCT GCCGCCGAAG AGCGGGCCAT TTCGCTGATT
ATCGAGGGGG CGGACGAGCT TCAGGCCGAC GTCGACCGGG CCAAGTTCGC CCGGGTGCTG
GCCAACCTGC TGTCCAATGC CTTCAAGTTC ACCCCGGCCG GCGGGCGCAT CTGCTGTTCG
ATCACCCGCG TCGCCAACGA CCGCTTCCTG CTCAGCGTGC AGGACAACGG GCCCGGCGTG
CCGCCGCCGA TGAAGCAGCA AATCTTCGAT CGCTTCGCGC AGGGGCAGGG CGGCCTCTCC
GGCATCGGCA GCGGGCTTGG CCTGAATATC GTCAAGGAAT TCGTCGAACT GCATTTCGGA
ACGGTGGTCG TTCTCGATGC ACCGGGCGGC GGGGCGATCT TCCAGGTCGA GATGCCGAAG
CGGGCGCCCA ACGGCGTCTT CGTGCGCGAA AGCGGCGAGG GCATCGGCCT GGTCACGCCG
CAGGATATCG ATTTCCTCGA GCCGTCGAGC CACCCGGCCA GCGCCTACAA GTCAGGCACG
CCACGCATCC TGGTCGTCGA GGACAATCCC GATCTGCGCC ATTTCCTCTA CGACGTGCTG
ATCGACGATT ACAACGTGAC GCTGGCCGCC AACGGCGCCC TCGCCCTGAC CTCGGCACTG
GAAGATCCGC CGGATCTGGT GATCACCGAT TTGATGATGC CGCATTTCGA TGGCGAGCAA
TTCGTCCGCG AACTGCGGAC CAGCGGCTGC TTCCCGAATC TGCCGGTCCT CGTGCTGTCG
GCCCGGGCCG ACGATGCCCA GCGGGAAACG CTGCTCGAAG AGCTGGTTCA GGACTACCTG
ACCAAGCCGT TTTCTCCGCA GGAACTGCGG GCGCGCGTGC GCAATCTGGT CACGGTCAAA
CGGACCGTCG ACATCCTGCA GAAGGAACTC AATACCCAGG CTTCGGACGT CGGCGAACTG
ACGGCCGGCC TGGTCGCCAG CCGTAAATCG CTGCAGGACA GTCTGGTCGC GCTGCAGATC
TCGGAGCGGC GCTGGCAGGG GCTGTACCGG AATTCCGCGG TCGGCATTGC GCTGGCCGAC
CGGGAGGGAC GTATCCTGAA AGCCAATCCC GCCTTGCAGC AGATGCTCGG CTACAGCGAA
GCGGAGATTG TCGGCGTATC GTTCATCGAC ATTTCGGACG AGTCCCAGCG CGCCATGACC
CTGCGCAATG TGCACGGCCT GTTCGATGGC AGCATTGACC ACTACCATGT GCAGAAACGC
TACGAAAGAA GGGACGGCAG CTTCCTCTGG GCCAATGTCA GCGCCTCCCT GATTCCGGCG
GTCGATGTGG AAGGGCCGAG GCTGGCGGTC ATCGTCGAGG ATGTCAGTTC GCGCAAGGAG
GCGGAGAGCG CGCTGGCCGC AACGCAGACC GAACTGGCCC GGGTGTCGCG CTTCACCGCG
ATGGGCGAAC TGGTCGCCTC GATTGCCCAC GAGGTCAACC AGCCGCTGTC GGCGATCGTC
ACCAACAGCC AGGCCGCCCT GCGCTGGCTG GCCCGCGAAA CGCCGGATTA TCAGGAAGTG
GTTGCCGCCC TGAACCGGGT CAATCGCGAT GCCAGTCTGG CCGGGGAGGT CATTGCCCGT
ATCCGCAACT TTCTCAGCAT GGGCGGCATG CAGCGCGAGC GACTGGTCGT CCGCCCCATC
CTCGAGAACC TGCTGCAGAT GTTGCAGACC ATGTTGCAGG AGGCCGACGT CGAGGTTGAT
CTGCGGATTG CCCCGGGCTT GCCCGATCTG CTGGCCGATC CGGTCCAGTT GCAGCAGGTG
CTGCTCAATC TGGTGGTCAA CGCGGTCGAT GCGATGCGCG AGGAGAAGGA GCGGGCGCGC
CGCTTGAGCA TATCGGTCAG TGCAGACACG GCCGGCAGCG TCCTGTTTTC AGTCAGCGAT
ACCGGTCCGG GCATCCCGCC CGACAAGGCG GCGAAGATCT TCGATGCCCT GTTCTCGACC
AAGAGCCGTG GCCTGGGCAT GGGCCTGGCC ATCAGCCGAT CCATCGTCGA AAACCACGGC
GGGCGCCTGC GGCTGGTGCC TGAGGCCGCC GGTGGTGCTC ATTTCGTGTT CAATATTCCG
GTCCAACCAT GA
 
Protein sequence
MTSENAMPNK EMQASVKGVT LVPDDATEVR RQKLARIILD AMYQFLGLLD VDGTVLEINR 
AALEGAGICL DEVIGKPFWE ARWWAISEEA RNRVRSMVEQ ARNGEFVRCD IEIFGDLQGK
KSIFVDFSLT PIRDDAGRVA FLLPEGRNIT EKIAIEAELT RKNGELQLAL EKLREIDGFK
TKFFANVSHE LRTPLALILG PVDQMLRESE QLGERERFRL TTIKRNAQSL HQQVNDLLDL
ARIDAQQMPL AYVCVNVVAL LREVAAGFAA AAEERAISLI IEGADELQAD VDRAKFARVL
ANLLSNAFKF TPAGGRICCS ITRVANDRFL LSVQDNGPGV PPPMKQQIFD RFAQGQGGLS
GIGSGLGLNI VKEFVELHFG TVVVLDAPGG GAIFQVEMPK RAPNGVFVRE SGEGIGLVTP
QDIDFLEPSS HPASAYKSGT PRILVVEDNP DLRHFLYDVL IDDYNVTLAA NGALALTSAL
EDPPDLVITD LMMPHFDGEQ FVRELRTSGC FPNLPVLVLS ARADDAQRET LLEELVQDYL
TKPFSPQELR ARVRNLVTVK RTVDILQKEL NTQASDVGEL TAGLVASRKS LQDSLVALQI
SERRWQGLYR NSAVGIALAD REGRILKANP ALQQMLGYSE AEIVGVSFID ISDESQRAMT
LRNVHGLFDG SIDHYHVQKR YERRDGSFLW ANVSASLIPA VDVEGPRLAV IVEDVSSRKE
AESALAATQT ELARVSRFTA MGELVASIAH EVNQPLSAIV TNSQAALRWL ARETPDYQEV
VAALNRVNRD ASLAGEVIAR IRNFLSMGGM QRERLVVRPI LENLLQMLQT MLQEADVEVD
LRIAPGLPDL LADPVQLQQV LLNLVVNAVD AMREEKERAR RLSISVSADT AGSVLFSVSD
TGPGIPPDKA AKIFDALFST KSRGLGMGLA ISRSIVENHG GRLRLVPEAA GGAHFVFNIP
VQP