Gene Daro_3017 details


Gene Information

Locus tag: Daro_3017
Symbol:
ID: 3568685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3261606
End bp: 3263180
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 59%
IMG OID: 637681488
Product: sensor histidine kinase
Protein accession: YP_286217
Protein GI: 71908630
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
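
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3261606
END_BP = 3263180
GENE_LENGTH_BP = 1575
PROTEIN_LENGTH_AA = 524


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both checks agree with the listed gene and protein lengths.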


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value: 0.413812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.591209
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCCC ACGTCGCCCC AGTCAGCAAG CGGACAATCC CACGCGTCGT TCGCGAACGG 
CGATGGTGGC TGTTACTGCT GGTGATCTGG GCAGTTGCCG TGGCTTTCTC ACTGAATTCC
CAGATCGAGC AAATTCGCCA GCAAACGACA GCGGTCGCCA TCGAAGGCGC ACGAAACATG
TTTCGCATGG TGCTGCTCAC CCGCAACTGG AATGCCAGCC ACGGCGGCAT TTACGTACCG
GTGACCGAAC GTACCCAGCC CAATCCTTAT CTGGATGTGC CGCACCGCGA CGTGACAACC
ACCGACGGTA TGGAATTGAC GATGGTCAAC CCGGCCTACA TGACGCGTCT GATCGCCGAG
ATGGCAGAAT CAGCCTCGGG TGCCGTATTC CGGCTGACCA GCCAGCGCCC CATTCGTCCC
GAGAACAAAC CGGATCATTG GGAGCATCGG GCGCTCGAAG CCTTCGAACA AGGACTTAAA
GAGTTTTCCG GTGTCGAAGC CAGCCCGGAA GGTGACATGC TGCGTTACAT GGCCCCGCTC
AGGGTGCAGG AAAGCTGCCT GCAGTGCCAT CGCAAGCAAG GCTATAAAGT CGGTGACATT
CGCGGCGGCA TCAGCGTTTC ACAGCGCAAC GCGCCGATCG AGGCCGCCGT GCAGGCTGGT
TGGCGCAAGG TCTTGCTCAC GCACGGCATG GCTTTCATGC TGGTGCTGAT CGCCGGCTGG
TTGATGCTCG AAATGTTGCG TCGGCGCTGG CTTGAGCTCG GCGACAAGAT CCAGGAATTG
CAGGATGCAC AGAGCCAGTT GCTGCAATCC GAAAAGATGG CTGCCATCGG CCAACTGGCC
GCCGGGGTGG CCCATGAAAT CAACAACCCG GTCGGCTTTG TCAGTTCCAA TCTCGGCTCG
CTGAAAAACT ACAGCGAAAA GATGATCACC CTGCTTGACC GTTGTCGCAG CAGAGAGGCC
AGCGAAGCCG ACTTCATCGC CGCCGATTTT GATTACCTGA AGGAAGACCT GGCCGACCTG
CTGCGCGAAT CTCGCGATGG CCTGGGGCGA GTCACGAAAA TCGTCAGCGA CCTCAAGGAT
TTCGCGCATA TCGATGAAGC TGCCTGGCAA GAGACCAATC TCAACGCCGG GATCGAGGCA
ACGCTCAACG TGGTCTGGCA CGAATTGAAA TACAAGGCTG AAGTCGTTCG CGAACTCGGC
GAGTTGCCGC CAGTCACCTG TATTGCCGCG CAGATCGATC AGGTCGTGAT GAATCTTCTG
GTCAATGCGG CACACGCCAT CGAAACCCGC GGCACCATTA CCGTCCGTAC TGGCCACGAC
GATGCCTGGG TATGGATCGA GGTGGCCGAT ACCGGCAAGG GCATGACGCC TGCCGTCATT
CAGCGCATCT TCGAGCCCTT CTACACCACC AAGCCTGTTG GCAAGGGGAC AGGACTCGGG
CTATCGCTTT CCTACGACAT CGTGAAAAAG CACGGTGGGC GCATTGAAGT GAATAGCGAG
CCAGGCAAGG GCAGCACATT CCGGGTCTGG ATCCCGCAGC AAGGCAACCC GACGGCAGAA
CTAGCGCCGA AATAA
 
Protein sequence
MDAHVAPVSK RTIPRVVRER RWWLLLLVIW AVAVAFSLNS QIEQIRQQTT AVAIEGARNM 
FRMVLLTRNW NASHGGIYVP VTERTQPNPY LDVPHRDVTT TDGMELTMVN PAYMTRLIAE
MAESASGAVF RLTSQRPIRP ENKPDHWEHR ALEAFEQGLK EFSGVEASPE GDMLRYMAPL
RVQESCLQCH RKQGYKVGDI RGGISVSQRN APIEAAVQAG WRKVLLTHGM AFMLVLIAGW
LMLEMLRRRW LELGDKIQEL QDAQSQLLQS EKMAAIGQLA AGVAHEINNP VGFVSSNLGS
LKNYSEKMIT LLDRCRSREA SEADFIAADF DYLKEDLADL LRESRDGLGR VTKIVSDLKD
FAHIDEAAWQ ETNLNAGIEA TLNVVWHELK YKAEVVRELG ELPPVTCIAA QIDQVVMNLL
VNAAHAIETR GTITVRTGHD DAWVWIEVAD TGKGMTPAVI QRIFEPFYTT KPVGKGTGLG
LSLSYDIVKK HGGRIEVNSE PGKGSTFRVW IPQQGNPTAE LAPK
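
The relationship between the gene and protein sequences above can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record: it assumes the sequence is given 5' to 3' in the coding direction, and it uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGACGCCC ACGTCGCCCC AGTCAGCAAG"))
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MDAHVAPVSK`, matching the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.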