Gene Daro_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3147 
Symbol 
ID3567648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3386130 
End bp3387566 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content60% 
IMG OID637681618 
Productsensor histidine kinase 
Protein accessionYP_286347 
Protein GI71908760 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.0895878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGAA CTGAACCGAC GCCTGAAGAT TCAGGCGCCA AGGAGCACAA TCAGGGACTG 
GCCCGCTCAC TATTCCGGGT CTCGCTACCG CGCTGGCGTG GCAGTCGCTA CAGCCTGTCC
AAGCTGTTCT TCAATTTCTA CCTGCTGGCG ATGGGCTCCT TCGTCGCCAT CGCCTTTACC
GCCGACTTCG TCATCTCTAC CGCCCAGCGC GGCATTACCG ATGATTACGC GCGGCGCTTC
ATGCGCGGCA CGATTACGCT GATTGAAGAT GAACTGTTCC ACCATCCGCG CCGGGAATGG
CAGAAAAAGA TCAAGGATAT CGACGAGAAA TTCTCCTACA ACCTGGGCAT CGTCGAACGG
ATCACGCTCG ATAGCAAACT GACCCCCTCT CAGGTGATCA AGCTCGACGC CGGCGATATC
GCCATCGACC ATGATGGCGA CATCATGTAT CACCGACTCG GCACATCGAG CCAGGTTCTC
GTCGTCGGCC CGCTGGCATC GAATCGTAAT CCCGAACTAA AAGACCGCCT GCCGCTCGAA
TTACGGCTGC GCCTGCTGAC CTGGAGCCTG ATTGGCGTCA TTTTCGCCAT CGCTCTCTGG
TTCTGGATTC GCCCCATCTG GCGTGACCTT GAAGCGCTGC GCCAGACGGC CCGCGATCTC
GGTGACGGCC ATTTCGATGC CCGCTCACCG GCCGCCCGCA CGCAGCTCTT TGCCCCGCTT
TCCGACACCA TGAACAGCAT GGCAGACCGT ATACGACAGC TGCTGGCCAC TCATCGCGAA
CTTTCCTGCG GTATCTCGCA CGAGCTGCGC ACGCCGATTG CCCGCATGCG TTTTGCCCTG
GAAATGCTGT CCGAAACCGA GCAACGCGAT GAGCGCGAAC GCCTGTGGGC CATGATGGAA
GCTGACCTCG ACGAGCTCGA CCAGCTGATC GATACCAGTC TGACCTACGC CCGCTTCGAG
CGCGAAGCGC CGCAAGCGCA CTTTTCCAGC GTCAAATTCG CCGAGTGGCT AAGCGACGAA
GTCGACGCGG TCCGCCTGCT GGGCCGTCAG CTTGAGGTGG TCGTCGATAC CGGAAAACTG
CCAGAAAACC TGTTCGTCGA TCTTGACCGC AAGGCGATGC CCTACGCCCT GCGCAACCTG
CTGCGCAATG CCTTCAAATA CGCCAGCAAG CGTATCTCGG TCAACGCGGA GCTGGTTGGC
GAAAATATAC AGATCCACGT CGACGACGAT GGCATCGGCA TTCCGCTGGA AGAGCGCGAA
CACATCTTTT CAGCCTTTAC CCGCCTCGAC CGTTCACGCG ACCGATCGAC GGGCGGCTAC
GGCCTGGGTC TGGCCATTGC CCGTCGCGTA CTGGAGTTGC ATGGCGGCAC CGCCATTGCC
GACGCTTCTC CTCTCGGCGG CGCCCGCTTT ACGCTGTCCT GGAAGGCGCA GCAGTAG
 
Protein sequence
MTGTEPTPED SGAKEHNQGL ARSLFRVSLP RWRGSRYSLS KLFFNFYLLA MGSFVAIAFT 
ADFVISTAQR GITDDYARRF MRGTITLIED ELFHHPRREW QKKIKDIDEK FSYNLGIVER
ITLDSKLTPS QVIKLDAGDI AIDHDGDIMY HRLGTSSQVL VVGPLASNRN PELKDRLPLE
LRLRLLTWSL IGVIFAIALW FWIRPIWRDL EALRQTARDL GDGHFDARSP AARTQLFAPL
SDTMNSMADR IRQLLATHRE LSCGISHELR TPIARMRFAL EMLSETEQRD ERERLWAMME
ADLDELDQLI DTSLTYARFE REAPQAHFSS VKFAEWLSDE VDAVRLLGRQ LEVVVDTGKL
PENLFVDLDR KAMPYALRNL LRNAFKYASK RISVNAELVG ENIQIHVDDD GIGIPLEERE
HIFSAFTRLD RSRDRSTGGY GLGLAIARRV LELHGGTAIA DASPLGGARF TLSWKAQQ