Gene Daro_3794 details


Gene Information

Locus tag: Daro_3794
Symbol:
ID: 3567950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4078821
End bp: 4080374
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 60%
IMG OID: 637682269
Product: methane/phenol/toluene hydroxylase:YHS
Protein accession: YP_286993
Protein GI: 71909406
COG category:
COG ID:
TIGRFAM ID:
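The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4078821
end_bp = 4080374
gene_length_bp = 1554
protein_length_aa = 517


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```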


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.00765464
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00329078
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACATGA AAGCCAACAA GAAAAAGCTG GGCCTGAAGG AAAAGTACGC CCACATGACA 
CGCGGCCTGG ATTGGGAAAC CACCTACCAG CCGAAGGACA AGGTATTTCC GCAAGCCACC
TTCGAGGGCA TCAAGGTGCA CGACTGGGAC AAGTGGGAAG ACCCCTTCCG CCTGACCATG
GATGCCTACT GGAAATACCA GGCTGAAAAG GAGCGCAAGC TGTACGCCGT GCTCGACGCC
TTCGCCCAGA ACAACGGCCA CCTCGGGATT ACCGATGCGC GCTATCTGAG TGCCGTCAAA
CTGTTCCTGA CCGGCATTTC GCCGCTGGAG TATATGGCCC ACCGTGGTTT CGCTGCAGCC
GGCCGTAATT TCCCCGGCGT TGGTCCCCGC GTTGCCTGCC TGATGCAGTC GATCGACGAA
GTGCGTCATG CCCAGACCCA GATCCACGCC CTGTCGAACT ACAACAAGTT CTACGAAGGC
TTCCATGCCG GCGCAAGCCA CCAGATCGAA CGCCTCTGGT ACCTGTCGGT GCCCAAGTCC
TTCTTCGACG ACGCCTTCAG TGCCGGCCCC TTCGAATGGA TGATCGCCAT CGGCTTCTCC
TTCGAATACG TGCTGACCAA CCTGTTGTTC GTGCCCTTCA TGTCCGGTGC TGCCTACAAC
GGCGACATGG CGACCGTGAC CTTCGGTTTC TCGGCGCAAT CCGACGAAGC CCGCCACATG
ACGCTGGGCC TCGAGTGCAT CAAGTTCATG CTCGAACAGG ATCCGGACAA CCTGCCCATC
GTCCAGAAGT GGATCGACAA GTGGGCCTGG CGCGGTATCC GCGTGTTGAG CCTGGTCTCC
ACGATGATGG ACTACATGCT GCCGAAGCGC GTGATGAGCT GGAAGGAAGC CTGGGAAATC
TACTTCGAGC AGAACGGCGG CGCGCTGTTC AACGATCTGG CCAAGTACGG CATCAAGGTC
CCGGACTGCA TCGCCCAATG CACCGTCGAC AAGGAGCATC AGTCGCACCA GCTGTGGCTG
ACCTTGTGCA CCCACTCGCA TGCGATGGGC CTGCACACCT GGCTGCCCGA TGCCGACGAG
ATGGACTGGT TGTCGGCGAA ATACCCGAAC ACCTTCGACA AGTACTACCG TCCGCGTTTC
GACGAGCTGC GCGAACGTGC CGACAAGGGC GAGCGCTTCT TCGCCAACAC GCTGCCCATG
CTGTGCCAGG TCTGCCAGAT TCCGATGCTC TTCACCGAGC CGGACGACCC GACCAAGATC
TGCTATCGCG AGTCGGAGTT CCAGGGCGAG AAGTACCACA CCTGCTCGGA TGGCTGTAAG
CACATCTTTG ACGACGAGCC GGAGAAGTAC ATCCAGGCCT GGTTGCCGGT ATATCAGCTT
TATCAGGGCA ATTGCTGGCC GGAGGGCACT GATCCGACGG CCGAAGGATT CAATCCGGTG
GCCAAGTATC TCGAATGGTG CCACATCGAT GCCAAGGACA CCGGCGATTA CGAAGGCTCC
GGTGACCAGG CGAACTTTGC TGCCTGGCGT GGCGCGGCCA CCCAGAACAC CTGA
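The GC content reported for this gene can be recomputed from a sequence laid out in space-separated blocks of ten, as above. This is a minimal sketch; `gc_content` is a hypothetical helper name, and the example call uses only the first line of the gene sequence, so its result is not the whole-gene figure.

```python
def gc_content(dna):
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence only; pass the full sequence to
# compare against the reported GC content.
first_line = "ATGGACATGA AAGCCAACAA GAAAAAGCTG GGCCTGAAGG AAAAGTACGC CCACATGACA"
print(f"{gc_content(first_line):.1%}")
```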
 
Protein sequence
MDMKANKKKL GLKEKYAHMT RGLDWETTYQ PKDKVFPQAT FEGIKVHDWD KWEDPFRLTM 
DAYWKYQAEK ERKLYAVLDA FAQNNGHLGI TDARYLSAVK LFLTGISPLE YMAHRGFAAA
GRNFPGVGPR VACLMQSIDE VRHAQTQIHA LSNYNKFYEG FHAGASHQIE RLWYLSVPKS
FFDDAFSAGP FEWMIAIGFS FEYVLTNLLF VPFMSGAAYN GDMATVTFGF SAQSDEARHM
TLGLECIKFM LEQDPDNLPI VQKWIDKWAW RGIRVLSLVS TMMDYMLPKR VMSWKEAWEI
YFEQNGGALF NDLAKYGIKV PDCIAQCTVD KEHQSHQLWL TLCTHSHAMG LHTWLPDADE
MDWLSAKYPN TFDKYYRPRF DELRERADKG ERFFANTLPM LCQVCQIPML FTEPDDPTKI
CYRESEFQGE KYHTCSDGCK HIFDDEPEKY IQAWLPVYQL YQGNCWPEGT DPTAEGFNPV
AKYLEWCHID AKDTGDYEGS GDQANFAAWR GAATQNT
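The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the usual NCBI layout: bases T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence; the result matches the first
# 20 residues of the protein sequence.
first_line = "ATGGACATGA AAGCCAACAA GAAAAAGCTG GGCCTGAAGG AAAAGTACGC CCACATGACA"
print(translate(first_line))  # MDMKANKKKLGLKEKYAHMT
```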