Gene Daro_3820 details


Gene Information

Locus tag: Daro_3820
Symbol:
ID: 3567976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4103332
End bp: 4104837
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 62%
IMG OID: 637682294
Product: methane/phenol/toluene hydroxylase:YHS
Protein accession: YP_287018
Protein GI: 71909431
COG category:
COG ID:
TIGRFAM ID:
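
The coordinates and lengths in this record are consistent with each other, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length counts the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4103332, 4104837)
print(length)                     # 1506
print(protein_length_aa(length))  # 501
```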


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value: 0.898066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0706856
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTGC TTAATAGAAT GGACTGGTAC GACCTGGCCA GGACTACCAA CTGGACGCCC 
AAGTACGTGA CCGAAGATGA ACTCTTCCCG CCACTGTTGG CAGGGGATTT CGGTCTGCCG
CAAAACGCCT GGGAAAAATA CGACGAGCCG TACAAGCAGA CCTACCCGGA ATACGTGAAG
GTCCAGCGCG ACAAGGATGC CGGTGCCTAC TCGGTGAAAG CTGCGCTGGA GCGTAGCCGG
ATTTACGAAA ATGCCGATCC GGGCTGGAAG TCGGTCATGA AGGCCCACTA CGGCGCCATC
GCCCGCGGCG AATATGCCGC GGCCAGTGCC GAGGCCCGCA TGATGCGCTT CTCCAAGGCG
CCGGGCATGC GCAACATGTC GACGCTGGGT TGTCTGGATG AAATCCGCCA CGGCCAGATG
CAGCTCTACT TCCCGCACGA GCACGTCTCC AAGGATCGTC AGATGGACTG GGCCTTCAAG
GCCTACGACA CCAACGAGTG GGCGATGATC GCGGCCCGTC ACTTCTTCGA CGACATCATG
ATGACCCGCG ACGCGATCAG CGTCTCGATC ATGTTGACCT TCAGCTTCGA AACCGGCTTC
ACCAACATGC AGTTCCTCGG CCTGGCGGCC GATGCCGCCG AAGCGGGCGA TCACACCTTT
GCCAACCTGA TCTCCAGCAT CCAGACCGAC GAGTCGCGTC ATGCCCAGAT CGGCGGCCCG
GCGCTGAAGG TGCTGATCGA GAACGGCCAC AAGGCCGAGG CGCAGAAGCG CGTCGACATC
GCCGTCTGGG GCGCCTGGAA GCTGTTCTCG GTGCTGACCG GTCCGATCAT GGATTACTAC
ACCCCGCTCG AGCACCGCAA GCAGTCGTTC AAGGAATTCA TGGAGGAATG GATCGTTGCC
CAGTTCGAGC GCGCCCTGAC CGACATGGGC CTCGATTTGC CCTGGTACTG GGACATCTTC
CTGAAGGACA TCGCCCAGAC CCACCACGGC ATGCACCTCG GCTCCTATTT CTGGCGCCCG
ACCCTGTGGT GGAACCCGGC CGCCGGCGTG ACGCCGGACG AGCGGGCCTG GCTGGAAGAG
AAGTATCCCG GCTGGAACGA TACCTGGGGT CAGTGCTGGG ATGTGTTCAT CGACAACGTG
GTCGACGGCA ACATGGCCAT GACCTATCCG GAAACCCTGC CTTACGTCTG CAACATGTGT
CAGCTGCCGA TCCTCGGCAC GCCGGGCAAG GGCTGGAACG TCAAGGACTA CCCGCTCGAA
TACAACGGTC GCCTCTATCA CTTCGGTTCC GAAGTCGACC GCTGGGTCTT CGAGCAGGAG
CCGGAACGCT ACGCCGGCCA CCTGTCCATC GTCGACCGTT TCCTGGCCGG GATGATCCAG
CCGATGGATC TGGGCGGGGC GCTGCAGTAC ATGAACTTGG CGCCCGGCGA GATCGGCGAC
GACGCCCACA ACTATGCCTG GGCCGAAGTG TACCGGGCCA TGCGGGCTGC CAAGAAGGCT
GGCTGA
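
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. As a hedged sketch, the helpers below (hypothetical names) strip the whitespace and compute a GC percentage; only the first row of the sequence is used here for brevity, so the printed GC value is for that row, not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as shown in the record.
first_row = "ATGGCTGTGC TTAATAGAAT GGACTGGTAC GACCTGGCCA GGACTACCAA CTGGACGCCC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Passing the full 1506 bp sequence to `gc_percent` should give a value comparable to the GC content listed under Gene Information.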
 
Protein sequence
MAVLNRMDWY DLARTTNWTP KYVTEDELFP PLLAGDFGLP QNAWEKYDEP YKQTYPEYVK 
VQRDKDAGAY SVKAALERSR IYENADPGWK SVMKAHYGAI ARGEYAAASA EARMMRFSKA
PGMRNMSTLG CLDEIRHGQM QLYFPHEHVS KDRQMDWAFK AYDTNEWAMI AARHFFDDIM
MTRDAISVSI MLTFSFETGF TNMQFLGLAA DAAEAGDHTF ANLISSIQTD ESRHAQIGGP
ALKVLIENGH KAEAQKRVDI AVWGAWKLFS VLTGPIMDYY TPLEHRKQSF KEFMEEWIVA
QFERALTDMG LDLPWYWDIF LKDIAQTHHG MHLGSYFWRP TLWWNPAAGV TPDERAWLEE
KYPGWNDTWG QCWDVFIDNV VDGNMAMTYP ETLPYVCNMC QLPILGTPGK GWNVKDYPLE
YNGRLYHFGS EVDRWVFEQE PERYAGHLSI VDRFLAGMIQ PMDLGGALQY MNLAPGEIGD
DAHNYAWAEV YRAMRAAKKA G
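
The relationship between the two sequences can be illustrated with a small translation sketch. It assumes the standard codon assignments, which translation table 11 shares for sense and stop codons, and it does not handle ambiguous bases such as N. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as shown in the record.
first_row = "ATGGCTGTGC TTAATAGAAT GGACTGGTAC GACCTGGCCA GGACTACCAA CTGGACGCCC"
print(translate(first_row))  # MAVLNRMDWYDLARTTNWTP
```

The output matches the first twenty residues of the protein sequence, and the final six bases of the gene, GGCTGA, encode the terminal glycine followed by a stop codon.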