Gene Csal_0347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0347 
Symbol 
ID4026891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp387394 
End bp388581 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content65% 
IMG OID637965496 
Product4-hydroxybenzoate 3-monooxygenase 
Protein accessionYP_572408 
Protein GI92112480 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR02360] 4-hydroxybenzoate 3-monooxygenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCC AAGTCGCGAT CATCGGTGCC GGCCCCTCCG GACTGCTGCT GGGGCAGTTG 
CTGCAACGCG CGGGGATCAA CAACGTCATC CTCGAGCGGC GCAGCGGCGA ATACGTCTTG
AGCCGCATCC GTGCCGGCGT ACTGGAGCAG GGCATGGTCG ATCTGTTGCG CGAAGCCGGT
GTCGACCGCC GCATGGATGC CGAGGGCCTG CCGCACGACG GCGTCGAGCT GGCCTTCGAC
AACCGTCGGG TGCGTATCGA CCTCGCGGCG TTGACCGGCG GCAAGCAGGT CATGGTCTAC
GGGCAGACCG AGGTGACTCG GGATCTGATG GAGGCACGCG CCGCCGAGGG CGGCCAGACG
CTCTATGAAG TGGACAAGGT GCAGCCCCAC GATCTGGAAA CCGACGCCCC GTACATCACC
TTCGAGCACA ACGGCGAAAC GCAGCGGCTC GACTGCGACT ATGTCGCCGG CTGCGATGGA
TATCATGGCG TCTCGCGCCA GTCGATTCCC GCCGACCGGC TGAAGACGTT CGAGCGGGTG
TATCCCTTCG GCTGGCTGGG GCTGCTCTCC GACACGCCGC CGGTCTCCGA CGAGTTGATC
TATGCGCGCC ACGAGCGGGG CTTCGCACTT TGCAGCATGC GTTCCCAGAC CCGCAGCCGC
TACTACGTGC AGGTGCCGCT GGACGAGAAG GTCGAGGACT GGTCCGATGC GCGCTTCTGG
GAGGAACTCA AGCGCCGCCT GCCCGAGGAC GTCGCGGCCA ATCTGGTGAC CGGTCCCTCG
CTCGAGAAGA GCATCGCGCC GTTGCGCAGC TTCGTGGCCG AGCCGATGCA GCACGGGCGG
CTGTTCCTGG TCGGCGATGC CGCGCACATC GTGCCGCCGA CCGGCGCCAA GGGGCTCAAC
CTGGCGGCCA GCGACGTCAA CACGCTGTAT CGCCTGATGG TCAAGGTCTA TCACGAGGGC
CGCACCGACC TGGTGCCGCG TTATTCGCAG ACCTGTCTCA AGCGTGTCTG GAAGGCCGAG
CGGTTTTCCT GGTGGATGAC CTCGATCCTC CACAAGTTTT CCGAGGACGA GGATTTCGGC
GCCCGCATGC AACAAGCCGA GCTGGACTAT GTCACCGGCT CCGAGGCAGG CCTGACGACC
ATCGCCGAGA ACTACGTCGG CTTGCCCTAT GAGCCCCTGG AGTCCTAG
 
Protein sequence
MKTQVAIIGA GPSGLLLGQL LQRAGINNVI LERRSGEYVL SRIRAGVLEQ GMVDLLREAG 
VDRRMDAEGL PHDGVELAFD NRRVRIDLAA LTGGKQVMVY GQTEVTRDLM EARAAEGGQT
LYEVDKVQPH DLETDAPYIT FEHNGETQRL DCDYVAGCDG YHGVSRQSIP ADRLKTFERV
YPFGWLGLLS DTPPVSDELI YARHERGFAL CSMRSQTRSR YYVQVPLDEK VEDWSDARFW
EELKRRLPED VAANLVTGPS LEKSIAPLRS FVAEPMQHGR LFLVGDAAHI VPPTGAKGLN
LAASDVNTLY RLMVKVYHEG RTDLVPRYSQ TCLKRVWKAE RFSWWMTSIL HKFSEDEDFG
ARMQQAELDY VTGSEAGLTT IAENYVGLPY EPLES