Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0347 |
Symbol | |
ID | 4026891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 387394 |
End bp | 388581 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965496 |
Product | 4-hydroxybenzoate 3-monooxygenase |
Protein accession | YP_572408 |
Protein GI | 92112480 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR02360] 4-hydroxybenzoate 3-monooxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCC AAGTCGCGAT CATCGGTGCC GGCCCCTCCG GACTGCTGCT GGGGCAGTTG CTGCAACGCG CGGGGATCAA CAACGTCATC CTCGAGCGGC GCAGCGGCGA ATACGTCTTG AGCCGCATCC GTGCCGGCGT ACTGGAGCAG GGCATGGTCG ATCTGTTGCG CGAAGCCGGT GTCGACCGCC GCATGGATGC CGAGGGCCTG CCGCACGACG GCGTCGAGCT GGCCTTCGAC AACCGTCGGG TGCGTATCGA CCTCGCGGCG TTGACCGGCG GCAAGCAGGT CATGGTCTAC GGGCAGACCG AGGTGACTCG GGATCTGATG GAGGCACGCG CCGCCGAGGG CGGCCAGACG CTCTATGAAG TGGACAAGGT GCAGCCCCAC GATCTGGAAA CCGACGCCCC GTACATCACC TTCGAGCACA ACGGCGAAAC GCAGCGGCTC GACTGCGACT ATGTCGCCGG CTGCGATGGA TATCATGGCG TCTCGCGCCA GTCGATTCCC GCCGACCGGC TGAAGACGTT CGAGCGGGTG TATCCCTTCG GCTGGCTGGG GCTGCTCTCC GACACGCCGC CGGTCTCCGA CGAGTTGATC TATGCGCGCC ACGAGCGGGG CTTCGCACTT TGCAGCATGC GTTCCCAGAC CCGCAGCCGC TACTACGTGC AGGTGCCGCT GGACGAGAAG GTCGAGGACT GGTCCGATGC GCGCTTCTGG GAGGAACTCA AGCGCCGCCT GCCCGAGGAC GTCGCGGCCA ATCTGGTGAC CGGTCCCTCG CTCGAGAAGA GCATCGCGCC GTTGCGCAGC TTCGTGGCCG AGCCGATGCA GCACGGGCGG CTGTTCCTGG TCGGCGATGC CGCGCACATC GTGCCGCCGA CCGGCGCCAA GGGGCTCAAC CTGGCGGCCA GCGACGTCAA CACGCTGTAT CGCCTGATGG TCAAGGTCTA TCACGAGGGC CGCACCGACC TGGTGCCGCG TTATTCGCAG ACCTGTCTCA AGCGTGTCTG GAAGGCCGAG CGGTTTTCCT GGTGGATGAC CTCGATCCTC CACAAGTTTT CCGAGGACGA GGATTTCGGC GCCCGCATGC AACAAGCCGA GCTGGACTAT GTCACCGGCT CCGAGGCAGG CCTGACGACC ATCGCCGAGA ACTACGTCGG CTTGCCCTAT GAGCCCCTGG AGTCCTAG
|
Protein sequence | MKTQVAIIGA GPSGLLLGQL LQRAGINNVI LERRSGEYVL SRIRAGVLEQ GMVDLLREAG VDRRMDAEGL PHDGVELAFD NRRVRIDLAA LTGGKQVMVY GQTEVTRDLM EARAAEGGQT LYEVDKVQPH DLETDAPYIT FEHNGETQRL DCDYVAGCDG YHGVSRQSIP ADRLKTFERV YPFGWLGLLS DTPPVSDELI YARHERGFAL CSMRSQTRSR YYVQVPLDEK VEDWSDARFW EELKRRLPED VAANLVTGPS LEKSIAPLRS FVAEPMQHGR LFLVGDAAHI VPPTGAKGLN LAASDVNTLY RLMVKVYHEG RTDLVPRYSQ TCLKRVWKAE RFSWWMTSIL HKFSEDEDFG ARMQQAELDY VTGSEAGLTT IAENYVGLPY EPLES
|
| |