Gene TM1040_0561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0561 
Symbol 
ID4077912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp597039 
End bp598208 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content59% 
IMG OID638005858 
Product4-hydroxybenzoate 3-monooxygenase 
Protein accessionYP_612556 
Protein GI99080402 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR02360] 4-hydroxybenzoate 3-monooxygenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.061935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.130471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACAC AGGTTGTAAT TGTTGGTGGC GGCCCATCAG GAATGCTGCT GGGACAGCTT 
TTGCACCTCA ATGGCATCGA CACCATCGTC CTGGAACGAC GGACCAAGGA GCATGTGCTC
AGCCGCATTC GCGCCGGCAT CCTCGAGCAG GGCTTGGTTG AGCTGATGCA TAAAGCAGGC
GTCGGTGCGC GGCTCGAGCG CGAGAGCTTT CGCCATCACG GGACATTGAT TTCACACAAT
GACGAGATGT TCGGCATAAA TTTTGAGCGC CTGATCGGTA AATCGGTGAC GCTATATGGC
CAAACCGAGG TGACCCGCGA TCTCTATGAG GCGCGCGAAA GCGTTGGCGC GACCACGTTT
TTTGACGTCG AAGATGCCAC AATCCACGAT GCTGACACCG AGAGCCCCTA TGTCACCTTT
CAAAAGGACG GCAAAGAGAC TCGCATCGAT TGCGATTTCA TCGCGGGCTG CGACGGGTTT
CATGGTGTCA GCCGACGCAC GATCCCGGCC TCTGTCCGCA CGGAATACGA AAAAGTCTAT
CCCTTTGGCT GGCTCGGCAT CCTGTCCGAA ACCCCGCCCG TCAATGAGGA GTTGATCTAC
GCCAATTCAG AGGACGGGTT CGCGCTCTGT TCGATGCGCA ACGCCAATCT CAGCCGCTAT
TACGTTCAAT GCTCTCTGGG CGATGACGTG GGCGATTGGA CAGATACCCG GTTCTGGGAC
ACCCTGCGCC GCCGCCTGCC GAGCGAGGTC GCAGAGGCCC TGGTCACAGG CCCCTCGATC
GAGAAGTCCA TCGCACCGCT GCGCTCGTTT GTGAGCGAGC CAATGCGCTG GGGGCGGCTG
TTCCTCTGCG GCGATGCGGC TCATATCGTG CCGCCAACCG GGGCGAAGGG TCTGAATACT
GCCGCCTCGG ACGTGCATTA CCTCTACACG GGATTGATCC AGTATTATGA GGACAAAGAC
AGCGAAGGGA TCGATCGCTA CTCCGAAAAA GCCCTCGCCC GTGTTTGGAA GGCGGAGCGG
TTCAGCTGGT GGATGACGTC CTTGCTGCAT CGGTTCCCCG ACCAAGGTCC GTTTGACGTA
AAGATGCAGG CGGCAGAACT GGCGTTCCTG CGCGACAACA AGGACGCGCA ACGCGTGCTT
GCCACCAACT ATGTCGGGCT GCCTTACTGA
 
Protein sequence
MRTQVVIVGG GPSGMLLGQL LHLNGIDTIV LERRTKEHVL SRIRAGILEQ GLVELMHKAG 
VGARLERESF RHHGTLISHN DEMFGINFER LIGKSVTLYG QTEVTRDLYE ARESVGATTF
FDVEDATIHD ADTESPYVTF QKDGKETRID CDFIAGCDGF HGVSRRTIPA SVRTEYEKVY
PFGWLGILSE TPPVNEELIY ANSEDGFALC SMRNANLSRY YVQCSLGDDV GDWTDTRFWD
TLRRRLPSEV AEALVTGPSI EKSIAPLRSF VSEPMRWGRL FLCGDAAHIV PPTGAKGLNT
AASDVHYLYT GLIQYYEDKD SEGIDRYSEK ALARVWKAER FSWWMTSLLH RFPDQGPFDV
KMQAAELAFL RDNKDAQRVL ATNYVGLPY