Gene RPB_0907 details


Gene Information

Locus tag: RPB_0907
Symbol:
ID: 3909087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1044169
End bp: 1045515
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 64%
IMG OID: 637882800
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_484529
Protein GI: 86748033
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
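
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes one terminal stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(1044169, 1045515)
print(length, protein_length(length))  # prints: 1347 448
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.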


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.78575
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATCA ACGCCGCACC GCAGATCATT GGCCACGGTT CGCAGGGCGT CACGCCCGGC 
TACATGTCGG GCTTCGGCAA TTCGTTCGAA ACCGAAGCTC TCCCCGGCGC GCTGCCGATC
GGCCGCAACT CGCCGCAGCG CGCGGCTTAC GGCCTCTATG CCGAGCAATT ATCAGGCTCG
CCGTTCACCG CGCCGCGCGG CGCCAATGAG CGAAGCTGGC TGTATCGCAT CCGCCCCTCG
GTGAAGCACT CCGGCCGCTT TACCAAGGCG GACATGGGCC TGTGGCGCTC GGCGCCGTGT
CTCGAATACG ACATGCCGAT CGCGCAGCTG CGCTGGGACG CGCCGTCGAT GCCGCAGGAG
GATCTGACGT TCCTGCAAGG CGTGCGAACG ATGACGACCG CCGGCGATGT GAATACGCAG
GCCGGCATGG CGACGCATAT GTATCTGATC ACCCAATCGA TGGTCGATCA GCATTTCTAC
AATGCCGACG GTGAATTGAT GTTCGTGCCG CAGCAGGGCA GCCTGCGGCT GGTCACGGAA
TTCGGCGTCA TCAGCATCGA GCCCGCCGAA ATCGCGGTGA TCCCGCGCGG CGTCAAGTTT
CGCGTCGAAC TGGTCGACGG CCCGGCGCGC GGCTATTTGT GTGAGAATTA CGGCGGCGCC
TTCACGCTGC CGGAGCGCGG CCCGATCGGC GCCAATTGCC TGGCCAATTC GCGCGATTTC
CTGACGCCGG TGGCGGCCTA TGAGGACAGG GACGTGCCGA CCGAATTGTT CGTGAAATGG
GGCGGGGCGC TGTGGCAGAC CACGCTGCCG CATTCGCCGA TCGATGTGGT CGCGTGGCAT
GGCAACTACG CGCCGTACAA ATACGATCTG CGCACCTTCT CGCCGGTCGG CGCGATCGGC
TTCGATCATC CCGATCCGTC GATCTTCACC GTGCTGACGT CGCCGTCGGA AACCGCCGGC
ACCGCCAATA TAGACTTCGT GATCTTCCCC GAGCGCTGGA TGGTGGCGGA AAACACCTTC
CGGCCGCCCT GGTACCACAT GAATATCATG TCGGAGTTCA TGGGGTTGAT CTGCGGCGTC
TACGACGCCA AGCCGCAGGG CTTCGTCCCC GGCGGCGCGT CGCTGCACAA CATGATGCTG
CCGCACGGGC CGGATCGCGA GGCGTTCGAT CATGCCTCGA ACGGCGAGCT GAAGCCGGTG
AAACTCACCG GCACGATGGC CTTCATGTTC GAGACCCGCT ATCCGCAGCG CGTCACCGAA
TATGCCGCGA CCGCCGGCAC GCTGCAGGAC GACTACGCCG ATTGCTGGCG CGGCCTGGAG
AAGCGCTTCG ACCCGAGCCG GCCATGA
 
Protein sequence
MNINAAPQII GHGSQGVTPG YMSGFGNSFE TEALPGALPI GRNSPQRAAY GLYAEQLSGS 
PFTAPRGANE RSWLYRIRPS VKHSGRFTKA DMGLWRSAPC LEYDMPIAQL RWDAPSMPQE
DLTFLQGVRT MTTAGDVNTQ AGMATHMYLI TQSMVDQHFY NADGELMFVP QQGSLRLVTE
FGVISIEPAE IAVIPRGVKF RVELVDGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF
LTPVAAYEDR DVPTELFVKW GGALWQTTLP HSPIDVVAWH GNYAPYKYDL RTFSPVGAIG
FDHPDPSIFT VLTSPSETAG TANIDFVIFP ERWMVAENTF RPPWYHMNIM SEFMGLICGV
YDAKPQGFVP GGASLHNMML PHGPDREAFD HASNGELKPV KLTGTMAFMF ETRYPQRVTE
YAATAGTLQD DYADCWRGLE KRFDPSRP
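
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any computation. The following is a minimal sketch of that cleanup, together with a GC percentage and a basic coding-sequence check. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Remove all whitespace (block spacing, newlines) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if length is a multiple of 3 and the last codon is a stop."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Toy input in the same blocked layout as the record.
print(gc_percent("ATGC GGCC"))        # prints: 75.0
print(ends_with_stop("ATG GCC TGA"))  # prints: True
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared against the GC content field; no result for the full sequence is asserted here.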