Gene RPC_0891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0891 
Symbol 
ID3969810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp985316 
End bp986662 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content66% 
IMG OID637924007 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_530780 
Protein GI90422410 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.669112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCA CCACCGCTCC CGGGCTGATC GGCCGCAGCA CGCAAGCCAT CACGCCCGGC 
TATATGTCGG GCTTCGGCAA TTCGTTCGAG ACCGAGGCGC TGCCCGGCGC GCTGCCGATC
GGGCGCAACT CGCCGCAGCG CGCGCCCTAC GGGCTTTACG CCGAGCAATT GTCGGGCTCG
CCGTTCACCG CGCCGCGCGG CTCTAACGAG CGCTCCTGGC TGTATCGCAT CCGCCCCTCG
GTGCAGCACT CCGGCCGCTT CGAGAAAGCC GAGGCCGGGC TGTGGCGCTC CGCACCCTGC
CATGAGCACG ACATGCCGAT CGCGCAATTG CGCTGGGACC CGCCGCCGCT GCCGCAGCGC
GCGCAGACCT TTCTGCAGGG CGTCGAGACC ATGACCACGG CGGGCGACGT CAATACGCAA
GCCGGCATGG CGGCGCATAT GTATTTGATC AGCGCCTCGA TGGTGAACCA GCATTTCTAC
AATGCCGACG GCGAATTGAT GTTCGTGCCG CAGCAGGGTG GCTTGCGCCT CGTCACCGAA
TTCGGCGTGA TCGGCGTCGC GCCCGGCGAG ATCGCGGTGA TTCCGCGCGG CGTCAAGTTT
CGCGTCGAGC TGATCGACGG GCCGGCGCGC GGCTATCTGT GCGAGAATTA CGGCGGCGGC
TTCACGCTGC CGGAGCGCGG CCCGATCGGG GCCAATTGCC TTGCGAACGC ACGCGACTTC
CTCACGCCGG TCGCGGCTTA TGAAGATAGC GACACGCCGA CCGAGCTCTA CGTCAAATGG
GGCGGCGCGC TGTGGGTGAC GCAGTTGCCG CATTCGCCGA TCGACGTGGT GGCCTGGCAC
GGCAACTACG CGCCGTACAA ATATGATCTG CGCACCTTCT CGCCGGTCGG CGCGATCGGC
TTCGATCATC CCGATCCGTC GATCTTCACC GTGCTGACCT CGCCCTCGGA GACCGCCGGC
ACCGCCAATA TCGACTTCGT GATCTTCCCG GAGCGCTGGA TGGTGGCGGA GAACACCTTC
CGCCCGCCGT GGTATCACAT GAACATCATG TCGGAATTCA TGGGGCTGAT TTATGGCGTG
TACGACGCCA AGCCGCAGGG CTTTCTGCCC GGCGGCGCCT CGCTGCACAA CATGATGCTG
CCGCACGGTC CGGACCGCGA GGCGTTCGAT CACGCGTCGA ACGCCGAGCT GAAGCCGGTG
AAGCTCGAAG GCACCTTGGC CTTCATGTTC GAGACCCGCT ATCCGCAGCG CGTCACCGTG
CACGCCGCGA CTTCCAGCAC GCTGCAGGCC GACTACGCTG AGTGCTGGCG CGGGTTGCAA
AAGCGCTTCG ATCCGACCAA ACCCTGA
 
Protein sequence
MNITTAPGLI GRSTQAITPG YMSGFGNSFE TEALPGALPI GRNSPQRAPY GLYAEQLSGS 
PFTAPRGSNE RSWLYRIRPS VQHSGRFEKA EAGLWRSAPC HEHDMPIAQL RWDPPPLPQR
AQTFLQGVET MTTAGDVNTQ AGMAAHMYLI SASMVNQHFY NADGELMFVP QQGGLRLVTE
FGVIGVAPGE IAVIPRGVKF RVELIDGPAR GYLCENYGGG FTLPERGPIG ANCLANARDF
LTPVAAYEDS DTPTELYVKW GGALWVTQLP HSPIDVVAWH GNYAPYKYDL RTFSPVGAIG
FDHPDPSIFT VLTSPSETAG TANIDFVIFP ERWMVAENTF RPPWYHMNIM SEFMGLIYGV
YDAKPQGFLP GGASLHNMML PHGPDREAFD HASNAELKPV KLEGTLAFMF ETRYPQRVTV
HAATSSTLQA DYAECWRGLQ KRFDPTKP