Gene RPD_1018 details

Gene Information

Locus tag: RPD_1018
Symbol:
ID: 4021493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1156630
End bp: 1157976
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 65%
IMG OID: 637961209
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_568157
Protein GI: 91975498
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
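
The coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1156630
end_bp = 1157976
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1347 bp is 449 codons, of which 448 encode amino acids.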


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.266165
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTCA ACGCCGCGCC TGAGATCGTC GGCCGTGCTT CGCAGGGCGT CACGCCGGGC 
TACATGTCCG GCTTCGGCAA TTCGTTCGAG ACCGAGGCGC TGCCCGGCGC GTTGCCGGTC
GGCCGCAATT CGCCGCAGCG TGCGGCCTAT GGGCTCTATG CCGAGCAGTT GTCCGGCTCG
CCCTTCACCG CGCCGCGCGG CGCCAATGAG CGCTCGTGGC TGTATCGCAT CCGACCGTCG
GTGAAGCATT CCGGCCGGTT CGCGAAAGCC GATATGGGGT TGTGGCGCTC GGCGCCTTGC
CTCGAACACG ACATGCCGAT CGCTCAGCTC CGGTGGGATG CGCCGCCGAT GCCGACCGAG
GAGGTGACCT TCGTGCAGGG CGTGCGGACG ATGACCACGG CCGGCGATGT GAACACCCAA
GCCGGCATGG CCGCGCATAT GTACCTGATC AGCCGGTCGA TGGTCGATCA GCATTTCTAC
AATGCCGATG GCGAGCTGAT GTTCGTGCCG CAGCAAGGTC GATTGCGGCT CGTCACCGAA
TTCGGCGTGA TCGCGATCGA GCCGGCCGAG ATCGCGGTGA TCCCGCGCGG CGTCAAGTTC
CGCGTCGAGC TGGTCGATGG TCCGGCGCGC GGTTATCTCT GCGAGAATTA CGGCGGCGCG
TTCACCCTGC CGGAGCGCGG CCCGATCGGC GCCAATTGCC TCGCCAATTC GCGCGATTTC
CTGACGCCGG TCGCATCCTA CGAGGACAAG GACACGCCGA CCGAGCTGTT CGTGAAATGG
GGCGGGGCGC TGTGGCGGAC GAGTTTGCCG CATTCGCCGA TCGACGTGGT CGCCTGGCAC
GGCAACTACG CGCCGTATAA ATACGATCTG CGAACGTTCT CGCCGGTCGG CGCGATCGGC
TTCGACCATC CCGATCCGTC GATCTTCACC GTGCTGACCT CGCCGTCGGA GACCGCGGGC
ACGGCGAATA TCGACTTCGT GATCTTTCCC GAGCGCTGGA TGGTGGCGGA AAACACCTTC
CGCCCGCCCT GGTATCACAT GAACATCATG TCGGAGTTCA TGGGGCTGAT CTACGGCGTC
TATGACGCCA AGCCGCAGGG CTTCGCTCCG GGCGGCGCGA GCCTGCACAA CATGATGCTG
CCGCACGGGC CGGATCGCGA AGCGTTCGAT CATGCGTCGA ACGGCGAGCT GAAACCGGTC
AAGCTCACCG GCACGATGGC CTTCATGCTG GAGACCCGCT ATCCGCAGCG CGTCACCGAA
TACGCGGCGA CCGCCGACAC CTTGCAGGAT GACTACGCCG ATTGCTGGCG CGGCCTCGAG
AAGCGCTTCG ATCCGAGCCG GCCATGA
 
Protein sequence
MNVNAAPEIV GRASQGVTPG YMSGFGNSFE TEALPGALPV GRNSPQRAAY GLYAEQLSGS 
PFTAPRGANE RSWLYRIRPS VKHSGRFAKA DMGLWRSAPC LEHDMPIAQL RWDAPPMPTE
EVTFVQGVRT MTTAGDVNTQ AGMAAHMYLI SRSMVDQHFY NADGELMFVP QQGRLRLVTE
FGVIAIEPAE IAVIPRGVKF RVELVDGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF
LTPVASYEDK DTPTELFVKW GGALWRTSLP HSPIDVVAWH GNYAPYKYDL RTFSPVGAIG
FDHPDPSIFT VLTSPSETAG TANIDFVIFP ERWMVAENTF RPPWYHMNIM SEFMGLIYGV
YDAKPQGFAP GGASLHNMML PHGPDREAFD HASNGELKPV KLTGTMAFML ETRYPQRVTE
YAATADTLQD DYADCWRGLE KRFDPSRP
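
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence in this record. The sketch below assumes the elongation codon assignments of the standard genetic code, which translation table 11 shares. The helpers `translate` and `gc_content` are hypothetical names introduced for illustration, not part of any database tool. Only the first and last lines of the gene sequence are used; the last line begins in frame because the 1320 bases before it are a multiple of three.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G
# (first base varies slowest, third base fastest). '*' is a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string, ignoring spaces; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

def gc_content(dna):
    """Return the fraction of G and C bases, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)

# First and last lines of the gene sequence, copied from this record.
first_line = "ATGAACGTCA ACGCCGCGCC TGAGATCGTC GGCCGTGCTT CGCAGGGCGT CACGCCGGGC"
last_line = "AAGCGCTTCG ATCCGAGCCG GCCATGA"

print(translate(first_line))  # MNVNAAPEIVGRASQGVTPG
print(translate(last_line))   # KRFDPSRP*
```

Both translations match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TGA stop codon. Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field, although that field may be rounded or computed over a different span.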