Gene RPB_1705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1705 
Symbol 
ID3908230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1940200 
End bp1941177 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content65% 
IMG OID637883599 
Product3,4-dihydroxyphenylacetate 2,3-dioxygenase HpaD 
Protein accessionYP_485324 
Protein GI86748828 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0346] Lactoylglutathione lyase and related lyases 
TIGRFAM ID[TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.159333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.870304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTGC CGCAGCACGT ATTCGACCCG CCGTTCAACA TCATCCGTTG CAGTCACGTC 
GTGCTCGACG TCACCGACCT TTCCGCCAGC GTCGATTTCT ATGCCGACAT CATCGGCCTG
CATGTCGAGG ATCGCGACGA CGCCACCGCC TATCTACGCG GCAGCGAGGA GCATCAGCAT
CACTCGGTGG TACTGCGCCA GGCGGACAAG CCGGCCGCGG CGCGGCTCGG CTTTCGCGTC
GGCACCGAAG CCGATCTCGA CAAGGCCGGC AGCTTCTTCG CCGAGAATGG GCTGATCTAC
AGCTTCGTCG ATCGCCCATT CCAGGGCCGC ACGCTGCACG TCACCGACCC GTTCGGCTTC
CGGCTCGAGT TCTGCGCCAG CATGGAGAAG CGGCCGCATC TTCTGCGCCG CTACGAATTG
TACAAAGGCT GTCACCCGCA GCGGCTCGAC CATTTCAACG TCTTCGCGGC CGAGACTCAG
GAGACCATCG ACTTCTACGC CCGGCTCGGC TTTCGCCTCA CCGAATACGC CGAAGAGGAC
GGCGACAACG GCCGCATCGC CGCGGCCTGG ATGCATCGCA AAGGCAACGT CCACGACTTC
GCCGTCACCA ACGGCCGCGG CCCGCGGCTG CATCATTTCG CCTATTGGGT GCCCGGTCCG
CTCAACATCA TCCATCTCTG CGACGTGATG GCGTCGCGGG GGCTCGGCCT CGAGCGCGGC
CCGGGCCGCC ACGGCATCTC GAACGCCTTC TTTCTCTACG TCCGCGATCC CGACGGCCAT
CGTATCGAGC TGTATTGCAG CGACTATCAG ACCATGGACC ACGACCACGC CCCGCTGCGC
TGGTCGCTAC GCGACCCGCG CCGGCAGACG CTGTGGGGCG CGCCGGCGCC GCGCTCCTGG
TTCGAGCAGG GCTCGGATTT CCTCGGCGAG ACGGTTCGCG AGCCGGCATT CGTGGCCGAT
GTGATGATTG CGGATTGA
 
Protein sequence
MPVPQHVFDP PFNIIRCSHV VLDVTDLSAS VDFYADIIGL HVEDRDDATA YLRGSEEHQH 
HSVVLRQADK PAAARLGFRV GTEADLDKAG SFFAENGLIY SFVDRPFQGR TLHVTDPFGF
RLEFCASMEK RPHLLRRYEL YKGCHPQRLD HFNVFAAETQ ETIDFYARLG FRLTEYAEED
GDNGRIAAAW MHRKGNVHDF AVTNGRGPRL HHFAYWVPGP LNIIHLCDVM ASRGLGLERG
PGRHGISNAF FLYVRDPDGH RIELYCSDYQ TMDHDHAPLR WSLRDPRRQT LWGAPAPRSW
FEQGSDFLGE TVREPAFVAD VMIAD