Gene Rsph17029_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2119 
Symbol 
ID4895601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2246667 
End bp2248160 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content70% 
IMG OID640112713 
ProductUbiD family decarboxylase 
Protein accessionYP_001043994 
Protein GI126462880 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.381426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.478377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCC TGCCCATCTT CCCGGATCTC CGGGCCTTTC TCGACTGGTG CGCCCGAAAG 
GGCGACCTCT CCCGGATCGC CGAGCCGGTC TCGCTGCGGC ATGAAACCAC AGCCGTGGCC
TCGAAGATCC TGCGCGGCGG CGGCCCCGTG CTGCGGTTCG AGAGCCTGCG CGACACCTCG
GCCACCATGC CGCTCGTGGC CAACCTCTTC GGCACGCGCG AGCGGGTGGC GGCGGGCCTC
GGCCTCGGGC TCGAGCAGAT CCCCGAGCTC GGCGCCTTCC TCGCCGCCCT GCGGGCCCCC
GCCCCGGTGG CGGGGATGCG CGACGCGCTC TCGCGCTGGC CGCAGCTTCA GGCCGCGCTG
AACACGCGCG CGAAGATCGT GCGGTCGGCC GAGGCGCAGG AGGTGGTCCA CGAGGGCACC
GCGGTGGATC TGGGCATGAT CCCGGTGCCC ACCTGCTGGC CGGGCGATGC GGGCCCTCTG
GTGACATGGC CCGTGGTGCT GACGCGGCCG CACGGCACCT CCGCCGAAGA GACGCTGCAC
TACAATGCCG GCGTCTACCG CGCGCAGGTG ATCGGGCGCG ACCGGCTCAT CATGCGCTGG
CTCGCCCATC GCGGCGGCGC CGGGCACTGC CGCAGCTGGA TGCGTGCGGG CGAGCCGATG
CCGGTGGCGC TGGCGCTCGG CTGCGATCCG GCGCTCCTTC TGGCCGCGGC GCTGCCCCTC
CCTGAGCAGG TGTCCGAGCT GACCTTCTCG GGCGTGCTCC GGGGCGCGCG GACGCCGCTG
GTGGCGGGGC GCACCGTGCC GCTGATGGTG CCCGCCACGG CGGAGATCGT GGTCGAGGGC
TGGATTCATC CCGGCGACAT GGCACCCGAG GGACCTTTCG GCGACCACAC CGGCTATTAC
AATTCGGTCG AGGATTTCCC GGTGCTGCGG GTCTCGGCCA TCACCCACCG CAAGGATCCG
CTCTATCTCA CCACCCACAC GGGCCGCCCG CCGGACGAGC CCTCGGTCAT CGGCGAGGTC
TTCAACGATC TGGCCATGCC GGTCTTCCGC CAGCAGATCC CCGAGGTGAG GGACCTTTAC
CTGCCCCCCG CCGCCTGCTC CTACCGCATC GCCATCGTCT CGATCGACAA GCGCTATCCC
GGTCAGGCGC GGCGGGTGAT GATGGCGCTC TGGGGGATGC TGGCGCAATT CTCCTACACC
AAGATGGTGA TCGTGGTGGA TGAGGACATC AACCCGCGCG ACTGGGACGA TGTGGCCTGG
GCGATGGCGA CCCGGATGGA CCCGTCGCGC GACGTGGTGC TGCTCGAGAA GACACCGATG
GACTATCTGG ATTTCGCCTC GCCCGAGCCG GGCCTTGCCG GCAAGATCGG CATCGACGCC
ACCAACAAGA TCGGCCCCGA GACCCATCGC GAATGGGGCG AGGTCATGGC GCAGTCGCCC
GAAGCCGAGG CCTTCGCCGA CCGGCTGATC GCGAAGCTGA GGATCGGAGC ATGA
 
Protein sequence
MRSLPIFPDL RAFLDWCARK GDLSRIAEPV SLRHETTAVA SKILRGGGPV LRFESLRDTS 
ATMPLVANLF GTRERVAAGL GLGLEQIPEL GAFLAALRAP APVAGMRDAL SRWPQLQAAL
NTRAKIVRSA EAQEVVHEGT AVDLGMIPVP TCWPGDAGPL VTWPVVLTRP HGTSAEETLH
YNAGVYRAQV IGRDRLIMRW LAHRGGAGHC RSWMRAGEPM PVALALGCDP ALLLAAALPL
PEQVSELTFS GVLRGARTPL VAGRTVPLMV PATAEIVVEG WIHPGDMAPE GPFGDHTGYY
NSVEDFPVLR VSAITHRKDP LYLTTHTGRP PDEPSVIGEV FNDLAMPVFR QQIPEVRDLY
LPPAACSYRI AIVSIDKRYP GQARRVMMAL WGMLAQFSYT KMVIVVDEDI NPRDWDDVAW
AMATRMDPSR DVVLLEKTPM DYLDFASPEP GLAGKIGIDA TNKIGPETHR EWGEVMAQSP
EAEAFADRLI AKLRIGA