Gene Rsph17029_3671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3671 
Symbol 
ID4899015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp772679 
End bp773914 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content68% 
IMG OID640114279 
Product4-coumarate--CoA ligase 
Protein accessionYP_001045533 
Protein GI126464420 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR02372] 4-coumarate--CoA ligase, photoactive yellow protein activation family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0402717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCG AGGGGCCTCT GGCACTGGAA GATCGGGTGC TCGACCGGGA GGCGGTCGGG 
CGCCTCTGCG TCTCCCTGAT CGCGGCCGAG CAGCAGGACC TGCTGCGGGA AGGGCGCGTC
GGTCATCATC AGATGATCGG CGCGCGCCTC CTGACGGCAG GGCATCCGTC GCCCGACGAC
CTGCTGATCG ACGAAGACAC GCTGGGGCTC GACAGTCTGC TCATGCTCTC GCTCGTCACC
CGCGTGGCGG GCTTCTTCCA TCTGTCGGAT TCGAACACCG AGGATTATCT CCTCGTGCGG
CGCCGTCTGG GAGAGTGGGT GGATCTGATC GATCATCACC ACACCCTGAT GGGGCCGAAG
GCGCGCTTCA CCTTCGCGAC CTCGGGAAGC ATCGCAGGAC CGAAGCCCGT GACCCATAGC
GCCGCGGCAC TGCTCTCGGA AGGGCAGGCC ATCGCGAAGA TCCTCACGGA GCGGCCTCCC
GAGTTGCGCC GCGTCCTGTC CTGCGTTCCG GCCCACCACA TCTACGGCTT CCTCTGGTCC
TGCCTGTTTC CCTCCCGCCG CGGTCTCGAG GCGAAGCAAC TGGCGAACCT GTCCGCTTCC
GGCATCATGC GGCACGCGCG CTCCGGCGAT CTGGTGGTGG GCACGCCCTT CATCTGGGAG
CAGTTCGCGG ATCTGGACTA CCGGCTGCCC GACGACGTGG TCGGGGTGAC GTCCGGCGCA
CCCTCGACGG CCGAGACATG GCGCTGCGCC TCTGCGCTCG GCCCGGCGCG GATGCTGGAC
ATCTATGGCT CGACCGAAAC CGGGGGCATC GGCTGGCGCG AGCGCCGGGA CGACCCTTTC
CGAACCCTGC CCGATCTCGC CTGCTGCCAC GACACGTTGA GCAGGCTGGG CCGGCGGCTG
GACCTGCAGG ACGAGATCGC CTGGGACAAG GACGGCGGCT TCACGATTCT CGGCCGCAAG
GACGAGATCC TGCAGGTCGC GGGATCGAAC GTCTCTCCTG CCGCGGTCCG AGAGATCCTG
CTCCGGAACC CGCGTGTCCG GGATGCGGCG GTGCGGCTCG ACGGACGCAG GCTGAAGGCC
GTGATCTCTG TGGCGGAGGG CGCTGACGAG GCAGAGATCG AGATCGAACT GCGCGCGACT
GCGGCGCGGC ATCTTCCGGC ACCTGCCAGG CCGGACCGGT TCCTTTTCGC GACGCAACTC
CCGCGCACGG GTGCAGGGAA ATTGGCGGAC TGGTAG
 
Protein sequence
MTAEGPLALE DRVLDREAVG RLCVSLIAAE QQDLLREGRV GHHQMIGARL LTAGHPSPDD 
LLIDEDTLGL DSLLMLSLVT RVAGFFHLSD SNTEDYLLVR RRLGEWVDLI DHHHTLMGPK
ARFTFATSGS IAGPKPVTHS AAALLSEGQA IAKILTERPP ELRRVLSCVP AHHIYGFLWS
CLFPSRRGLE AKQLANLSAS GIMRHARSGD LVVGTPFIWE QFADLDYRLP DDVVGVTSGA
PSTAETWRCA SALGPARMLD IYGSTETGGI GWRERRDDPF RTLPDLACCH DTLSRLGRRL
DLQDEIAWDK DGGFTILGRK DEILQVAGSN VSPAAVREIL LRNPRVRDAA VRLDGRRLKA
VISVAEGADE AEIEIELRAT AARHLPAPAR PDRFLFATQL PRTGAGKLAD W