Gene RPB_4656 details


Gene Information

Locus tag: RPB_4656
Symbol:
ID: 3912474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5266879
End bp: 5268432
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 67%
IMG OID: 637886561
Product: benzoate-CoA ligase family
Protein accession: YP_488250
Protein GI: 86751754
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR02262] benzoate-CoA ligase family
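
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5266879
end_bp = 5268432
gene_length_bp = 1554
protein_length_aa = 517

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match gene length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match protein length
```

Under those assumptions all three checks print True for this record.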


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.653443
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCACGGCA TGACCGGGGC AGGGTCGTAC AATGCGGTGA GCTGGTTGCT CGACCGCAAC 
GTCGCCGAGG GGCGCGGCGA CAAGCTGGCC TATACCGACA CGGTTTCCGA GCTGAGCTAT
CGCGCGCTGC AAACGCAGAC CTGCCGCGCC GCCAACCTGA TGCGCCGCCT CGGCGTGCGC
CGCGAAGAGC GGGTGGCGAT GATCATGCTC GACACGGTGG AGTTTCCAGT GGTGTTTCTC
GGCGCGATCC GCGCCGGGGT GGTGCCGGTG CCGCTCAATA CGCTGCTGAC GGCCGAGCAA
TATGCCTATG TGCTGGCGGA TTGCCGCGCG CGTGTGCTGT TCGTCTCCGA AGCGCTGTAT
CCGGTGCTGA AGGACATTCT GTCCGGCCTG CCGGACCTCG CGCATGTCGT TGTCTCGGGC
GGCGATGCGC ACGGCCATCT GAAACTCGCC GACGAGTTGG CGCAGGAAAG CGACGCCTGC
GAAACCGCCG CGACCCATGC GGAGGAGCCG GCGTTCTGGC TGTATTCCTC GGGCTCGACC
GGGATGCCGA AGGGCGTGCG GCATCTGCAC GCCAACCTCG CCGCCACCGC CGAGACCTAT
GCCAGGCAGG TGCTCGGCAT CCGCGAGGAC GACGTCGTGC TGTCGGCAGC GAAGCTGTTC
TTCGCCTATG GGCTCGGCAA TTCGCTGACC TTCCCGCTGT CGGTCGGCGC CACCACGGTG
CTGAATTCGG AACGGCCGAC GCCGGCGGTC GTGTTCAAGC TGATGCAGCG CTACAATCCG
ACGATCTTCT GCGGCGTGCC GACGCTGTTC GCCGCGATGC TGAACGACTC CGCACTGAAG
AGCGAGGCCG CCGGTTCGCG ACTGCGAATC TGCACCTCGG CCGGCGAAGC ATTGCCGGAA
TCGGTGGGGC TAGCCTGGAA GGCGCGGTTC GGCGCGGACA TTCTCGACGG CGTCGGCTCG
ACCGAACTGC TGCACATCTT CCTGTCCAAT GCGCCCGGCG ACATCAAATA CGGCACCTCG
GGCAAGCCCG TGCCGGGCTA CAAGGTGCGG CTGGTCAACG AGACCGGCAC CGAGGTCGCC
GATGGCGAGG TCGGCGAATT GCTGGTCGAT GCGCCGTCGG CCGGCGAGGG CTACTGGAAT
CAGCGCAGCA AGAGCCGCGC GACCTTCGAG GGCAACTGGA CCCGCACCGG CGACAAGTAC
ATCCGCGATG CGGATGGCCG TTACACCTTC TGCGGCCGCG CCGACGACAT GTTCAAGGTG
TCGGGCATCT GGGTGTCGCC GTTCGAGGTC GAGAGCGCGC TGATCACGCA TCCGGCGGTG
CTCGAAGCCG CCGTCGTGCC GGACGCCGAT TTCGACGGCC TCTTGAAGCC GCGCGCCTAT
GTGGTGCTGC GCGAGGGCGT CGCTCCCGAC GGGCTGTTCG AGGCGCTCAA GGACCACGTC
AAGCAGAAGG TCGGGCCGTG GAAATATCCG CGCTGGATCG AAGTCGTGCC AAGCCTGCCG
AAAACCGCCA CCGGCAAGAT CCAGCGCTTC AAGCTGCGCG AGGGTGCGCA GTGA
 
Protein sequence
MHGMTGAGSY NAVSWLLDRN VAEGRGDKLA YTDTVSELSY RALQTQTCRA ANLMRRLGVR 
REERVAMIML DTVEFPVVFL GAIRAGVVPV PLNTLLTAEQ YAYVLADCRA RVLFVSEALY
PVLKDILSGL PDLAHVVVSG GDAHGHLKLA DELAQESDAC ETAATHAEEP AFWLYSSGST
GMPKGVRHLH ANLAATAETY ARQVLGIRED DVVLSAAKLF FAYGLGNSLT FPLSVGATTV
LNSERPTPAV VFKLMQRYNP TIFCGVPTLF AAMLNDSALK SEAAGSRLRI CTSAGEALPE
SVGLAWKARF GADILDGVGS TELLHIFLSN APGDIKYGTS GKPVPGYKVR LVNETGTEVA
DGEVGELLVD APSAGEGYWN QRSKSRATFE GNWTRTGDKY IRDADGRYTF CGRADDMFKV
SGIWVSPFEV ESALITHPAV LEAAVVPDAD FDGLLKPRAY VVLREGVAPD GLFEALKDHV
KQKVGPWKYP RWIEVVPSLP KTATGKIQRF KLREGAQ
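
Both sequences in this record are printed in space-separated blocks of ten characters, which must be joined before the sequence is used elsewhere. A minimal sketch follows, using a hypothetical helper named `join_blocks` and only the first and last lines of the gene sequence as sample input.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between blocks."""
    return "".join("".join(line.split()) for line in lines)

# First and last lines of the gene sequence, copied from this record.
first_gene_line = "ATGCACGGCA TGACCGGGGC AGGGTCGTAC AATGCGGTGA GCTGGTTGCT CGACCGCAAC "
last_gene_line = "AAAACCGCCA CCGGCAAGAT CCAGCGCTTC AAGCTGCGCG AGGGTGCGCA GTGA"

first = join_blocks([first_gene_line])
last = join_blocks([last_gene_line])

print(len(first), first[:3])   # 60 ATG (start codon)
print(len(last), last[-3:])    # 54 TGA (stop codon)
```

Passing all gene sequence lines to the same helper yields one contiguous string whose length can be compared with the Gene Length field.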