Gene RPD_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1103 
Symbol 
ID4021579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1252825 
End bp1254717 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content66% 
IMG OID637961295 
Productlong-chain-acyl-CoA synthetase 
Protein accessionYP_568242 
Protein GI91975583 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCAA TCGTCAGTCG CATTCGAGTA AGTTGCACCG ACAATTCTCA CGATCTTGTC 
GGGCTGGCGT TCATGAACAT CCAAGCAAGA CCTGCGACGG ACGACGCGCT GCAGACGCGT
CGCGCGCAAC CATCGGTGGC CAAAAGCTGG TTGAAGGCGA TCGAGATCAC GGCGCGGGTC
GAGCAGGAGC CGCGACGACT GCTCGCAACC GTCGTCGACG AATGGGCAGC CGTCGCACCG
AACTCCCCCG CGATCGTGTC GGATCGCGAC TCCTACAGCT ACGCGGAGCT CGCGCGCCGC
ATCAACCGCT ATGCGCGCTG GGCGCTGGAG AACGGGGTCG GCATCGGCGA CGTCGTCTGC
CTGCTGATGT CGAACCGGCC GGACTACGTC GCAGCCTGGC TCGGCATCAC CAAGGTCGGC
GGCGTGGTCG CGCTGATCAA CACCCAGCTC GTCGGCGCAT CCCTGGCGCA TTGCATCGAC
ATCGCGCAGC CCGGACACAT CATCGTCGGC GAGGAATTCG TCGACGCGTG GGAGAGCGCC
CGCGCGCATC TCGGAGCGGC TCCGCGAATC TGGCTCCATG GCGAGACTTC TGGAGACAAG
GCGCTGGACC AGGCGCTCGC GGCGCTCGAC AGCGCTGCGC TGGCGCCGCA GGAGCAGCGC
GACGTCGGCA TCGATGATCT GGCGCTGCTG ATCTACACAT CCGGAACCAC CGGCCTGCCC
AAGGCGGCGC GCGTCACCCA TCGCCGGGTG ATGGGCTGGG CCGGCTGGTT CGCGGGGTTG
ACCGACGCCG CACCGGACGA TCGGATGTAC AACTGCCTGC CGATCTATCA CAGCGTCGGC
GGCGTGGTCG CGACCGGCAG CATGCTGATG GCGGGCGGCT CGGTGGTGAT CGCCGAGAAA
TTCTCCGCGA GCCGGTTCTG GGACGACATC ATCCGCTGGG ACTGCACGCT GTTTCAATAT
ATCGGCGAAC TCTGCCGCTA TCTGCTGCAG GCGCCGCCAT CCGACCGCGA CACCCGGCAT
CGGCTGCGGC TGTGCTGCGG CAATGGATTG CGCGGCGAGA TCTGGGAGCC GTTCCAGGCG
CGCTTTGCGA TCCCCCGCAT CCTCGAATTC TACGCGTCGA CCGAGGGCAA TTTCTCGCTC
TACAATGTCG AGGGCAAGCC CGGCGCGATC GGGCGCATTC CGTCATTTCT GGCGCATCGC
TTTCCCGCGG CGATCGTCAA ATTCGACGTC GAGACCGGCG GTCCGCTGCG CGACGAGAAC
GGGCTGTGCA TCCGTTGCGC CCGCGGCGAA ACCGGAGAAG CGATCGGCCG GATCGGCGAG
GCGCGCGACA GCGGCGGCCG GTTCGAAGGC TACACCAACG ATTCCGAAAC CGAGAAGAAG
GTGCTGCGCG ACGTGTTCGC CGCAGGCGAC GCGTGGTTTC GCACCGGCGA CCTGATGAGG
CTCGACGACA AGGGCTTCTT CCATTTCGTC GACCGGATCG GCGACACCTT CCGCTGGAAG
GGCGAGAACG TCGCGGCGAG CGAGGTCGCC GAAACGATCG CCGCCTGCCC CGGCGTGATC
GACGCCAGCG TCTATGGCGT GTCGGTGCCC CACACGGACG GCCGCGCCGG CATGGCGGCG
CTGGTCGTCG ACGATCGCTT CGACCTCGCG GCGCTGCATC GCCATCTCGC CGAACGGTTG
CCGGCCTATG CGCGCCCGGT CTTCATCCGG ATCCAGGCCG CACTGCAGAT CACCGGCACC
TTCAAGCAGA ACAAGCAGGA TTTGATCCGC GACGGCTTCG ATCCCGTCGT TGTGAGCGAT
CCGCTGTATC TCGGCGATGC GACCGCAGCC GGCTACGTCG TGCTCGATGA GCCTCTGCAT
CGCAGGATTG CGGCCGGCAC ACTGCGGCTT TGA
 
Protein sequence
MQSIVSRIRV SCTDNSHDLV GLAFMNIQAR PATDDALQTR RAQPSVAKSW LKAIEITARV 
EQEPRRLLAT VVDEWAAVAP NSPAIVSDRD SYSYAELARR INRYARWALE NGVGIGDVVC
LLMSNRPDYV AAWLGITKVG GVVALINTQL VGASLAHCID IAQPGHIIVG EEFVDAWESA
RAHLGAAPRI WLHGETSGDK ALDQALAALD SAALAPQEQR DVGIDDLALL IYTSGTTGLP
KAARVTHRRV MGWAGWFAGL TDAAPDDRMY NCLPIYHSVG GVVATGSMLM AGGSVVIAEK
FSASRFWDDI IRWDCTLFQY IGELCRYLLQ APPSDRDTRH RLRLCCGNGL RGEIWEPFQA
RFAIPRILEF YASTEGNFSL YNVEGKPGAI GRIPSFLAHR FPAAIVKFDV ETGGPLRDEN
GLCIRCARGE TGEAIGRIGE ARDSGGRFEG YTNDSETEKK VLRDVFAAGD AWFRTGDLMR
LDDKGFFHFV DRIGDTFRWK GENVAASEVA ETIAACPGVI DASVYGVSVP HTDGRAGMAA
LVVDDRFDLA ALHRHLAERL PAYARPVFIR IQAALQITGT FKQNKQDLIR DGFDPVVVSD
PLYLGDATAA GYVVLDEPLH RRIAAGTLRL