Gene RPD_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1871 
Symbol 
ID4022353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2094920 
End bp2096494 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content65% 
IMG OID637962064 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_569007 
Protein GI91976348 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.394568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.852293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAG GCGAGCTCAC CCAAAGCCAT ATCACCCAAG GCCATCTCTC TCACGGCATC 
GTCAGTGGCG ACCGCCATCG GTCGTTCGAG GAGGTCAATG CGCGCGCCGC GCAGATCGCC
GGCGGGCTGC AGGGCCTTGG CGTCAAACCC GGCGATTGCG TTTGCGTCTT GATGCGCAAC
GACATCGCCT TTCTTGAATC CGCCTATGCG GTCATGATGC TCGGCGCCTA CATGGTCCCG
GTGAACTGGC ACTTCAAACC AGAAGAGGTT CTCTACGTGC TGGGCGATTC CGGCACGCGC
GTTTTGATCG GCCATGCCGA TCTGCTGCAC CACGCGGCTG GAATCGTGCC CGCAGCCGTC
ACGATGCTGA GCGTGCCGAC GCCGCCGGAA ATCCTCGCGA GCTACAGCAT CGATCCCGAC
CATCGCTCGG CTCCCGCCGG CGCGATCGAT CTCGACGGCT GGCTCGCGCA GCAGAGGCCA
TATGACGGCC CGGCGCTGCC GCAGCCGCAG AACATGATCT ACACCTCGGG CACCACCGGC
CATCCGAAGG GCGTCAAACG GTTCGCGCCG ACGCCGCAAC AGTCGGCGAA CGCCGAAGCC
ATGCGCGCAG CGATCTACGG CCTGAAGCCG GGCGTCCGCG CGCTGTGCCC GGGGCCGTTG
TATCACTCCG CGCCGAATTC GTTCGGCATT CGCGCCGGCC GGCTCGGCGG AGTGCTGGCG
CTGATGCCGC GGTTCGAACC CGAAGCGCTG CTGCAACTGA TCGAGCAGCA CCGGATCGAC
ACCGTGTTCA TGGTGCCGAC GATGTTCATC CGGCTGATGA AGCTGCCGGA AGCGGTGCGC
AACAAGTACG ACGTGTCGTC GCTGCGCCAT GTCATCCATG CCGCGGCGCC GTGTCCCCCG
GACGTCAAGC GCGCGATGAT CGACTGGTGG GGGCCGGTGA TCTACGAATT CTACGGCTCG
ACCGAGAGTG GCGCGGTGAC GTTCGCGAGC TCCGAGGATG CGCTGAAGAA GCCCGGCACC
GTCGGCAAGA TCGCTGCCGG CGCCGAGCTT GTGTTCGTCG ACGACGACAA TAACGAGGTG
CCGCAGAGCG AGGTCGGCGA GATCTTCTCG CGGATCCCCG GCAATCCGGA TTTCACCTAT
CACAACAAGC CGGAGAAACG CGCGGAGATC GACCGGGGCG GCTTCATCAC CTCGGGCGAC
ATGGGCTATC TCGACGAGGA CGGCTACGTC TTCATCTGCG ACCGCAAGCG CGACATGGTG
ATCTCCGGCG GCGTCAACAT CTATCCAGCC GAGATCGAGG CGGCGCTGCA CGCGATCCCG
GGTGTGCACG ACTGTGCGGT GTTCGGGATT CCCGACGCCG AATTCGGCGA GGCGCTGATG
GCGATGCTCG AGCCGCAGCC TGGCGTCACA CTGGAGCAGA GCAATATCCG CGAGCAGCTC
AGGCTGTCGC TCGCCGGCTA TAAGGTTCCG AAGCACATCG AGATCATGGC GCAGCTGCCG
CGCGAGGACT CCGGCAAGAT CTTCAAGCGC CGGCTACGCG ATCCGTATTG GGCCAAAGCT
GGTCGGGTGA TTTAG
 
Protein sequence
MSQGELTQSH ITQGHLSHGI VSGDRHRSFE EVNARAAQIA GGLQGLGVKP GDCVCVLMRN 
DIAFLESAYA VMMLGAYMVP VNWHFKPEEV LYVLGDSGTR VLIGHADLLH HAAGIVPAAV
TMLSVPTPPE ILASYSIDPD HRSAPAGAID LDGWLAQQRP YDGPALPQPQ NMIYTSGTTG
HPKGVKRFAP TPQQSANAEA MRAAIYGLKP GVRALCPGPL YHSAPNSFGI RAGRLGGVLA
LMPRFEPEAL LQLIEQHRID TVFMVPTMFI RLMKLPEAVR NKYDVSSLRH VIHAAAPCPP
DVKRAMIDWW GPVIYEFYGS TESGAVTFAS SEDALKKPGT VGKIAAGAEL VFVDDDNNEV
PQSEVGEIFS RIPGNPDFTY HNKPEKRAEI DRGGFITSGD MGYLDEDGYV FICDRKRDMV
ISGGVNIYPA EIEAALHAIP GVHDCAVFGI PDAEFGEALM AMLEPQPGVT LEQSNIREQL
RLSLAGYKVP KHIEIMAQLP REDSGKIFKR RLRDPYWAKA GRVI