Gene RPC_4074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4074 
Symbol 
ID3973355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4527254 
End bp4528939 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content62% 
IMG OID637927178 
ProductLong-chain-fatty-acid--CoA ligase 
Protein accessionYP_533919 
Protein GI90425549 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCA TCTGGCTCAA GCATTATCCG CCCGGCGTGC CCGCGGACAT CGACGTCAAC 
CAATACCCGT CGCTGGTCGA CCTGCTGGAA GAGAGCTTCG CGAAATTTCG CGACCGCAAA
GCCTTCATCT GTATGGACAA GGCGATCAGT TACGGCCAGC TCGACGAGAT GTCGACGGCG
CTCGGCGCTT ATCTGCAAAG CAAGGGGTTG AAGCAGGGCG CGCGCGTCGC GGTGATGATG
CCGAACGTGC TGCAATATCC GATCGCCACC GCCGCGGTGC TGCGCGCCGG CTTTGCCGTG
GTCAACGTCA ATCCGCTCTA CACCCCGCGC GAGCTCGAAC ATCAGCTGAA GGATTCCGGC
GCCGAGGCGA TCATCGTGTT GGAGAACTTC GCCTCCACCG TCGAGCATGT GATCGCGCGC
ACCGGCGTGA AGCACGTCAT CGTTGGCGCC ATGGGCGACC TGCTCGGCTT CAAAGGCGTC
ATCGTCAACT TCGTGGTGCG CCGGGTCAAG AAAATGGTGC CGGCGTTCTC GCTGCCGAGC
AAGGTGGCGT TCAACGACGC GCTGGCCGCA GGGCGCAGCC TCAAATTTGC CAAGCCGACA
ATCGGCCCGG ACGACGTCGC CTTCCTGCAA TATACCGGCG GCACTACCGG CGTGTCGAAG
GGCGCCACGC TGCTGCATCG CAACGTGGTC GCCAACGTGC TGCAGAACGA CGCCTGGCTG
CAGCCGGCGC TCACCAAACC GCCGCATGTC GATCAGTTGT TCATCGTCTG CGCGCTGCCG
CTGTATCATA TCTTCGCGCT CACCGTCTGC TTCCTGCTGG CGATGCGCGC CGGCGGCGTC
AACCTATTGA TCCCCAATCC GCGCGACATG AAGAGCTTCA TCAAGGAATT GAAGAAGTAT
CAGGTCAACA GCTTCCCCGC GGTCAACACG CTGTACAACG GCCTGCTGCA CGCCGAGGGG
TTCGACCAGG TGGACTTCTC CAAGCTGAAA ATTTCCAACG GCGGCGGCAT GGCGGTGCAA
CGCCCGGTCG CCGAGCAGTG GAGCAAGCTC ACCGGCTGCG GCATCGCCGA AGGCTACGGG
CTGTCGGAAA CCGCGCCGGT CCTGACCTGC AACCCGGCGA CTATCGACAG CTTCACCGGA
TCGATCGGCC TGCCGCTGCC CTCGACCTAT CTGTCGATCC GCGACGACGC CGGCAACGAA
TTGTCGCTCG GCCAGATCGG CGAGATCTGC GCCAAGGGTC CGCAAGTGAT GGCCGGTTAT
TGGAACATGC CGGAGGAAAC CGCGCTGGTG ATGACCGAGG ACGGTTACTT CCGCACCGGC
GACATCGGGG TGATGAGTGA AGACGGCTCC ACCAAGATCG TCGACCGCAA GAAGGACATG
ATCCTGGTGT CGGGCTTCAA CGTCTATCCG AACGAGGTCG AGGAAGTCAT CGCCACCCAT
CCCGGCGTGC TGGAATGCGC CGTGGTCGGC GTCCATGATT CGCGCAGCAA CGAATCGGTG
AAAGCCTTCG TGGTGAGGAA AGATCCCGAG GTGACCGCCG AAGAGATCAT CAAGTTCTGC
CATACCCAGC TGACCAACTA CAAAGTGCCG AAACAGATCG AATTCCGCAC CGAACTGCCG
AAGACCAATG TCGGCAAGAT CCTGCGCCGG CAATTGCGCG ACGAGAAGAA ACAGGCCGCG
GCGTAA
 
Protein sequence
MERIWLKHYP PGVPADIDVN QYPSLVDLLE ESFAKFRDRK AFICMDKAIS YGQLDEMSTA 
LGAYLQSKGL KQGARVAVMM PNVLQYPIAT AAVLRAGFAV VNVNPLYTPR ELEHQLKDSG
AEAIIVLENF ASTVEHVIAR TGVKHVIVGA MGDLLGFKGV IVNFVVRRVK KMVPAFSLPS
KVAFNDALAA GRSLKFAKPT IGPDDVAFLQ YTGGTTGVSK GATLLHRNVV ANVLQNDAWL
QPALTKPPHV DQLFIVCALP LYHIFALTVC FLLAMRAGGV NLLIPNPRDM KSFIKELKKY
QVNSFPAVNT LYNGLLHAEG FDQVDFSKLK ISNGGGMAVQ RPVAEQWSKL TGCGIAEGYG
LSETAPVLTC NPATIDSFTG SIGLPLPSTY LSIRDDAGNE LSLGQIGEIC AKGPQVMAGY
WNMPEETALV MTEDGYFRTG DIGVMSEDGS TKIVDRKKDM ILVSGFNVYP NEVEEVIATH
PGVLECAVVG VHDSRSNESV KAFVVRKDPE VTAEEIIKFC HTQLTNYKVP KQIEFRTELP
KTNVGKILRR QLRDEKKQAA A