Gene Rpal_4237 details


Gene Information

Locus tag: Rpal_4237
Symbol:
ID: 6411921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4549203
End bp: 4550861
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 66%
IMG OID: 642714119
Product: dicarboxylate/CoA ligase PimA
Protein accession: YP_001993208
Protein GI: 192292603
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR03205] dicarboxylate--CoA ligase PimA
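
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4549203, 4550861)
print(length_bp)                  # 1659
print(protein_length(length_bp))  # 552
```

Both values agree with the Gene Length and Protein Length fields of this record.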


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCATC CCGGTGAGCA GTACTATCCA CCCGGCGTTC GCTGGGATGC GGTGATCGCG 
AGGGGGACAT TGCCGGAGCT GCTGGCGCAG GCCGCGAAGG AGTACAGCTC GCGGCCGGCG
CTGGAGTTTC GTGATCGGCA GATCACCTAC ACCAAGCTCG AAGCGATGGC CGAGACCGCG
GCGGCCGCGC TGCTCGCAGC AGGCTATGGC CGCGACTCCT CGGTCGCGCT GTATCTCGGC
AACACCCCAG ACCATCCGAT CAATTTCTTC GGCGCGCTGA AGGCCGGCGC CCGCGTGGTG
CATCTGTCGC CGCTCGACGG CGAGCGGGCG CTATCGCACA AGCTCAGCGA CTCCGGCGCG
CGGGTGCTGA TCACAACCGA TTCCGCCGCG CTGCTGCCGA TGGCCGAGAG GTTTCTTGCC
AAGGGCCTGC TCGACCGCCT GATCGTGTGC AGCGATGCCA GCTGGGGCGA GTCCGCGACG
CCGCTGGCGC CGCTGCCCAG CGATCCGCGG GTGATTACTT ACGCCGACTT CATCAAGGAC
GCGTCCAAAC CCGCGGCCTG GCCGGCGATT TCGCCGGACG ATATCGCGCT GTTGCAATAC
ACCGGCGGCA CCACCGGGCT GCCGAAGGGC GCGATGCTGA CCCATTCCAA CCTGACCTCG
GCGGTGTCGA TCTACGACGT CTGGGGTCTG GTGCGCGCCA ATAGCGGCGG CATTCATCGC
GTGATCTGCG TGCTGCCGCT GTTTCACATC TACGCGCTCA CCGTGATCCT GCTGCGCTGC
CTGAAGCAGG GCGACCTGAT CTCGCTGCAT CAGCGCTTCG ACGTCGCCGC GGTGTTCCGT
GACATCGAGG AAAAGCGCGC CACCGTGTTC CCCGGCGTGC CGACGATGTG GATCGCGCTC
GCCAACGATC CGTCACTGGA GAAGCGCGAT CTGTCGTCGC TGACGATGGC CGGCTCCGGC
GGCGCGCCGC TGCCGGTCGA GGTTGCGCGG CTGTTCGAGC GCAAAACCAA TCTGAAGCTG
AAGAGCGGTT GGGGCATGAC CGAGACCTGC TCGCCCGGCA CCGGCCATCC GCCGGAAGGG
CCGGACAAGC CGGGCTCGAT CGGCCTGATG CTGCCGGGCA TCGAACTGGA CGTCGTCGCG
CTCGATGATC CGAAGCGCGT GCTGCCGCCG GGGGAAGTCG GCGAACTCCG CGTTCGCGGT
CCCAATGTCA CCAAGGGCTA CTGGAATCGG CCGGAGGAGT CCGCTCACAG TTTTGTCGGC
GATCGTTTTC TGACCGGCGA CATCGGCTAT ATGGACCAGG ATGGCTACTT CTTCCTGGTC
GACCGCAAGA AGGACATGAT CATCTCCGGC GGCTTCAACG TCTATCCGCA GATGATCGAA
CAGGCGATCT ATGAGCACCC GGCGGTGCAA GAAGTGATCG TGATCGGCGT GCCGGACGAT
TACCGCGGCG AGGCGGCGAA GGCGTTCGTC AAGCTGCGCG ATGGCGCCAA GCCGTTCAGC
ATCGACGAGC TGCGCGCCTT CCTCACCGGC AAGCTCGGCA AGCACGAACT GCCGACCGCG
GTGGAGTTTC TCGACGAACT GCCGCGCACC ACGGTGGGCA AACTGTCCCG CCACGAACTG
CGCAGCCAGC AGACCACCAC CAAGACACAG ACCAAGTAA
 
Protein sequence
MSHPGEQYYP PGVRWDAVIA RGTLPELLAQ AAKEYSSRPA LEFRDRQITY TKLEAMAETA 
AAALLAAGYG RDSSVALYLG NTPDHPINFF GALKAGARVV HLSPLDGERA LSHKLSDSGA
RVLITTDSAA LLPMAERFLA KGLLDRLIVC SDASWGESAT PLAPLPSDPR VITYADFIKD
ASKPAAWPAI SPDDIALLQY TGGTTGLPKG AMLTHSNLTS AVSIYDVWGL VRANSGGIHR
VICVLPLFHI YALTVILLRC LKQGDLISLH QRFDVAAVFR DIEEKRATVF PGVPTMWIAL
ANDPSLEKRD LSSLTMAGSG GAPLPVEVAR LFERKTNLKL KSGWGMTETC SPGTGHPPEG
PDKPGSIGLM LPGIELDVVA LDDPKRVLPP GEVGELRVRG PNVTKGYWNR PEESAHSFVG
DRFLTGDIGY MDQDGYFFLV DRKKDMIISG GFNVYPQMIE QAIYEHPAVQ EVIVIGVPDD
YRGEAAKAFV KLRDGAKPFS IDELRAFLTG KLGKHELPTA VEFLDELPRT TVGKLSRHEL
RSQQTTTKTQ TK
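
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below uses only the Python standard library and assumes the sequences are pasted in the 10-letter blocks shown above. The codon table is built from the standard amino-acid assignments, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence and its final line.
print(translate("ATGTCCCATC CCGGTGAGCA GTACTATCCA"))             # MSHPGEQYYP
print(translate("CGCAGCCAGC AGACCACCAC CAAGACACAG ACCAAGTAA"))  # RSQQTTTKTQTK
```

The two printed fragments match the first block and the last line of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.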