Gene Rpal_5072 details


Gene Information

Locus tag: Rpal_5072
Symbol:
ID: 6412766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5459304
End bp: 5461121
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 66%
IMG OID: 642714957
Product: long-chain-acyl-CoA synthetase
Protein accession: YP_001994036
Protein GI: 192293431
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
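
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `cds_length` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (the gene sequence in this record ends in TGA).

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 5459304
END_BP = 5461121
GENE_LENGTH_BP = 1818
PROTEIN_LENGTH_AA = 605


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_length(protein_length: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * (protein_length + (1 if has_stop else 0))


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(cds_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

Under these assumptions, 5461121 - 5459304 + 1 = 1818 bp and 3 × (605 + 1) = 1818 bp, so the reported gene and protein lengths agree.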


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0947195
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGTGC AGGCAGCCGC TGCCATCCGC TCGAACGATG CGGCGCCCGA GCGCCCATCG 
GTCGCCAAGA GCTGGCTGAA CGCGATCGAG ATCACTGCGC GGATCGAACG CGAGCCGGAA
CGTTTGCTGT GCGATACGGT CGCCGAATGG GCGATGCGCA CGCCGAATGC GCACGCATTA
TTGTCGGAGC GCGAACGGTT CAGCTACGCC GAATTGGCGC GACGAATTGA CGGCTACGCA
CGCTGGGCAC TCGCGCAAGG CATCGGCAAA GGCGTCTCCG TCGCGCTGCT GATGCCGAAC
CGCGCCGAAT ACCTGGCGAT CTGGCTTGGT ATCACCAAGG TCGGCGGCGT GGTGGCGCTG
CTCAACACAC AGCTCACCGG CGCCTCGCTG GCCCATTGCA TCGATGTCGC AGCGCCCAGC
CATATCATCG TGGCGAAGGA ATTGAGCGGC GCCTACGACA GCTCAACACA GCATCTGGCA
ACTGCGCCCC GGCTGTGGCT GCACGGCGAC GACGATACCG AGGTGGGGCT GTCGGACGCG
CTGGCGATCG CAAACGACGA TCCACTTACG GCGGACGAAC GTCCCGCTGT GACTGTCGAT
GATACGGCAC TGCTGATCTA CACCTCCGGC ACCACGGGGC TGCCGAAGGC AGCGCGGGTC
AGTCACCGCC GGGTGATGAG CTGGGCCGGC TGGTTCGCCG GCCTCACCGG CGCGACATCC
GACGATCGCA TCTACGATTG TTTGCCGATC TACCACAGCG TCGGCGGCGT GGTCGCGACC
GGCAGCCTGC TGATGGCGGG AGGCTCGGTC GTGATCGCCG AGAAGTTCTC CGCGCGGCGG
TTCTGGGACG ATATCGTCCG CTACGACTGT ACGCTGTTTC AATATATCGG CGAGCTGTGC
CGCTATCTGG TGCAGGCGCC GATCGCGCCG AACGAAACGC GGCATCGTCT GCGGCTCGCC
TGCGGCAACG GACTGCGCGG CGACGTCTGG GAGGCGTTTC AGGCGCGGTT CGCGATCCCG
CGCATTCTCG AATTCTACGC CTCCACCGAG GGCAACTTCT CGCTCTACAA TGTCGAGGGC
GAGCCCGGTG CGATCGGGCG GCTGCCGTCG TTTCTGGCGC ATCGCTTCCC GGCCGCACTG
GTGAAATTCG ATTTCGAAAC GGGACTACCG GTGCGCGACG AACAGGGCCG CTGCATCCGC
TGCGCCCGCG GCGAGGCCGG CGAAGCGATC GGCCGGATCG GCGAGGCCGA GCGCGGCGGC
GGCCGGTTCG AAGGCTACAC CAGCGATGGC GAGAGCGAGC GCAAGATCTT GCGCGACGTG
TTTGCACCGG GTGACGCCTG GTTCCGCACC GGCGACCTGA TGCTGCAGGA CGCCAAGGGC
TTCTTCCGCT TCGTCGACCG GATCGGCGAT ACCTTCCGCT GGAAGGGCGA GAACGTCGCA
GCGAGCGAAG TGGCCGACAT ACTCGCTGTC TGCCCCGGCG TAATCGACGC CAGCGTCTAT
GGCGTCAGCG TTCCGAACCA CGACGGCCGC GCCGGCATGG CTGCGCTGGT GACCGAGGAG
AGCTTCGATC TCGCGGCGCT CCATCGCCAT CTCGCCGAGC GGCTGCCGGC TTACTCACGG
CCGCTCTTTC TTCGGCTCCG GCCGACGCTC GATCTCACCG GCACGTTCAA GCAGGCCAAG
CAGACGCTGA TCACCGAGGG TTTTGACCCG TCGGTCGTCG GCGATCCGCT TTATGTCGCC
GACATCACTA CGGGCGGCTA CGTCACGCTC GACGCCCCCC TCTTCAGCCG CATCGCGCGC
GGCGCATTTA GACTGTGA
 
Protein sequence
MTVQAAAAIR SNDAAPERPS VAKSWLNAIE ITARIEREPE RLLCDTVAEW AMRTPNAHAL 
LSERERFSYA ELARRIDGYA RWALAQGIGK GVSVALLMPN RAEYLAIWLG ITKVGGVVAL
LNTQLTGASL AHCIDVAAPS HIIVAKELSG AYDSSTQHLA TAPRLWLHGD DDTEVGLSDA
LAIANDDPLT ADERPAVTVD DTALLIYTSG TTGLPKAARV SHRRVMSWAG WFAGLTGATS
DDRIYDCLPI YHSVGGVVAT GSLLMAGGSV VIAEKFSARR FWDDIVRYDC TLFQYIGELC
RYLVQAPIAP NETRHRLRLA CGNGLRGDVW EAFQARFAIP RILEFYASTE GNFSLYNVEG
EPGAIGRLPS FLAHRFPAAL VKFDFETGLP VRDEQGRCIR CARGEAGEAI GRIGEAERGG
GRFEGYTSDG ESERKILRDV FAPGDAWFRT GDLMLQDAKG FFRFVDRIGD TFRWKGENVA
ASEVADILAV CPGVIDASVY GVSVPNHDGR AGMAALVTEE SFDLAALHRH LAERLPAYSR
PLFLRLRPTL DLTGTFKQAK QTLITEGFDP SVVGDPLYVA DITTGGYVTL DAPLFSRIAR
GAFRL
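
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and the stop-codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
# Hypothetical helpers for working with the blocked sequence layout.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from the record.
    first_line = "ATGACCGTGC AGGCAGCCGC TGCCATCCGC TCGAACGATG CGGCGCCCGA GCGCCCATCG"
    last_line = "GGCGCATTTA GACTGTGA"
    print(len(clean(first_line)))        # number of bases on the first line
    print(ends_with_stop(clean(last_line)))
```

Applied to the full gene sequence, `len(clean(...))` would be expected to match the reported gene length, and `gc_fraction` could be compared against the listed GC content.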