Gene Information |
Locus tag | Rpal_5072 |
Symbol | |
ID | 6412766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5459304 |
End bp | 5461121 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714957 |
Product | long-chain-acyl-CoA synthetase |
Protein accession | YP_001994036 |
Protein GI | 192293431 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
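
The coordinate and length fields above can be cross-checked with simple arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record:

```python
# Values copied from the Gene Information table above.
start_bp = 5459304
end_bp = 5461121
gene_length_bp = 1818
protein_length_aa = 605

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. Gene Length
print(gene_length_bp % 3 == 0)                   # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks agree with the table: 1818 bp is 606 codons, that is 605 residues plus one stop codon.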
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0947195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTGC AGGCAGCCGC TGCCATCCGC TCGAACGATG CGGCGCCCGA GCGCCCATCG GTCGCCAAGA GCTGGCTGAA CGCGATCGAG ATCACTGCGC GGATCGAACG CGAGCCGGAA CGTTTGCTGT GCGATACGGT CGCCGAATGG GCGATGCGCA CGCCGAATGC GCACGCATTA TTGTCGGAGC GCGAACGGTT CAGCTACGCC GAATTGGCGC GACGAATTGA CGGCTACGCA CGCTGGGCAC TCGCGCAAGG CATCGGCAAA GGCGTCTCCG TCGCGCTGCT GATGCCGAAC CGCGCCGAAT ACCTGGCGAT CTGGCTTGGT ATCACCAAGG TCGGCGGCGT GGTGGCGCTG CTCAACACAC AGCTCACCGG CGCCTCGCTG GCCCATTGCA TCGATGTCGC AGCGCCCAGC CATATCATCG TGGCGAAGGA ATTGAGCGGC GCCTACGACA GCTCAACACA GCATCTGGCA ACTGCGCCCC GGCTGTGGCT GCACGGCGAC GACGATACCG AGGTGGGGCT GTCGGACGCG CTGGCGATCG CAAACGACGA TCCACTTACG GCGGACGAAC GTCCCGCTGT GACTGTCGAT GATACGGCAC TGCTGATCTA CACCTCCGGC ACCACGGGGC TGCCGAAGGC AGCGCGGGTC AGTCACCGCC GGGTGATGAG CTGGGCCGGC TGGTTCGCCG GCCTCACCGG CGCGACATCC GACGATCGCA TCTACGATTG TTTGCCGATC TACCACAGCG TCGGCGGCGT GGTCGCGACC GGCAGCCTGC TGATGGCGGG AGGCTCGGTC GTGATCGCCG AGAAGTTCTC CGCGCGGCGG TTCTGGGACG ATATCGTCCG CTACGACTGT ACGCTGTTTC AATATATCGG CGAGCTGTGC CGCTATCTGG TGCAGGCGCC GATCGCGCCG AACGAAACGC GGCATCGTCT GCGGCTCGCC TGCGGCAACG GACTGCGCGG CGACGTCTGG GAGGCGTTTC AGGCGCGGTT CGCGATCCCG CGCATTCTCG AATTCTACGC CTCCACCGAG GGCAACTTCT CGCTCTACAA TGTCGAGGGC GAGCCCGGTG CGATCGGGCG GCTGCCGTCG TTTCTGGCGC ATCGCTTCCC GGCCGCACTG GTGAAATTCG ATTTCGAAAC GGGACTACCG GTGCGCGACG AACAGGGCCG CTGCATCCGC TGCGCCCGCG GCGAGGCCGG CGAAGCGATC GGCCGGATCG GCGAGGCCGA GCGCGGCGGC GGCCGGTTCG AAGGCTACAC CAGCGATGGC GAGAGCGAGC GCAAGATCTT GCGCGACGTG TTTGCACCGG GTGACGCCTG GTTCCGCACC GGCGACCTGA TGCTGCAGGA CGCCAAGGGC TTCTTCCGCT TCGTCGACCG GATCGGCGAT ACCTTCCGCT GGAAGGGCGA GAACGTCGCA GCGAGCGAAG TGGCCGACAT ACTCGCTGTC TGCCCCGGCG TAATCGACGC CAGCGTCTAT GGCGTCAGCG TTCCGAACCA CGACGGCCGC GCCGGCATGG CTGCGCTGGT GACCGAGGAG AGCTTCGATC TCGCGGCGCT CCATCGCCAT CTCGCCGAGC GGCTGCCGGC TTACTCACGG CCGCTCTTTC TTCGGCTCCG GCCGACGCTC GATCTCACCG GCACGTTCAA GCAGGCCAAG CAGACGCTGA TCACCGAGGG TTTTGACCCG TCGGTCGTCG GCGATCCGCT TTATGTCGCC GACATCACTA CGGGCGGCTA CGTCACGCTC GACGCCCCCC TCTTCAGCCG CATCGCGCGC 
GGCGCATTTA GACTGTGA
|
Protein sequence | MTVQAAAAIR SNDAAPERPS VAKSWLNAIE ITARIEREPE RLLCDTVAEW AMRTPNAHAL LSERERFSYA ELARRIDGYA RWALAQGIGK GVSVALLMPN RAEYLAIWLG ITKVGGVVAL LNTQLTGASL AHCIDVAAPS HIIVAKELSG AYDSSTQHLA TAPRLWLHGD DDTEVGLSDA LAIANDDPLT ADERPAVTVD DTALLIYTSG TTGLPKAARV SHRRVMSWAG WFAGLTGATS DDRIYDCLPI YHSVGGVVAT GSLLMAGGSV VIAEKFSARR FWDDIVRYDC TLFQYIGELC RYLVQAPIAP NETRHRLRLA CGNGLRGDVW EAFQARFAIP RILEFYASTE GNFSLYNVEG EPGAIGRLPS FLAHRFPAAL VKFDFETGLP VRDEQGRCIR CARGEAGEAI GRIGEAERGG GRFEGYTSDG ESERKILRDV FAPGDAWFRT GDLMLQDAKG FFRFVDRIGD TFRWKGENVA ASEVADILAV CPGVIDASVY GVSVPNHDGR AGMAALVTEE SFDLAALHRH LAERLPAYSR PLFLRLRPTL DLTGTFKQAK QTLITEGFDP SVVGDPLYVA DITTGGYVTL DAPLFSRIAR GAFRL
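
Both sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be stripped before any length or composition check. A minimal sketch in Python; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the last two blocks of the gene sequence rather than the full 1818 bp:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks between the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last two display blocks of the gene sequence above.
tail = clean_sequence("GGCGCATTTA GACTGTGA")

print(len(tail))             # 18
print(gc_fraction(tail))     # 0.5
print(tail.endswith("TGA"))  # True: the sequence ends in a TGA stop codon
```

Applying the same two helpers to the complete gene sequence would give a value to compare with the reported Gene Length and GC content fields.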