Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1000 |
Symbol | |
ID | 3909297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1144232 |
End bp | 1146100 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882893 |
Product | long-chain-acyl-CoA synthetase |
Protein accession | YP_484621 |
Protein GI | 86748125 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.832439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCC AACCACGATC CGTGACGGCG GACGCGACGC AGGCCCCGCC TGCGCGGGGG GCTGCGACCA CTGCCCCGTC CGTCACCAAA TCTCCTGTCA CCAAGCCTCC TTCCGTCACC AAGAGCTGGC TGCGGGCGAT CGAGATCACC TCGCGGATCG AGACCGAACC GCGCCGGCTG CTGGCCTCCG TGATCGACGA ATGGGCCGCC GCCGCGCCGC AGCGGGACGC GATCGTGTCG GATCGCGAAT CCTTCAGCTA CGCGGCGCTG GCCGATCGCA TCGACCGCTA CGCGCGCTGG GCGCTGACGA ACGGGATCGG GATCGGCGAC GTGGTGTGTG TGCTGATGCC GAACCGGCCG GACTATCTGG CGGCGTGGCT CGGCATCACC AAGGTCGGCG GCGTCGCGGC GCTGATCAAC ACCCAGCTCG TCGGCGCCTC GCTGGCGCAT TGCATCGAGG TCGCGCAGCC GAAACACGTC ATCGTCGCCG ACGAACTGGC GGAGGCCTTC GCGAGCGCCC GCCCACATCT CGCGCAGGCC CCACGCGTAT GGACGCATGG CGGCGCTGGC GCCGATTCGA TCGACCAGGC GCTAGCCGCG CTCGACGCCG GCCCGCTGGC GCCGCACGAA CGGCGCGAGG TCTCGATCGA GCATCTGGCG CTGCTGATCT ACACCTCCGG CACCACCGGG CTGCCGAAGG CCGCGCGAGT CACGCATCGC CGGGTGATGA GCTGGGCCGG CTGGTTCGCC GGCCTCACCG ACGCCGGGCC CGGCGACCGG ATGTACAATT GCCTGCCGAT CTATCACAGC GTCGGCGGCG TGGTGGCGCC CGGCAGCCTG TTGATGGCCG GCGGCTCGGT GGTGATCGCC GAAAAGTTTT CCGCGAGCCG GTTCTGGGAC GACATCGCCC GCTGGGATTG CACGCTGTTT CAATATATCG GCGAGCTCTG CCGCTATCTG CTGCAGGCGC CGCCGCGCGC GCGCGACACG CAGCACCGGC TGCGGCTGGC TTGCGGCAAC GGGCTGCGCG GCGACGTCTG GGAGGCGTTC CAGGCGCGCT TCGCGATTCC GCGCATCCTC GAATTCTACG CCTCGACCGA AGGCAATTTC TCGCTCTACA ATGTCGAGGG CAGGCCCGGC GCGATCGGCC GCGTGCCGTC GTTCCTGGCG CATCGCTTTC CGGCCGCGAT CGTGAAGTTC GACCTCGACA GCGGCCTTCC GCTGCGCGGC GACGACGGGC TGTGCGTCCG CTGCGCGCGC AACGAGCCCG GCGAGGCGAT CGGCCGGATC GGCGACGCCG CCGATCGCGG CGGCCGGTTC GAGGGCTACA CCAGCGATGC CGCGAGCGAC ACCAAGGTGC TGCGCGACGT GTTCGCCAGG GGCGACGCCT GGTATCGCAC CGGCGACCTG ATGCGGCTCG ACGATCAGGG CTTCTTCCAT TTCGTCGACC GCATCGGCGA CACCTTCCGC TGGAAGGGCG AGAACGTCGC GGCGAGCGAA GTCGCCGAGG CGATCGCCGC CTGCCCAGGC GTGACCGACG TCAGCGTCTA TGGCGTCAGC GTGCCGCAGC ACGACGGCCG CGCCGGCATG GCCGCGCTGG TGGTCGACGC GCGGTTCGAT ATCGACGCGC TGCATCGCCA TCTGGCCGAT CGGCTGCCGT CCTACGCGCG CCCGCTGTTC CTGCGGCTGC GCCCGGCGCT GGAAATCACC GGCACGTTCA AGCAGAACAA GCAGGATCTG ATCCGCGACG GATTCGATCC CGGCGTGGTG AGCGATCCGC TCTATGTCGG CGGGGCCCAG GCCGCGCGCT ACGTCGCGCT CGACGAGGAC CTGCACCGCC GCATCGCCGC AGGCGAGCTG CGGCTGTGA
|
Protein sequence | MNIQPRSVTA DATQAPPARG AATTAPSVTK SPVTKPPSVT KSWLRAIEIT SRIETEPRRL LASVIDEWAA AAPQRDAIVS DRESFSYAAL ADRIDRYARW ALTNGIGIGD VVCVLMPNRP DYLAAWLGIT KVGGVAALIN TQLVGASLAH CIEVAQPKHV IVADELAEAF ASARPHLAQA PRVWTHGGAG ADSIDQALAA LDAGPLAPHE RREVSIEHLA LLIYTSGTTG LPKAARVTHR RVMSWAGWFA GLTDAGPGDR MYNCLPIYHS VGGVVAPGSL LMAGGSVVIA EKFSASRFWD DIARWDCTLF QYIGELCRYL LQAPPRARDT QHRLRLACGN GLRGDVWEAF QARFAIPRIL EFYASTEGNF SLYNVEGRPG AIGRVPSFLA HRFPAAIVKF DLDSGLPLRG DDGLCVRCAR NEPGEAIGRI GDAADRGGRF EGYTSDAASD TKVLRDVFAR GDAWYRTGDL MRLDDQGFFH FVDRIGDTFR WKGENVAASE VAEAIAACPG VTDVSVYGVS VPQHDGRAGM AALVVDARFD IDALHRHLAD RLPSYARPLF LRLRPALEIT GTFKQNKQDL IRDGFDPGVV SDPLYVGGAQ AARYVALDED LHRRIAAGEL RL
|
| |