Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1747 |
Symbol | |
ID | 3909734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1995904 |
End bp | 1997589 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883641 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_485366 |
Protein GI | 86748870 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR03205] dicarboxylate--CoA ligase PimA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0225164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACC CCGGTGAGCA GTACTATCCG CCCGGCGTTC GCTGGGATGC GGAGATCGCG AAGGGCACGT TGCCCGATCT CCTGGCGAAG GCGGCGACCG ACTACGCGGC GCGGCCGGCG CTGGAATTCC GCGACGGCCA GATAAATTAC GCCGGGCTGC AGGAACGCGC CGACATCGCC GCCGCAGCCC TGCTGCGCGC CGGCTACGGC CCCGGCGCTT CGGTCGCTCT GTTTCTCGGC AACACGCCGG ATCACCCGAT CAACTTCTTC GGCGCGCTGA AGGCCGGCGC CCGCGTTGTG CATCTGTCGC CGCTCGACGG CGAGCGGGCG CTGTCGCACA AGCTCAGCGA TTCCGGCGCG CGCGTGCTGA TCACCACCGA TTCCGCAGCA TTGCTGCCGA TGGCGCTGAG GTTCCTCGAC AAGGGTCTGC TCGATCGCCT GATCGTCTGC GCCGACTCAG ATTGGGGCGC ATCGGCCACG CCGCTCGCCC CATTGCCGGA CGATCCGCGC GTGATCCGCT ACGCCGACTT CATCGAAGGC CCTGCGAAGC CCGCCGCATG GCCGCAGATC TCGCCCGACG ACATCGCGCT CCTGCAATAC ACCGGCGGCA CCACCGGCCT GCCCAAGGGC GCGATGCTGA CTCACGCCAA TCTCACCTCG GCGGTGTCGA TCTACGACGT CTGGGGCCTG GTGCGCGCGG GCGAGGGCGG CGCGCATCGC GTGATCTGCG TGCTGCCGCT GTTTCACATC TACGCGCTGA CCGTGATCCT GCTGCGCTGT CTGAAGCAGG GCGACCTGAT CTCGCTGCAT CAGCGCTTCG ACGTCGCCGC GGTGTTCCGC GACATCGAGG AGAAGCGCGC CACGGTGTTC CCCGGCGTGC CGACGATGTG GATCGCGCTC GCCAACGATC CGTCGCTGGA GAGCCGCGAT CTGTCGTCGC TGACGATGGC CGGCTCCGGC GGCGCGCCGC TGCCGGTCGA GGTCGCGCGA TTGTTCGAGC GCAAGACCAA TCTCAAACTC AAGAGCGGCT GGGGCATGAC CGAGACCTGC TCGCCCGGCA CCGGCCATCC GCCGGACGGG CCCGACAAGC CGGGCTCGAT CGGGCTGATG CTGCCGGGGA TCGAACTCGA CGTCGTCGCG CTCGACGATC CGAAGAAGGT TCTGCCGCCC GGCGAAGTCG GCGAGATCCG GGTCCGCGGT CCCAATGTCA CCCAGGGCTA CTGGAACCGG GCGCAGGAAA CCGCGGAGTC GTTCGTCGGT GACCGTTTTC TCACCGGCGA TATCGGCTAT ATGGACTCCG ACGGCTATTT CTTCCTGGTC GACCGCAAGA AGGACATGAT CATTTCGGGA GGATTCAACG TCTACCCGCA GATGATCGAA CAAGCGATCT ATGAACATCC GGCGGTGCAG GAAGTGATCG TGATTGGCAT CCCCGACGAT TATCGCGGCG AGGCGGCGAA GGCGTTCGTC AAGCTGCGCG ACGGCGCGAA GCCTTTCAGC GTCGAGGAGC TGCGCGATTT CCTCAAGGGC AAACTCGGCA AGCACGAGCT GCCCGCCGCG GTCGAGTTCG TCGACGAATT GCCGCGCACC CCGGTCGGCA AACTCTCGCG CCACGAACTG CGCAATCAGC TACCCAAATC CACCAACCAG AGCCAACAGC AAACCGCACA GGGAGTCCGC CCATGA
|
Protein sequence | MTHPGEQYYP PGVRWDAEIA KGTLPDLLAK AATDYAARPA LEFRDGQINY AGLQERADIA AAALLRAGYG PGASVALFLG NTPDHPINFF GALKAGARVV HLSPLDGERA LSHKLSDSGA RVLITTDSAA LLPMALRFLD KGLLDRLIVC ADSDWGASAT PLAPLPDDPR VIRYADFIEG PAKPAAWPQI SPDDIALLQY TGGTTGLPKG AMLTHANLTS AVSIYDVWGL VRAGEGGAHR VICVLPLFHI YALTVILLRC LKQGDLISLH QRFDVAAVFR DIEEKRATVF PGVPTMWIAL ANDPSLESRD LSSLTMAGSG GAPLPVEVAR LFERKTNLKL KSGWGMTETC SPGTGHPPDG PDKPGSIGLM LPGIELDVVA LDDPKKVLPP GEVGEIRVRG PNVTQGYWNR AQETAESFVG DRFLTGDIGY MDSDGYFFLV DRKKDMIISG GFNVYPQMIE QAIYEHPAVQ EVIVIGIPDD YRGEAAKAFV KLRDGAKPFS VEELRDFLKG KLGKHELPAA VEFVDELPRT PVGKLSRHEL RNQLPKSTNQ SQQQTAQGVR P
|
| |