Gene RPB_1747 details


Gene Information

Locus tag: RPB_1747
Symbol:
ID: 3909734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1995904
End bp: 1997589
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 66%
IMG OID: 637883641
Product: AMP-dependent synthetase and ligase
Protein accession: YP_485366
Protein GI: 86748870
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR03205] dicarboxylate--CoA ligase PimA
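
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1995904, 1997589)
print(length, protein_length_aa(length))
```

With the values from this record, the two results agree with the listed gene length (1686 bp) and protein length (561 aa).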


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0225164
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCACC CCGGTGAGCA GTACTATCCG CCCGGCGTTC GCTGGGATGC GGAGATCGCG 
AAGGGCACGT TGCCCGATCT CCTGGCGAAG GCGGCGACCG ACTACGCGGC GCGGCCGGCG
CTGGAATTCC GCGACGGCCA GATAAATTAC GCCGGGCTGC AGGAACGCGC CGACATCGCC
GCCGCAGCCC TGCTGCGCGC CGGCTACGGC CCCGGCGCTT CGGTCGCTCT GTTTCTCGGC
AACACGCCGG ATCACCCGAT CAACTTCTTC GGCGCGCTGA AGGCCGGCGC CCGCGTTGTG
CATCTGTCGC CGCTCGACGG CGAGCGGGCG CTGTCGCACA AGCTCAGCGA TTCCGGCGCG
CGCGTGCTGA TCACCACCGA TTCCGCAGCA TTGCTGCCGA TGGCGCTGAG GTTCCTCGAC
AAGGGTCTGC TCGATCGCCT GATCGTCTGC GCCGACTCAG ATTGGGGCGC ATCGGCCACG
CCGCTCGCCC CATTGCCGGA CGATCCGCGC GTGATCCGCT ACGCCGACTT CATCGAAGGC
CCTGCGAAGC CCGCCGCATG GCCGCAGATC TCGCCCGACG ACATCGCGCT CCTGCAATAC
ACCGGCGGCA CCACCGGCCT GCCCAAGGGC GCGATGCTGA CTCACGCCAA TCTCACCTCG
GCGGTGTCGA TCTACGACGT CTGGGGCCTG GTGCGCGCGG GCGAGGGCGG CGCGCATCGC
GTGATCTGCG TGCTGCCGCT GTTTCACATC TACGCGCTGA CCGTGATCCT GCTGCGCTGT
CTGAAGCAGG GCGACCTGAT CTCGCTGCAT CAGCGCTTCG ACGTCGCCGC GGTGTTCCGC
GACATCGAGG AGAAGCGCGC CACGGTGTTC CCCGGCGTGC CGACGATGTG GATCGCGCTC
GCCAACGATC CGTCGCTGGA GAGCCGCGAT CTGTCGTCGC TGACGATGGC CGGCTCCGGC
GGCGCGCCGC TGCCGGTCGA GGTCGCGCGA TTGTTCGAGC GCAAGACCAA TCTCAAACTC
AAGAGCGGCT GGGGCATGAC CGAGACCTGC TCGCCCGGCA CCGGCCATCC GCCGGACGGG
CCCGACAAGC CGGGCTCGAT CGGGCTGATG CTGCCGGGGA TCGAACTCGA CGTCGTCGCG
CTCGACGATC CGAAGAAGGT TCTGCCGCCC GGCGAAGTCG GCGAGATCCG GGTCCGCGGT
CCCAATGTCA CCCAGGGCTA CTGGAACCGG GCGCAGGAAA CCGCGGAGTC GTTCGTCGGT
GACCGTTTTC TCACCGGCGA TATCGGCTAT ATGGACTCCG ACGGCTATTT CTTCCTGGTC
GACCGCAAGA AGGACATGAT CATTTCGGGA GGATTCAACG TCTACCCGCA GATGATCGAA
CAAGCGATCT ATGAACATCC GGCGGTGCAG GAAGTGATCG TGATTGGCAT CCCCGACGAT
TATCGCGGCG AGGCGGCGAA GGCGTTCGTC AAGCTGCGCG ACGGCGCGAA GCCTTTCAGC
GTCGAGGAGC TGCGCGATTT CCTCAAGGGC AAACTCGGCA AGCACGAGCT GCCCGCCGCG
GTCGAGTTCG TCGACGAATT GCCGCGCACC CCGGTCGGCA AACTCTCGCG CCACGAACTG
CGCAATCAGC TACCCAAATC CACCAACCAG AGCCAACAGC AAACCGCACA GGGAGTCCGC
CCATGA
 
Protein sequence
MTHPGEQYYP PGVRWDAEIA KGTLPDLLAK AATDYAARPA LEFRDGQINY AGLQERADIA 
AAALLRAGYG PGASVALFLG NTPDHPINFF GALKAGARVV HLSPLDGERA LSHKLSDSGA
RVLITTDSAA LLPMALRFLD KGLLDRLIVC ADSDWGASAT PLAPLPDDPR VIRYADFIEG
PAKPAAWPQI SPDDIALLQY TGGTTGLPKG AMLTHANLTS AVSIYDVWGL VRAGEGGAHR
VICVLPLFHI YALTVILLRC LKQGDLISLH QRFDVAAVFR DIEEKRATVF PGVPTMWIAL
ANDPSLESRD LSSLTMAGSG GAPLPVEVAR LFERKTNLKL KSGWGMTETC SPGTGHPPDG
PDKPGSIGLM LPGIELDVVA LDDPKKVLPP GEVGEIRVRG PNVTQGYWNR AQETAESFVG
DRFLTGDIGY MDSDGYFFLV DRKKDMIISG GFNVYPQMIE QAIYEHPAVQ EVIVIGIPDD
YRGEAAKAFV KLRDGAKPFS VEELRDFLKG KLGKHELPAA VEFVDELPRT PVGKLSRHEL
RNQLPKSTNQ SQQQTAQGVR P
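
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the database's own method: `translate` and `gc_fraction` are hypothetical helper names. The codon assignments follow NCBI translation table 11, which assigns the same amino acids as the standard code. Any alternative start codon would need separate handling; this gene starts with ATG, so that case does not arise here. Whitespace in the 10-base blocks is stripped before processing.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACCCACC CCGGTGAGCA GTACTATCCG"))
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows a comparison against the protein sequence and the GC content listed in this record.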