Gene RPD_0353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0353 
Symbol 
ID4020818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp417000 
End bp418610 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID637960537 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_567492 
Protein GI91974833 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000011697 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGGACG CAGAGGATAT CAAGCTCGCG ACCATCTCCG ATCAGGCGAC CAATCTCGCG 
CGGCGGGTGA ATGTCGGTGA CATCACAAGA CGAGGTGCGC GCCGTCATCG CGACAAGATC
GCCGTTATCA TGGGCGAGAC CCGTCTCACA TACGGCGAAC TGGACGCTCG GGCGAACCGC
ATCGCGCATG GATTGCTGGC GATGGGCTTG GGCAACGGGG CCCGCATCGG CGGCCTCGCC
CGAAACTCGA TCGACTTCCT GACATTGTAC TTTGCAGCAG CAAAGGCTGG CGCGATCTTC
TGCCCGTCCA ATCCAGCAAT TCCTGACGCG GATCTCGTTC ATATCCTTGG TCATGCCGAG
GTTTCGGCAA TCTTCATCGA TCCTGACCGG CACCAGCAAT TCACCGCTGT CGCATCCCAG
GTGCCTTCCA TCAGAAAGAT ATTCTCCGTC GGCGGCAACG GGCAGGCAGA TTCGCAGCTC
GACTCGCTGG CGGTGATCGC CGAAGGGCAA CCTGCAATCG ATCCGGAGAC CGCGACAGGC
GATCGTGATG TCGCCATGAT CATGTACACC AGCGGAACGA CCTCCGCCCC CAAGGGAGCG
ATGTTGTCGC ACATCAACGT GACAACTGGC GCCGTGCACA ATGCGTTCGC GGGCGAGGTC
GACGAGAACA CCATCGCAAC CGCCATACTA CCCTTGTTTC ATTGCGGTCA GCTATCAATC
AGCAGCGGAA CGTTGATGCG TGGAGGCACC GTCGTCGTTT TCGACGGGTT CGAACCCGCA
GCCCTCCTGG ACGCGATCGC GCGCGAACGC ATCACCTGGC TATTCGCTCT TCCCGCGATG
TATCGCGCCC TCCTGGCGCA TAAGGATCTC GACAACACGG ACGTGTCGAG TCTCGCATTC
TGTTTGTATG CGATGGCTCC GATGGATCCT TCGACGCTGC GCGAAGCGTC GCGCAGGCTC
AAGGCTCGCT TCGCACTCAC CAGCGGACAG ACGGAAGCCT ATCCGCCGAC GGTTGTATTC
GCTCCGGAAT TTCAGTTGAC CAAGCACGGC GCCTTTTGGG GGCGCGCCAT GCCCTTGGTC
GATTTGGCAA TCATGGACGA CGATGGGCGC CTGGTCGAGG ACGGATCCGT CGGCGAAATC
GTCTATCGAG GTCCGATGGT CATGGAAGGG TATCTGAAAG ATCCGGAAGC CACAGCTCGG
GCGTTCGAGG GCGGTTGGTT TCACTCCGGC GATCTCGGGC GCTTCGACGA AGACTCGCTG
CTCCTCTTCG TCGACCGCAA GAAGGATATC ATCAAGTCGG GCGGAGAAAA CGTCTCATCC
GTCAAGGTGG AGAGCTGTCT CCTTGCCCAC CCGGCGGTGC GGGCTGCGGC GATTGTCGGC
GTGCCTCACA GCCGTTGGAG CGAGGCCGTT GTCGCTGCTG TTTGCCTGCT TCCGGGTTCA
GTGGAAGACG AAGGGCAATT GATCGCCCAT TGTAAACAGA CACTCGCACC CTTCGAAGTC
CCGAAGAAGA TTGTCTTCTA CCGTGAGCTT CCCCAGACAG CGACCGGAAA GCTCCAGAAA
TATCAGATCC GGGGCGAGCT CGAAAACCTG TTTCGCGACC AGACGAACTG A
 
Protein sequence
MSDAEDIKLA TISDQATNLA RRVNVGDITR RGARRHRDKI AVIMGETRLT YGELDARANR 
IAHGLLAMGL GNGARIGGLA RNSIDFLTLY FAAAKAGAIF CPSNPAIPDA DLVHILGHAE
VSAIFIDPDR HQQFTAVASQ VPSIRKIFSV GGNGQADSQL DSLAVIAEGQ PAIDPETATG
DRDVAMIMYT SGTTSAPKGA MLSHINVTTG AVHNAFAGEV DENTIATAIL PLFHCGQLSI
SSGTLMRGGT VVVFDGFEPA ALLDAIARER ITWLFALPAM YRALLAHKDL DNTDVSSLAF
CLYAMAPMDP STLREASRRL KARFALTSGQ TEAYPPTVVF APEFQLTKHG AFWGRAMPLV
DLAIMDDDGR LVEDGSVGEI VYRGPMVMEG YLKDPEATAR AFEGGWFHSG DLGRFDEDSL
LLFVDRKKDI IKSGGENVSS VKVESCLLAH PAVRAAAIVG VPHSRWSEAV VAAVCLLPGS
VEDEGQLIAH CKQTLAPFEV PKKIVFYREL PQTATGKLQK YQIRGELENL FRDQTN