Gene RPD_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0040 
Symbol 
ID4020494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp50548 
End bp51813 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content64% 
IMG OID637960216 
Productthiolase 
Protein accessionYP_567181 
Protein GI91974522 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCTG AAGTCTATGT GATCGGAACT GCCTGTACGC CGTTCGGCAA AAGGCCGCAG 
ACCAGCTTCA AGGCATTGAC CCGCGAGGCG TATCTTGCGG CACTGGCGGA CGCGGGGATG
GCGGATGGAC GCGACATTGC GATGGCGTGG TTTGGAAACT GCGGCATGGG GACATTCGGA
CAGCGCAATA TTCGCGGACA GGTCTGCCTC TCTCCGCTGG TGCGGGAGGG GCTGTTTCCC
GAGCGCATCC CGACGATGAA TGTCGAAGGC GGCTGCGCGA CGGCCTCTCA GGCCCTGCAT
GGCGCATGGA AGGACATCGC GTCGGGCGAC GCGCAGCTCT CGCTCGCCAT CGGCGTCGAG
AAGACCTTCG TTCCGGACGA CCCGGCACGA ACGCAGGAGA TCTTCGACGG CGGGATCGAT
CAGCTCGATC CCGGCGAATG GCTCGCCTAC TACCGCGACG CCGGCGAGGT CAGTGGCAAG
CCGTTCCAGC CGGATGACAA GCGCGGCACC ATCTTCATGG ATACTTACGC CATGCAGGCG
GCGTATCACA TGAAACGCTA CGGCACGACG CAGCGCCAGA TCGCGATCGG CGCGGCCAAG
AACCATCATC ACGGAAGCCT GAATCCGCTG GCGCAATATC GGTTCACAAT GACGGCCGAT
GAGGTCCTGG CCGATCGCCC GATCAGCTAT CCGCTGACCC GCAGCATGTG TGCGCCGATC
GGCGACGGCG CCGCCGCCGC CCTGGTCTGC TCGAAGGACT ATCTTGCTTC ATTGCCGCGT
GGGGTGCGGG AGCGGGCGGT GAAGATCAGG GCGAGCGCGA TGTCGGGCGG CAAGTATCGG
TCGCTCGACG AGCCTGGGCT TTCGCGCATT GCCGCCGACA GGGCCTACAA AATGGCAGGG
ATTTCGCCGT CGGACATCGA TATCGCCGAG GTTCATGACG CCACCTCGTT CTGCGAGATC
TATCAGGTCG AGATGCTGCG CTTCTGCGCA GAAGGACAAG GCGGCGCCTA TGTCGCCTCA
GGCGCGACCG CGCTCGGCGG CGATCGTCCG GTGAATCTGT CCGGGGGACT GGTCTCCAAG
GGACATCCGG TCGGAGCCAC AGGTCTTTCG ATGATCCATG AGCTGGTGCT GCAGTTGCGC
GGCGAGGCCG GCGAACGGCA GGCCAAGAAT GCGCGGCTGG CGCTGGCGGA AAATGGCGGC
GGCGTCGTCG GCTTCGATGA AGCCGCCTGC GCGATCACGA TCCTGGAGAG GCTCGAGCCC
AACTGA
 
Protein sequence
MMPEVYVIGT ACTPFGKRPQ TSFKALTREA YLAALADAGM ADGRDIAMAW FGNCGMGTFG 
QRNIRGQVCL SPLVREGLFP ERIPTMNVEG GCATASQALH GAWKDIASGD AQLSLAIGVE
KTFVPDDPAR TQEIFDGGID QLDPGEWLAY YRDAGEVSGK PFQPDDKRGT IFMDTYAMQA
AYHMKRYGTT QRQIAIGAAK NHHHGSLNPL AQYRFTMTAD EVLADRPISY PLTRSMCAPI
GDGAAAALVC SKDYLASLPR GVRERAVKIR ASAMSGGKYR SLDEPGLSRI AADRAYKMAG
ISPSDIDIAE VHDATSFCEI YQVEMLRFCA EGQGGAYVAS GATALGGDRP VNLSGGLVSK
GHPVGATGLS MIHELVLQLR GEAGERQAKN ARLALAENGG GVVGFDEAAC AITILERLEP
N