Gene RPB_3588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3588 
Symbol 
ID3911390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4112286 
End bp4113425 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content67% 
IMG OID637885490 
Productthiolase 
Protein accessionYP_487194 
Protein GI86750698 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.446493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCA ATCAGGTCGC CGTCGTCGGC GCCGCCGAGA CCACCAAGCT CGGCGTCATT 
CCCGACATGT CGCAGATCCA GCTCCACGCC GACGCCGCGC TGAACGCGAT GGCCGATTGC
GGGCTGAAGC CGTCCGACAT CGACGGCGTC GCCACCGCGG TCGAGAGCCC GCAGCAGATC
GCGCATTATC TCGGCATCAC CCCGAGCTGG GTCGACGGCA CGTCGGTCGG CGGCTGCTCG
TTCATGCTGC ACGTCCGTCA CGCGGCGGCA GCGATCGAGG CCGGGCTGTG CAAGACCGTG
CTGATCACCC ACGCCGAGAG CGGCAAATCG ATGATCGGCA AGCTGCCGCG CTCGATCCCC
GCCGACAGCC TGCAGGGCCA GTTCGAGGCG CCCTACGGCA TCTACGGGCC GCCGAGCCAG
TTTCCGATCC CGGTGCTGCG CTTCATGAAG ACCTGGGGCA TCACCCACGA GCAGCTCGCG
ATGGTCGCCG TGGTGCAGCG CGAATGGGCG GCGAAGAATC CGCGCGCGAC CATGAAGGAC
CCGATCACCG TCGCCGACGT GCTGAACTCG CGGATGATCG CCTATCCGTT CCGGCTGCTG
CAATGCTGCC TCGTCACCGA CGGCGGCGGC GCGCTGATCA TGACCTCGGC CGATCGCGCC
AAGGACTTCC CGCACAAGCC GGTCTATGTG CTCGGCACCG GCGAGAGCGT GGAAACGCCG
ATGGTCAGCC AGATGGAGAG CTTCAACTCC TCGCGCGCCT TCAAGGTGGC GGGGCCGACC
GCGTTCCGCG AGGCCGGCAT CAGCCACAGC GACGTCGACC ACCTGATGAT CTACGACGCC
TTCGCGCATC TGCCGCTGTT CGGCCTCGGC GACCTCGGCT TCATGCCGTA TGAGGAGACC
GGCAAGTTCA TTGCCGACGG CAACACCCGC CCCGGCGGCA AGCTGCCGCT CAACACCAAT
GGCGGCGGGC TGAGCTATAT GCACTCCGGC ATGTACGGCA TGTACGCGCT GCAGGAGAGC
GTCCGGCAGA TGCGCGGCAT CGCGCCGGCG CAGGTGGAAG GCGCGAAGAT CTCGGTCTGC
CACGGCGTCG GCGGCATGTT CGCGGCGTCG GGAACGATCA TCTTTACGAA CGAGAAGTAG
 
Protein sequence
MRRNQVAVVG AAETTKLGVI PDMSQIQLHA DAALNAMADC GLKPSDIDGV ATAVESPQQI 
AHYLGITPSW VDGTSVGGCS FMLHVRHAAA AIEAGLCKTV LITHAESGKS MIGKLPRSIP
ADSLQGQFEA PYGIYGPPSQ FPIPVLRFMK TWGITHEQLA MVAVVQREWA AKNPRATMKD
PITVADVLNS RMIAYPFRLL QCCLVTDGGG ALIMTSADRA KDFPHKPVYV LGTGESVETP
MVSQMESFNS SRAFKVAGPT AFREAGISHS DVDHLMIYDA FAHLPLFGLG DLGFMPYEET
GKFIADGNTR PGGKLPLNTN GGGLSYMHSG MYGMYALQES VRQMRGIAPA QVEGAKISVC
HGVGGMFAAS GTIIFTNEK