Gene RPC_1074 details


Gene Information

Locus tag: RPC_1074
Symbol:
ID: 3969586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1175842
End bp: 1176996
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 69%
IMG OID: 637924185
Product: thiolase
Protein accession: YP_530957
Protein GI: 90422587
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
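
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1175842
end_bp = 1176996

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is
# counted in the gene length but not in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1155 bp
print(protein_length, "aa")  # 384 aa
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.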


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGC GGACGCGTGG CCATGCGGCC GTCGTCGGCG TGGCCGAATC CGATCTCGGC 
ATCGTCGGAC CGCACATGAC GCCGGTCGAT CTGATGGCGC AGGCGACGAT GCGCGCGCTC
GACGACTGCG GCCTCAAGCT CAGCGACGTC GACGCGGTGT TCGCCGCCAG CGCGCAGGTG
CGGTTCGGGC CGATGATGCT CTGCGAGTAT CTCGGGCTCA ATCCGCGCAC CATCGACGGC
ACCCAGATCG GCGGCTCCTC GTTCATGTCG CATCTGGCGC ATGCGGTGGC GGTGATCGAG
CTCGGGCTGT GCGAAGTGGC GCTGATCGCC TATGGCAGCA CGCAGCGCTC GGTGTCGCGC
GCCGCCGCGT CGCCGACCGA TATCAATCCT TACGAGGCGC CGTTTCGGCC GGTGATGCCG
GCGAGCGCCT ACGCGCTGGC CGCCGCCCGA CACATGTATC AGTTCGGCAC CACGCGCGAG
CAGCTCGCCG ACGTCGCGGT CGCGGCGCGG CAATGGGCGC TGCTCAATCC GAAGGCCTGG
GAGAAAGAAC CGCTCAGCCG CGAGCAGGTG CTGGGCGCGC GGATGCTGTC GGACCCGCTC
ACCGTGCGCG ACTGTTGCCT GGTGCTCGAC GGCGGCGGCG CCATGATCGT GACTTCGGCG
GCGACCGCGC GCGATTGCCG CAAACTGCCG ATCCATGTGC TCGGCACCGG CGAAGCCATC
GGCCACGGCT CGATCTCCGG CATGGCCGAC CTCACCACCA CGGCCGCCGC GGTCTCCGGG
CCGCGCGCCT TTGCGGCGGC GGGCTTAGCG CCGGCCGACA TCGACGTCGC GCTGCTATAC
GACGCCTTCA CCATCACGCC GATCCTGTTT CTCGAGGATC TCGGCTTCTG CGCCAAGGGC
GAGGGCGGTT CTTTCGTGCA GGACGGTGGC ATCGCGCCGG GCGGTCGCCT CGCCGTCAAC
ACCAATGGCG GCGGGCTGTC CTACTGCCAT CCCGGCATGT ACGGCCTGTT GGCGATGATC
GAATGCGTCC GGCAGCTGCG CGGCGAATGC GGCGCCAGGC AGCTCGCCAA GCACAACGTC
GCACTGGCGC ACGGCAATGG CGGGGTGCTG TCCAGCCAAT GCACCGCGAT CTTCGGCACA
CCGGCGGCGC TGTAG
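
The GC content field is the share of G and C bases in the gene sequence. The following is a minimal sketch of how such a percentage could be computed from the sequence as printed here, in space-separated blocks of ten bases. The function name `gc_content` is hypothetical, and how the database rounds the value is not stated in the record.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the blocks-of-ten layout, line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record; pass the
# full sequence in the same way to compare against the GC content field.
first_row = "ATGACGACGC GGACGCGTGG CCATGCGGCC GTCGTCGGCG TGGCCGAATC CGATCTCGGC"
print(f"{gc_content(first_row):.1f}%")
```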
 
Protein sequence
MTTRTRGHAA VVGVAESDLG IVGPHMTPVD LMAQATMRAL DDCGLKLSDV DAVFAASAQV 
RFGPMMLCEY LGLNPRTIDG TQIGGSSFMS HLAHAVAVIE LGLCEVALIA YGSTQRSVSR
AAASPTDINP YEAPFRPVMP ASAYALAAAR HMYQFGTTRE QLADVAVAAR QWALLNPKAW
EKEPLSREQV LGARMLSDPL TVRDCCLVLD GGGAMIVTSA ATARDCRKLP IHVLGTGEAI
GHGSISGMAD LTTTAAAVSG PRAFAAAGLA PADIDVALLY DAFTITPILF LEDLGFCAKG
EGGSFVQDGG IAPGGRLAVN TNGGGLSYCH PGMYGLLAMI ECVRQLRGEC GARQLAKHNV
ALAHGNGGVL SSQCTAIFGT PAAL
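
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand; translation table 11 uses the same sense-codon assignments as the standard genetic code and differs from it in the permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N are not handled
    and raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACGACGC GGACGCGTGG CCATGCGGCC GTCGTCGGCG TGGCCGAATC CGATCTCGGC"
last_row = "CCGGCGGCGC TGTAG"

print(translate(first_row))  # MTTRTRGHAAVVGVAESDLG
print(translate(last_row))   # PAAL
```

The two outputs match the first twenty and the last four residues of the protein sequence above. The last row can be translated on its own because the nineteen full rows before it total 1140 bases, a multiple of three, so it starts in frame.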