Gene RPB_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2105 
Symbol 
ID3908519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2394181 
End bp2395395 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID637883998 
Productthiolase 
Protein accessionYP_485722 
Protein GI86749226 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0715913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0186682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCCG GGTTCGCGCC AAGGGGCGCG CCCCGGAATG ACGAAAGTGA GGATGATGCC 
AAGCGTCGGA CGTTCATCAT GAGCTACATC ACCGGCGTCG GCCTCACGCC GTTCGGCAAG
ATCGATGGTT CGACGACGCT GTCGCTGATG CGCGACGCTG CCGAGCAGGC GCTCGCCGAT
GCGGAACTGA CCCGCTCCGA CATCGACGGG CTGCTGTGCG GCTATTCGAC GACGATGCCG
CACATCATGC TGGCCACCGT GTTCGCCGAG CATTTCGGCA TTCGTCCCGC CTACTGCCAC
GCCGTGCAGG TCGGCGGCGC CACCGGGATG GCGATGGCGA TGCTGGCGCA TCAGCTGGTC
GAGAGCGGTG CGGCGAAGAA CATTCTGGTG GTCGGCGGCG AAAATCGCCT GAGCGGGCAG
AGCCGCGACG CCTCCGTGCA GGCGCTGGCG CAGGTCGGCC ACCCGGTCTA CGAAGTGCCG
CTCGGCCCGA CCATCCCCGC GTATTATGGC CTGGTCGCGT CGCGCTACAT GCACCAGCAC
GGCGTCAGCG AGGAAGACCT CGCCGCATTC GCCGTGCTGA TGCGCGCGCA CGCGGCCCTC
CATCCCGGTG CGCAGTTTCA CGAGCCGATC AGCATCGCCG AGGTGATGGC GTCGAAGCCG
ATCGCCACGC CGCTGAAACT GCTCGATTGC TGTCCTGTGT CCGATGGCGG CGCGGCGCTG
ATCGTCAGCC GCGAGCCGAC CACGACTCAT CGTATCAAGG TCCGCGGTTG CGGCCAGGCG
CATACCCATC AGCACGTCAC CGCGATGCCG GCGGACGGCC CGTCCGGCGC GGAGCAGTCG
ATCGCGCGTG CCAAAGCTGC GAGCGGCGTG GCACTCGGCG ATATTCGCTA CGCCGCGGTG
TATGACAGCT TCACCATCAC GCTGCTGATG CTGCTGGAAG ATCTCGGCCT CGCGCCCCGC
GGCGAGGCGG CGGCGCGCGC CCGCGATGGT TACTTCTCGC GCGACGGCGC GATGCCGCTC
AACACCCATG GCGGGCTGTT GTCCTACGGC CATTGCGGCG TCGGTGGCGC GATGGCGCAT
CTGGTCGAGA CCCATCTGCA GATGACCGGT CGGGCCGATG ATCGCCAGGT CCGCGACGCC
TCGCTGGCGT TGCTGCATGG CGACGGCGGC GTGCTGTCGT CGCATGTCAG CATGTTCCTG
GAGCGGGTGC GATGA
 
Protein sequence
MDSGFAPRGA PRNDESEDDA KRRTFIMSYI TGVGLTPFGK IDGSTTLSLM RDAAEQALAD 
AELTRSDIDG LLCGYSTTMP HIMLATVFAE HFGIRPAYCH AVQVGGATGM AMAMLAHQLV
ESGAAKNILV VGGENRLSGQ SRDASVQALA QVGHPVYEVP LGPTIPAYYG LVASRYMHQH
GVSEEDLAAF AVLMRAHAAL HPGAQFHEPI SIAEVMASKP IATPLKLLDC CPVSDGGAAL
IVSREPTTTH RIKVRGCGQA HTHQHVTAMP ADGPSGAEQS IARAKAASGV ALGDIRYAAV
YDSFTITLLM LLEDLGLAPR GEAAARARDG YFSRDGAMPL NTHGGLLSYG HCGVGGAMAH
LVETHLQMTG RADDRQVRDA SLALLHGDGG VLSSHVSMFL ERVR