Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2105 |
Symbol | |
ID | 3908519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2394181 |
End bp | 2395395 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883998 |
Product | thiolase |
Protein accession | YP_485722 |
Protein GI | 86749226 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0715913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0186682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCCG GGTTCGCGCC AAGGGGCGCG CCCCGGAATG ACGAAAGTGA GGATGATGCC AAGCGTCGGA CGTTCATCAT GAGCTACATC ACCGGCGTCG GCCTCACGCC GTTCGGCAAG ATCGATGGTT CGACGACGCT GTCGCTGATG CGCGACGCTG CCGAGCAGGC GCTCGCCGAT GCGGAACTGA CCCGCTCCGA CATCGACGGG CTGCTGTGCG GCTATTCGAC GACGATGCCG CACATCATGC TGGCCACCGT GTTCGCCGAG CATTTCGGCA TTCGTCCCGC CTACTGCCAC GCCGTGCAGG TCGGCGGCGC CACCGGGATG GCGATGGCGA TGCTGGCGCA TCAGCTGGTC GAGAGCGGTG CGGCGAAGAA CATTCTGGTG GTCGGCGGCG AAAATCGCCT GAGCGGGCAG AGCCGCGACG CCTCCGTGCA GGCGCTGGCG CAGGTCGGCC ACCCGGTCTA CGAAGTGCCG CTCGGCCCGA CCATCCCCGC GTATTATGGC CTGGTCGCGT CGCGCTACAT GCACCAGCAC GGCGTCAGCG AGGAAGACCT CGCCGCATTC GCCGTGCTGA TGCGCGCGCA CGCGGCCCTC CATCCCGGTG CGCAGTTTCA CGAGCCGATC AGCATCGCCG AGGTGATGGC GTCGAAGCCG ATCGCCACGC CGCTGAAACT GCTCGATTGC TGTCCTGTGT CCGATGGCGG CGCGGCGCTG ATCGTCAGCC GCGAGCCGAC CACGACTCAT CGTATCAAGG TCCGCGGTTG CGGCCAGGCG CATACCCATC AGCACGTCAC CGCGATGCCG GCGGACGGCC CGTCCGGCGC GGAGCAGTCG ATCGCGCGTG CCAAAGCTGC GAGCGGCGTG GCACTCGGCG ATATTCGCTA CGCCGCGGTG TATGACAGCT TCACCATCAC GCTGCTGATG CTGCTGGAAG ATCTCGGCCT CGCGCCCCGC GGCGAGGCGG CGGCGCGCGC CCGCGATGGT TACTTCTCGC GCGACGGCGC GATGCCGCTC AACACCCATG GCGGGCTGTT GTCCTACGGC CATTGCGGCG TCGGTGGCGC GATGGCGCAT CTGGTCGAGA CCCATCTGCA GATGACCGGT CGGGCCGATG ATCGCCAGGT CCGCGACGCC TCGCTGGCGT TGCTGCATGG CGACGGCGGC GTGCTGTCGT CGCATGTCAG CATGTTCCTG GAGCGGGTGC GATGA
|
Protein sequence | MDSGFAPRGA PRNDESEDDA KRRTFIMSYI TGVGLTPFGK IDGSTTLSLM RDAAEQALAD AELTRSDIDG LLCGYSTTMP HIMLATVFAE HFGIRPAYCH AVQVGGATGM AMAMLAHQLV ESGAAKNILV VGGENRLSGQ SRDASVQALA QVGHPVYEVP LGPTIPAYYG LVASRYMHQH GVSEEDLAAF AVLMRAHAAL HPGAQFHEPI SIAEVMASKP IATPLKLLDC CPVSDGGAAL IVSREPTTTH RIKVRGCGQA HTHQHVTAMP ADGPSGAEQS IARAKAASGV ALGDIRYAAV YDSFTITLLM LLEDLGLAPR GEAAARARDG YFSRDGAMPL NTHGGLLSYG HCGVGGAMAH LVETHLQMTG RADDRQVRDA SLALLHGDGG VLSSHVSMFL ERVR
|
| |