Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3588 |
Symbol | |
ID | 3911390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4112286 |
End bp | 4113425 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885490 |
Product | thiolase |
Protein accession | YP_487194 |
Protein GI | 86750698 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.446493 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGCA ATCAGGTCGC CGTCGTCGGC GCCGCCGAGA CCACCAAGCT CGGCGTCATT CCCGACATGT CGCAGATCCA GCTCCACGCC GACGCCGCGC TGAACGCGAT GGCCGATTGC GGGCTGAAGC CGTCCGACAT CGACGGCGTC GCCACCGCGG TCGAGAGCCC GCAGCAGATC GCGCATTATC TCGGCATCAC CCCGAGCTGG GTCGACGGCA CGTCGGTCGG CGGCTGCTCG TTCATGCTGC ACGTCCGTCA CGCGGCGGCA GCGATCGAGG CCGGGCTGTG CAAGACCGTG CTGATCACCC ACGCCGAGAG CGGCAAATCG ATGATCGGCA AGCTGCCGCG CTCGATCCCC GCCGACAGCC TGCAGGGCCA GTTCGAGGCG CCCTACGGCA TCTACGGGCC GCCGAGCCAG TTTCCGATCC CGGTGCTGCG CTTCATGAAG ACCTGGGGCA TCACCCACGA GCAGCTCGCG ATGGTCGCCG TGGTGCAGCG CGAATGGGCG GCGAAGAATC CGCGCGCGAC CATGAAGGAC CCGATCACCG TCGCCGACGT GCTGAACTCG CGGATGATCG CCTATCCGTT CCGGCTGCTG CAATGCTGCC TCGTCACCGA CGGCGGCGGC GCGCTGATCA TGACCTCGGC CGATCGCGCC AAGGACTTCC CGCACAAGCC GGTCTATGTG CTCGGCACCG GCGAGAGCGT GGAAACGCCG ATGGTCAGCC AGATGGAGAG CTTCAACTCC TCGCGCGCCT TCAAGGTGGC GGGGCCGACC GCGTTCCGCG AGGCCGGCAT CAGCCACAGC GACGTCGACC ACCTGATGAT CTACGACGCC TTCGCGCATC TGCCGCTGTT CGGCCTCGGC GACCTCGGCT TCATGCCGTA TGAGGAGACC GGCAAGTTCA TTGCCGACGG CAACACCCGC CCCGGCGGCA AGCTGCCGCT CAACACCAAT GGCGGCGGGC TGAGCTATAT GCACTCCGGC ATGTACGGCA TGTACGCGCT GCAGGAGAGC GTCCGGCAGA TGCGCGGCAT CGCGCCGGCG CAGGTGGAAG GCGCGAAGAT CTCGGTCTGC CACGGCGTCG GCGGCATGTT CGCGGCGTCG GGAACGATCA TCTTTACGAA CGAGAAGTAG
|
Protein sequence | MRRNQVAVVG AAETTKLGVI PDMSQIQLHA DAALNAMADC GLKPSDIDGV ATAVESPQQI AHYLGITPSW VDGTSVGGCS FMLHVRHAAA AIEAGLCKTV LITHAESGKS MIGKLPRSIP ADSLQGQFEA PYGIYGPPSQ FPIPVLRFMK TWGITHEQLA MVAVVQREWA AKNPRATMKD PITVADVLNS RMIAYPFRLL QCCLVTDGGG ALIMTSADRA KDFPHKPVYV LGTGESVETP MVSQMESFNS SRAFKVAGPT AFREAGISHS DVDHLMIYDA FAHLPLFGLG DLGFMPYEET GKFIADGNTR PGGKLPLNTN GGGLSYMHSG MYGMYALQES VRQMRGIAPA QVEGAKISVC HGVGGMFAAS GTIIFTNEK
|
| |