Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0859 |
Symbol | |
ID | 3969817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 947285 |
End bp | 948454 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637923975 |
Product | thiolase |
Protein accession | YP_530748 |
Protein GI | 90422378 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.823302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCA AACGCGACAT CGTGATCGCC GGCTATTCGG AGACGGCGAT TGAGTATAAA TCCGGCCGCA GCGCCTACGA CCTCGGCGGC GAGGCGCTGG CGAAGCTGCT GGCGGCGACC GGCATCGAAA AGGACGCGAT CGACGGCCTC GCCGTCACCG CGCCGCTCAG CGAATGTCCA AATCCGTTCT TCGCGGTCTA TATGACCGAG GCGCTGGGGC TGACGCCGAC CTGGCTGAAC TATGGCGGCA CCGGCGGTTG CTCGGCCACC GGCGGCGTGG CGCGTGCGGC TTCTGCCATT CGCGATGGCT TGTGCGAGGT GGTGGTGGTG CTGTCGGCGG ATGCGCCGAG CACGTCGTGG CGCGCCAATT ACGGCGCCTA TCGCGGCGAA TTCCAGGATC CGCCGGGCGT GCAGGGACCG CCGGCGACCT TCGGCCTGCT GATGAGCCGC TACAAGCATC AATACGGCCT GAATTCCGAC GCGCTCGGCA AGATCGCCAT CACCCAGCGC GACCACGCGC TGCACAACGA CAACGCCTAT GCCAAGTTCA AGACCCCGAT CACGATGGAC GACTACAACA AGTCGCGCGT CATCGCCGAT CCGTTGCGGC TGCTGGACTG CGTGATGTTC TGCGACGGCG CCAACGCCTT CCTGGTCACC AGCGAGGCCA AGGCCAAGAG CCTCGGCATC AGCAAGATGG TGTATCCGAC CGCCTATGCC GAGATCACCA ACGTCAACGG CAACCAGTCC TGCCCGGACA TCACCGAGAC CGGGTTCTCC AAGATCGCGC CGAAGCTGTA CAAGCAGTCC GGGCTCAGCG CGAAGGACAT CAAGATGTTC CAGCCCTATG ACGACTTCAC CATCGCGGTG ATGATGAAGT TCGAGGACTT TGGCTTCTGC AAGCGCGGCC AGGGCAGCGA CTTCACGCTC GACACCGATC TGTCGTTCAA GGGCACGCTG CCGCTCAACA CCGGCGGCGG GCAGATTTCC GCCGGCCAGC CCGGGCTCGC CAGCGGCGGG CTCAACCTCG CCGAAGCGGT GCGGCAGATG TTCGGCGAAG GCGGCGGCCG CCAGGTCGCC AATCCCAGCA ACGCGCTGAT CACCGGCATC GGCGTCATCC CTTACGCGCG GAACTGGGGA ACCAGTGCCG CCATGATCCT GGAGGTCTGA
|
Protein sequence | MSSKRDIVIA GYSETAIEYK SGRSAYDLGG EALAKLLAAT GIEKDAIDGL AVTAPLSECP NPFFAVYMTE ALGLTPTWLN YGGTGGCSAT GGVARAASAI RDGLCEVVVV LSADAPSTSW RANYGAYRGE FQDPPGVQGP PATFGLLMSR YKHQYGLNSD ALGKIAITQR DHALHNDNAY AKFKTPITMD DYNKSRVIAD PLRLLDCVMF CDGANAFLVT SEAKAKSLGI SKMVYPTAYA EITNVNGNQS CPDITETGFS KIAPKLYKQS GLSAKDIKMF QPYDDFTIAV MMKFEDFGFC KRGQGSDFTL DTDLSFKGTL PLNTGGGQIS AGQPGLASGG LNLAEAVRQM FGEGGGRQVA NPSNALITGI GVIPYARNWG TSAAMILEV
|
| |