Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2101 |
Symbol | |
ID | 3908515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2388707 |
End bp | 2389876 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883994 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_485718 |
Protein GI | 86749222 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0474037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00217907 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCCA GCATCGTTGG ATGGGCGCAT ATGCCCTTCG GCAAGTTCGA CGCCGAGACC GTCGAGAGCA TGATCGTCCG CGTCGCCAAT GAGGCGATCG CCGATGCCGG CATCGCCGCC GGCGATGTCG ATGAAATCGT GCTGGGGCAT TTCAACGCCG GCTTCTCGCC GCAGGATTTC ACCGCCGCTC TGGTGCTGCA GGCCGATCCG GCGCTGCGCT TCAAGCCCGC CACCCGGGTC GAGAACGCCT GCGCCACCGG CTCCGCCGCC GTGCATCAGG GCATCCGCGC GATCGAGGCC GGCGCCGCCA AGGTGGTGCT GGTGGTCGGC GTCGAGCAGA TGACGCGGAC GCCGGGCCCG GAAATCGGCA AGAACCTGCT GCGCGCCTCC TATCTGCCGG AAGACGGCGA GACGCCCGCG GGCTTCGCCG GGGTGTTCGG CATCATCGCG CAACGATACT TCCAGAAATA CGGCGACCAG TCCGACGCCC TCGCCATGAT CGCCGCCAAG AACCACCACA ACGGAGTGGC CAATCCCTAT GCGCAGATGC GCAAGGATTT CGGTTTCGAG TTCTGCCGCG CCGAGGGCGA GAAGAACCCG TTCGTCGCCG GCCCCCTGAA GCGCACCGAC TGCTCGCTCG TCTCCGACGG CGCCGCCGCG CTGGTGCTGA CTTCGGCGGA GAACGCCAAG ACCATGGGCA AGGCCGTCAA CATCCGCGCC CGCGCCCATG CGCAGGATTT CCTGCCGATG TCCAAGCGCG ACATCCTGCA GTTCGAAGGC TGCACCGTCG CCTGGCAGCG CGCGCTGGCC GACGCCGGAA TCACGCTCGA TGATCTCTCC TTCGTCGAGA CCCACGATTG CTTCACCATC GCCGAACTGA TCGAGTACGA AGCGATGGGG CTGACCACGA AGGGGCAGGG CGCCCGCGCC ATCAAGGAAG GCTGGACCCA GAAGGACGGC AAGCTGCCGA TCAATCCGTC CGGCGGCCTC AAGGCCAAGG GCCATCCGAT CGGCGCCACC GGCGTGTCGA TGCACGTGCT GACCGCGATG CAGCTGCTCG GCCAGGCGCC GGAAGGCATG CAGATCAAGG ACGCCAAGCT CGGCGGCATC TTCAACATGG GCGGCGCCGC CGTCGCCAAC TACGTCTCGG TGCTCGAGCC GGCGAAGTAA
|
Protein sequence | MTASIVGWAH MPFGKFDAET VESMIVRVAN EAIADAGIAA GDVDEIVLGH FNAGFSPQDF TAALVLQADP ALRFKPATRV ENACATGSAA VHQGIRAIEA GAAKVVLVVG VEQMTRTPGP EIGKNLLRAS YLPEDGETPA GFAGVFGIIA QRYFQKYGDQ SDALAMIAAK NHHNGVANPY AQMRKDFGFE FCRAEGEKNP FVAGPLKRTD CSLVSDGAAA LVLTSAENAK TMGKAVNIRA RAHAQDFLPM SKRDILQFEG CTVAWQRALA DAGITLDDLS FVETHDCFTI AELIEYEAMG LTTKGQGARA IKEGWTQKDG KLPINPSGGL KAKGHPIGAT GVSMHVLTAM QLLGQAPEGM QIKDAKLGGI FNMGGAAVAN YVSVLEPAK
|
| |