Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4348 |
Symbol | |
ID | 5211332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5464182 |
End bp | 5465372 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597931 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_001278635 |
Protein GI | 148658430 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.430233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0132867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGA ATGGACGCGA CGTAGTGGTG CTGAGTGGTG TCCGCACCGC GATCGGCAAT TTTGGCGGCA GTCTCAAGGA TCAACCGCCG AGCGAACTGG CGGCGCAGGT CGTGCGTGAA GCGGTCAGGC GCGCGGGTGT CGAGCCGACG GAAATCGGGC AGGTTGTGTT TGGTAATATC ATCCACACCG ACGGGCACGA CCACTATCTG GCGCGGGTTG CAGGGGTCAA GGGCGGCTTG CCGGTGGACG TTCCGGCGTT GACGTTGAAT CGCCTGTGCG GCAGTGGCTT GCAGGCGATC ATCTCGGCAG CGCAGACAAT CATGCTCGGC GATGCCGATG CCGCCGTCGC TGGCGGCGCC GAGTCGATGA GTCGCAGCCC ATACTGGGCG CATGCGATGC GCTGGGGCGC GCGGATGAAT GATGTTGCGA TGGTCGATGC AATGGTAGCG GCGCTCAGCG ATCCGTTCGA TGATGTGCAC ATGGGCGTAA CAGCTGAGAA TGTCGCCCGG AAGTGGGAGA TTACTCGCGA GGATCAGGAT GCGCTGGCTG TTGAAAGTCA TAAACGCGCT GCCGCTGCCA TTGCGGAAGG GCGTTTCAAG GATCAAATTC TGCCCGTTGA GATCAAGGTC AAGGGCGGGG TTCAGATGTT TGATACCGAT GAAAGCGTGC GCCCTGACAC AAGTCTTGAG AAGCTTGCCA AACTGCGTCC GGTCTTCGAC AAGCAGGGAA CCGTGACCGC CGGTAATGCA TCGAGCATCA ATGATGCTGC GGCTGCTGTG GTGTTGATGG AACGCAGTGT TGCCGAACAG CGCGGCTACA AACCGATGGG TCGTCTGGTG GGGTACAGCG TTGTCGGCGT CGACCCGAAG TATATGGGCA TCGGTCCGGT TCCGGCAGTG CGCAAGGTGT TGGAGCGCAC CGGACTGAGC ATCGATGACA TCGATCTGTT TGAACTGAAC GAGGCGTTCG CGGCGCAGGC GCTCGCCGTC ATCCGCGAGC TTGATCTACC AATGGAGAAG GTCAATCCGA ACGGCAGCGG CATTTCGCTC GGTCACCCGA TTGGCGCAAC CGGCGCGATA CTGACGGTGA AGGCGCTCTA CGAGCTGCAA CGCACCGGTG GTCGCTACGC CTGCGTCACC ATGTGCATCG GCGGCGGTCA GGGCATCGCT GCGATCTTCG AGCGGATATA G
|
Protein sequence | MTANGRDVVV LSGVRTAIGN FGGSLKDQPP SELAAQVVRE AVRRAGVEPT EIGQVVFGNI IHTDGHDHYL ARVAGVKGGL PVDVPALTLN RLCGSGLQAI ISAAQTIMLG DADAAVAGGA ESMSRSPYWA HAMRWGARMN DVAMVDAMVA ALSDPFDDVH MGVTAENVAR KWEITREDQD ALAVESHKRA AAAIAEGRFK DQILPVEIKV KGGVQMFDTD ESVRPDTSLE KLAKLRPVFD KQGTVTAGNA SSINDAAAAV VLMERSVAEQ RGYKPMGRLV GYSVVGVDPK YMGIGPVPAV RKVLERTGLS IDDIDLFELN EAFAAQALAV IRELDLPMEK VNPNGSGISL GHPIGATGAI LTVKALYELQ RTGGRYACVT MCIGGGQGIA AIFERI
|
| |