Gene RoseRS_4348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4348 
Symbol 
ID5211332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5464182 
End bp5465372 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID640597931 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001278635 
Protein GI148658430 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.430233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0132867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA ATGGACGCGA CGTAGTGGTG CTGAGTGGTG TCCGCACCGC GATCGGCAAT 
TTTGGCGGCA GTCTCAAGGA TCAACCGCCG AGCGAACTGG CGGCGCAGGT CGTGCGTGAA
GCGGTCAGGC GCGCGGGTGT CGAGCCGACG GAAATCGGGC AGGTTGTGTT TGGTAATATC
ATCCACACCG ACGGGCACGA CCACTATCTG GCGCGGGTTG CAGGGGTCAA GGGCGGCTTG
CCGGTGGACG TTCCGGCGTT GACGTTGAAT CGCCTGTGCG GCAGTGGCTT GCAGGCGATC
ATCTCGGCAG CGCAGACAAT CATGCTCGGC GATGCCGATG CCGCCGTCGC TGGCGGCGCC
GAGTCGATGA GTCGCAGCCC ATACTGGGCG CATGCGATGC GCTGGGGCGC GCGGATGAAT
GATGTTGCGA TGGTCGATGC AATGGTAGCG GCGCTCAGCG ATCCGTTCGA TGATGTGCAC
ATGGGCGTAA CAGCTGAGAA TGTCGCCCGG AAGTGGGAGA TTACTCGCGA GGATCAGGAT
GCGCTGGCTG TTGAAAGTCA TAAACGCGCT GCCGCTGCCA TTGCGGAAGG GCGTTTCAAG
GATCAAATTC TGCCCGTTGA GATCAAGGTC AAGGGCGGGG TTCAGATGTT TGATACCGAT
GAAAGCGTGC GCCCTGACAC AAGTCTTGAG AAGCTTGCCA AACTGCGTCC GGTCTTCGAC
AAGCAGGGAA CCGTGACCGC CGGTAATGCA TCGAGCATCA ATGATGCTGC GGCTGCTGTG
GTGTTGATGG AACGCAGTGT TGCCGAACAG CGCGGCTACA AACCGATGGG TCGTCTGGTG
GGGTACAGCG TTGTCGGCGT CGACCCGAAG TATATGGGCA TCGGTCCGGT TCCGGCAGTG
CGCAAGGTGT TGGAGCGCAC CGGACTGAGC ATCGATGACA TCGATCTGTT TGAACTGAAC
GAGGCGTTCG CGGCGCAGGC GCTCGCCGTC ATCCGCGAGC TTGATCTACC AATGGAGAAG
GTCAATCCGA ACGGCAGCGG CATTTCGCTC GGTCACCCGA TTGGCGCAAC CGGCGCGATA
CTGACGGTGA AGGCGCTCTA CGAGCTGCAA CGCACCGGTG GTCGCTACGC CTGCGTCACC
ATGTGCATCG GCGGCGGTCA GGGCATCGCT GCGATCTTCG AGCGGATATA G
 
Protein sequence
MTANGRDVVV LSGVRTAIGN FGGSLKDQPP SELAAQVVRE AVRRAGVEPT EIGQVVFGNI 
IHTDGHDHYL ARVAGVKGGL PVDVPALTLN RLCGSGLQAI ISAAQTIMLG DADAAVAGGA
ESMSRSPYWA HAMRWGARMN DVAMVDAMVA ALSDPFDDVH MGVTAENVAR KWEITREDQD
ALAVESHKRA AAAIAEGRFK DQILPVEIKV KGGVQMFDTD ESVRPDTSLE KLAKLRPVFD
KQGTVTAGNA SSINDAAAAV VLMERSVAEQ RGYKPMGRLV GYSVVGVDPK YMGIGPVPAV
RKVLERTGLS IDDIDLFELN EAFAAQALAV IRELDLPMEK VNPNGSGISL GHPIGATGAI
LTVKALYELQ RTGGRYACVT MCIGGGQGIA AIFERI