Gene Rxyl_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2454 
Symbol 
ID4115523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2468473 
End bp2469783 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content73% 
IMG OID638037234 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_645193 
Protein GI108805256 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.363311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACG CGCGAAGAGA GGAGGGGCGG CGCGTCGTCC TGGTGGACGG CGTGAGGACG 
CCCTTTATGC GGGCGGGGAC CGCCTACCTG AGCCAGACGT CCTACGACCT GGCGCGCACC
GTGCTGCGGG GCCTCCTGGA GCGCGCGGGC GTCGCGCCGG AGAGCGTGGG CTACGTCGTG
ATGGGCACCG TGATCCAGAA CATCAGCACC TCCAACGTCG CCCGCGACGC CGCGCTCGCC
GCGGGGCTGC CGAGCTCGGT GCCCGCCCAC ACGGTGACCA TGGCCTGCAT CTCCGCCAAC
CAGGCCATAA CCAGCGCCCT GGAGACCATC CGGGCCGGCA AGGCGGAGGC CGCCGTCGCC
GGCGGGGTCG AGGTGATGTC CGACACCCCG CCCCAGTTCG GCAAGAAGGC GCGCGAGAGG
CTCTTCCTGG CGCAGGGCTA CAGGTCCCCC CTGGAGTTCC GGAGGCTGCT TTCGGGGATG
AGCCCGAGCG ACTTCCTCCC CCGCGCTCCC GCCATCTCCG AGTACTCCAC CGGGGAGCTG
ATGGGGGAGA GCGCCGACCG GCTCGCCGCG GCCTTCGGGG TGAGCCGCGA GGAGCAGGAC
GAGTACGCCC TGCGCACCCA CACCCTGGCC GCCAAGGCCA CCGAGAGCGG CAGGCTCGCC
GCCGAGATAC TGCCGACCGC ACCCCCGCCG GACTTCGAGC TCCTGACCGA GGACAACACC
ATCCGCAGGG ACACCAGCCT GGAGAAGCTC GCGGCGCTCC CCCCGGCGTT CGTGAAGCCG
CTCGGCACGG TCACCGCGGG CAACTCCTCG CCGCTCACCG ACGGGGCGGC GGCCACCCTC
CTCATGGAGG AGGATGCTGC CCGCGCCGCC GGCCACGAGC CGAAGGCGCG CCTCGCGGAC
TACCTCTACG TCGCCCAGGA CCCGGGCGAG GAGCTCCTGC TCGGCCCGGC CTACGCCGTC
CCCAGGCTGC TCGAGCGCAA CGGCCTCTCC CTCTCCGACA TAGACGTCCT GGAGATCCAC
GAGGCCTTCG CCGGGCAGGT GCTCGCGGTG CTGCGCGCGC TCGAGTCGGA CCGCTTCGCC
CGCGAGAGGC TCGGCCTCGG GCGGCGCGTG GGCGAGGTGG AGATGGAGCG GGTGAACGCC
TGGGGCGGGT CGGTCTCGCT CGGGCACCCC TTCGGGGCCA CCGGGGCGCG GCTCGTGACC
ACCGCGGCCA ACCGGCTGCG GGAGGAGGAC GGGCGCTTCG CCGTCGTGAC CGCCTGCGCC
GCCGGGGGGC TGGGACACGC CATGCTCGTC GAACGCATCG GGGAGGGCTA G
 
Protein sequence
MTDARREEGR RVVLVDGVRT PFMRAGTAYL SQTSYDLART VLRGLLERAG VAPESVGYVV 
MGTVIQNIST SNVARDAALA AGLPSSVPAH TVTMACISAN QAITSALETI RAGKAEAAVA
GGVEVMSDTP PQFGKKARER LFLAQGYRSP LEFRRLLSGM SPSDFLPRAP AISEYSTGEL
MGESADRLAA AFGVSREEQD EYALRTHTLA AKATESGRLA AEILPTAPPP DFELLTEDNT
IRRDTSLEKL AALPPAFVKP LGTVTAGNSS PLTDGAAATL LMEEDAARAA GHEPKARLAD
YLYVAQDPGE ELLLGPAYAV PRLLERNGLS LSDIDVLEIH EAFAGQVLAV LRALESDRFA
RERLGLGRRV GEVEMERVNA WGGSVSLGHP FGATGARLVT TAANRLREED GRFAVVTACA
AGGLGHAMLV ERIGEG