Gene Strop_1359 details

Gene Information

Locus tag: Strop_1359
Symbol:
ID: 5057812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1528107
End bp: 1529312
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 74%
IMG OID: 640473628
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001158204
Protein GI: 145593907
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
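
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1528107
END_BP = 1529312
REPORTED_GENE_LENGTH_BP = 1206
REPORTED_PROTEIN_LENGTH_AA = 401


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)

# Compare the derived values with the reported fields.
print(length_bp == REPORTED_GENE_LENGTH_BP)
print(length_aa == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 1206 bp is 402 codons, and removing the stop codon leaves 401 residues.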


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.755858
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTGG CATATCTGGT GGCTGGTGTC CGTACTCCGA TCGGGAGGTA TGCCGGGGCG 
CTCGCCGGCG TCCGCCCCGA TGACCTGGCC GCGCATGTGA TCCGGGAGCT GCTCGCCCGG
CACCCGACGG TGGACTGGGC CCGTACCGAC GACGTGATCC TCGGCTGCGC GAACCAGGCC
GGTGAGGACA ACCGCAACGT GGCCCGGATG GCGGCGCTGC TCGGCGGGCT GCCCGAGCAG
GTGCCCGGGA GCACGGTCAA CCGGCTCTGC GGCTCCGGCC TGGACGCCCT CGCCATCGCC
GCCCGCTCCA TCGTCGCCGG TGAGGCCGAC CTGGTGGTGG CCGGCGGGGT GGAGAGCATG
AGCCGGGCGC CGTTCGTGTT ACCCAAGGCT GAGACCGCGT TCTCCCGCAA CGCGGAGGTC
TACGACACCA CCATCGGCTG GCGGCTGGTC AACCCGGTGC TGGAGCAGGG GTGGGGCATC
GACTCGATGC CGGAGACCGC GGAGAACGTC GCTGCCGAGT ACGGCGTCGC GCGTGCCACG
CAGGACGAGT TCGCGTACCG CTCCCAGCAG CGCGTGGCGC AGGCGCAGGC CGACGGCCGG
TTCGCCGAGG AGATCGTGCC GGTGCCCGCT CCCGCCGGCC GGCGGGGGAC GACGGTGGTC
GAGGTCGACG AGCATCCGCG GGAGACGTCG CTGGCGAAGC TGGCCGCGCT GCCCACCCCG
TTCCGGGTGG GGGGCACGAT CACCGCCGGC AACTCTTCCG GCGTCAACGA CGGCGCGGTG
GCGCTGCTGG TGGCGTCCGA GGCAGCACTC ACGCGGTACG ACCTGACCCC GTTGGCCCGG
GTCGTCGGCT CCGCCGCGGC CGGTGTGTCG CCACGGGTGA TGGGCGTCGG CCCGGTGCCG
GCCACCCGCC GGCTCCTCGA CCGGCACGGT CTGGGGGTGG GCGATCTGGA CGTGGTCGAG
CTGAACGAGG CGTTCGCCGC GCAGGCGGTG GCCGTCTTGC GGGAACTGGG CCTGCCGGAG
GACGCCGAGC ATGTCAATCC CAACGGGGGC GCGATCGCGT TGGGGCATCC GCTCGGCGCG
AGTGGGGCCC GGCTGGCGCT GACCGCCGCC CTGGAGTTGC GTCGCCGGGG CGGCCGGCGG
GCGCTGGCCA CCATGTGCGT CGGCGTGGGC CAGGGCATCT CGCTGCTGTT GGAGTCCGTG
GGGTGA
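
The gene sequence is displayed in space-separated blocks of 10 bases, so the blocks have to be joined before any computation such as GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database. Only the first display line is used here, so the printed value need not match the 74% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C, ignoring display whitespace."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence above (60 bases).
first_line = "ATGACCGTGG CATATCTGGT GGCTGGTGTC CGTACTCCGA TCGGGAGGTA TGCCGGGGCG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1206-base sequence to `gc_fraction` would give the figure to compare against the GC content field.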
 
Protein sequence
MTVAYLVAGV RTPIGRYAGA LAGVRPDDLA AHVIRELLAR HPTVDWARTD DVILGCANQA 
GEDNRNVARM AALLGGLPEQ VPGSTVNRLC GSGLDALAIA ARSIVAGEAD LVVAGGVESM
SRAPFVLPKA ETAFSRNAEV YDTTIGWRLV NPVLEQGWGI DSMPETAENV AAEYGVARAT
QDEFAYRSQQ RVAQAQADGR FAEEIVPVPA PAGRRGTTVV EVDEHPRETS LAKLAALPTP
FRVGGTITAG NSSGVNDGAV ALLVASEAAL TRYDLTPLAR VVGSAAAGVS PRVMGVGPVP
ATRRLLDRHG LGVGDLDVVE LNEAFAAQAV AVLRELGLPE DAEHVNPNGG AIALGHPLGA
SGARLALTAA LELRRRGGRR ALATMCVGVG QGISLLLESV G
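
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the usual TCAG-ordered amino acid string; translation table 11 assigns the same amino acids to codons as the standard code, and differences in alternative start codons are ignored here since this gene starts with ATG. The `translate` function is an illustrative helper, not something provided by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGACCGTGG CATATCTGGT GGCTGGTGTC"))  # MTVAYLVAGV
print(translate("GAGTCCGTGGGGTGA"))                   # ESVG
```

Both fragments agree with the start (MTVAYLVAGV) and the end (ESVG) of the protein sequence listed above.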