Gene Csal_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0301 
Symbol 
ID4025957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp339824 
End bp341029 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content69% 
IMG OID637965451 
ProductBeta-ketoadipyl CoA thiolase 
Protein accessionYP_572363 
Protein GI92112435 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0179307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATG TTTGGTTGTG TCACCCCACG CGTTCCGCCG TGGGACGTTT CGGCGGCTCG 
CTGGCCAGCG TCCGCCCCGA CGACCTGGCG GCCACGATTT TCCGGGCCGT GCTGGCCCAG
GCGCCGGACC TCGATCCGGC GGCCATCGAT GAAGTGATCA TGGGCTGCGC CAACCAGGCC
GGCGAGGACA ATCGCAACGT CGCGCGCATG TCGGCGCTGC TGGCCGGGCT GCCGACCAGC
GTGCCCGGCA CCACCTTGAA TCGCCTGTGC GGGTCGGGCA TGGATGCCGT GGGGACGGCA
TTCCGCGCCA TCAAGGCGGG CGAGTTCGAG CTGGCGCTGG CGGGGGGCGT GGAGTCCATG
TCGCGCGCCC CGTTCGTGAT GGGCAAGGCC GAGACGGCAT TCTCGCGCAG CCAGGCGATC
GAGGACACGA CCATCGGCTG GCGTTTCGTC AACCCGCTGA TGAAGAACCA CTACGGCGTC
GAGTCGATGC CAGAGACCGC CGAGAACGTG GCCGAGCAGT TCCACGTGTC CCGCGAGGAT
CAGGATGCCT TCGCCGTGCG CTCGCAGCAT AAGACCGAGC GCGCCCAGCA GTCCGGGCGT
CTGGCGCAGG AGATCACGCC GGTCGAGGTG CCGCGTCGCA AGCAGGAGCC CCTGATCGTC
GATCGGGATG AGCATCCGCG CGCAGGCACC ACGCTGGAAA AGCTGGCCAA GTTGCCCACG
CCGTTTCGCG ATGGCGGCAG CGTGACCGCG GGCAATGCGT CGGGCGTCAA CGACGGGGCG
GCGGCGATGC TGGTTGCCAG CGACGCCGCC GTCGCGCGCC ATGGTCTCAC CCCGATGGCG
CGGATCCTGG GCATGGCCAC CGCCGGCGTC GAACCCCGCA TCATGGGCAT GGGCCCGGTA
CCGGCGACGC GCAAGCTGCT CGAGCGTCTG GGTATCGGCA TCGATGAGGT CGACATCATC
GAGCTCAACG AAGCCTTTGC CGCCCAGGGG CTGGCATGCC TGCGCGAACT AGGCGTGGCC
GATGACGATC CGCGCGTCAA TCCCAATGGC GGCGCGATCG CGCTGGGGCA TCCGCTGGGC
ATGTCCGGGG CGCGGCTGCT GCTGACCGCG GCGCACGAGC TGCACGTACA GAACAAGCGT
TACGCCCTTT GCACCATGTG TGTCGGCGTG GGCCAGGGCG TGGCCACTTT GATCGAACGC
GTTTGA
 
Protein sequence
MSDVWLCHPT RSAVGRFGGS LASVRPDDLA ATIFRAVLAQ APDLDPAAID EVIMGCANQA 
GEDNRNVARM SALLAGLPTS VPGTTLNRLC GSGMDAVGTA FRAIKAGEFE LALAGGVESM
SRAPFVMGKA ETAFSRSQAI EDTTIGWRFV NPLMKNHYGV ESMPETAENV AEQFHVSRED
QDAFAVRSQH KTERAQQSGR LAQEITPVEV PRRKQEPLIV DRDEHPRAGT TLEKLAKLPT
PFRDGGSVTA GNASGVNDGA AAMLVASDAA VARHGLTPMA RILGMATAGV EPRIMGMGPV
PATRKLLERL GIGIDEVDII ELNEAFAAQG LACLRELGVA DDDPRVNPNG GAIALGHPLG
MSGARLLLTA AHELHVQNKR YALCTMCVGV GQGVATLIER V