Gene Caul_5088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5088 
Symbol 
ID5897414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp8339 
End bp9496 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content69% 
IMG OID641555191 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001676522 
Protein GI167621737 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.225671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.910861 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGG CCGTGATCGC GGGCTACGTC CGCACGCCGT TCCATTTCGC GCGCAAGGGC 
GCGTTGATCG GGGTGCGCCC CGATGACCTG GCGGCGATCG CCATCCGGGG CCTGGTCGAT
CGAACAGGCG TCGATCCGTC CAGCCTCGAG GACGTGCTGC TCGGCTGCGC CTATCCTGAG
GCCGAGCAGG GCAACAACAT CGCCCGGGTT GCCGTACTGC TGGCCGGCCT GCCGATCACG
GTCTCTGGCA TGACGGTCAA CCGGTTCTGC GGCTCGTCGA TGTCGGCGAT CCATATCGCC
GCCGGCCAGA TCGCCATCGG CGCCGGCGAG GCCTTTATCT GCGGCGGCGT GGAGTCGATG
ACCCGGGTGC CGCAGGGGGG CTTCAACTTC TCGCCGAACC CGCGCTTCAA GCGTCGCGAC
ATGCCGGACG CCGAGCTGAT GACCGAAGCC TACATCTCCA TGGGCGAGAC GGCCGAGAAC
GTCGCCGAGC AGTACGGCGT CTCTCGCGCC GACCAGGAGA AGCTGGCGTT CGCCTCCCAG
CAGAAGGCCG CCGCCGCCCA GGCCGCCGGC CGCCTCGCCG ACGAGATCGT TCCGGTGGCC
ACCCCAGACG GCCTGGTCGA CATCGATGGC TGCCTTCGCC CCCAAACAAC CCTGGAAGGG
CTCGCCGCGC TGAAGCCCGC CTTCCGCCCG GACGGCACGG TGACCGCCGG GACCGCCTCC
CCCCTGACCG ATGGCGCCTC GGCGGTCCTG GTCTGCACCG AGGATTACGC CCGCGCCGCC
GGCCTGGAGA TCCTGGCCAG GATCCGCTCG GTCGCGTCGG CCGGCGTGGC GCCGGAACTG
ATGGGCATGG GGCCCGTGCC GGCCACGCTC AAGGCGCTTG AACGCGCCGG GCTGTCGATC
GACGACATTG ACGTGGTGGA GATCAACGAG GCTTTCGGCA GTCAGGCGGT AGCCTGTCTC
CGCGAGCTCA AGATCGCCGA GGACAGGGTC AATATCGACG GCGGCGGCAT TTCGATCGGT
CACCCGCTCG GCGCCACGGG CGCGCGCATA ACCGGCAAGG CGGCTCAGAT CCTCAGGCGC
GAAGGAAAGC GTTTCGCCCT GTCGACCCAG TGCATCGGCG GCGGCCAAGG CGTGGCCACG
ATCCTGGAAG CGGTCTGA
 
Protein sequence
MTQAVIAGYV RTPFHFARKG ALIGVRPDDL AAIAIRGLVD RTGVDPSSLE DVLLGCAYPE 
AEQGNNIARV AVLLAGLPIT VSGMTVNRFC GSSMSAIHIA AGQIAIGAGE AFICGGVESM
TRVPQGGFNF SPNPRFKRRD MPDAELMTEA YISMGETAEN VAEQYGVSRA DQEKLAFASQ
QKAAAAQAAG RLADEIVPVA TPDGLVDIDG CLRPQTTLEG LAALKPAFRP DGTVTAGTAS
PLTDGASAVL VCTEDYARAA GLEILARIRS VASAGVAPEL MGMGPVPATL KALERAGLSI
DDIDVVEINE AFGSQAVACL RELKIAEDRV NIDGGGISIG HPLGATGARI TGKAAQILRR
EGKRFALSTQ CIGGGQGVAT ILEAV