Gene Caul_1696 details

Gene Information

Locus tag: Caul_1696
Symbol:
ID: 5899151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1784405
End bp: 1785607
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 73%
IMG OID: 641562186
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001683323
Protein GI: 167645660
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
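
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1203 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(1784405, 1785607)
print(gene_len)                  # 1203
print(protein_length(gene_len))  # 400
```

Both values match the Gene Length and Protein Length fields of this record.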


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.142195
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG CCTTCATCTG CGACGCCATC CGCACGCCGA TCGGCCGCTA CGGCGGGGCG 
CTGTCCAGCG TGCGGGCCGA CGACCTGGCG GCGCTGTCGA TCAGGGCGCT GATCGCCCGC
AACCCCGGCG TGGACTGGGG CGCGCTGGAC GACGTGGTGC TGGGCTGCGC CAACCAGGCC
GGCGAGGACA ACCGCAACGT CGCGCGCATG GCCGCCCTGC TGGCCGGACT GCCGGCCACC
GCCCCCGGTT CGACCGTCAA CCGGCTTTGC GGATCGGGCC TCGACGCCCT GGGCGTGGCG
GCGCGGGCCA TCAAGGCGGG CGAGGCCCAC CTGATGATCG CCGGCGGCGT CGAGAGCATG
AGCCGCGCGC CGTTCGTGAT GGGCAAGGCC GACAGCGCCT TCTCGCGCAA CGCCGAGATC
TTCGACACCA CCATCGGCTG GCGGTTCGTC AATCCGGCCA TGCGCAAGGC CTATGGCGTC
GACTCCATGC CCGAGACCGC CGAGAACGTC GCCGACGCGT GGAAGGTCAC GCGCGCCGAC
CAGGACGCCT TCGCTCTGCG CAGTCAGGCC CGCGCGGCGG CCGCCCAGGC CTCGGGCCGC
TTCGACGTCG AGATCGCGCC GGTCACCCTG CCGCATCGCA AGGGCGACCC GGTCGTCGTG
TCCAGGGACG AGCATCCGCG CGCCACGACG ATCGAGACGC TGGCGTCGCT GAAACCCATC
GTCCGCCCGG ACGGCACGAT CACCGCCGGC AACGCCTCGG GCGTCAACGA CGGGGCGGCG
GCGCTGATCG TCGCTTCGGA AGCGGCGGCC AAGGCCCATG GCCTGACGCC GCGCGCCCGC
ATCCTGGGCG TCGCCGCCGC CGGGGTGGAG CCGCGCGTCA TGGGGATCGG ACCCGGGCCG
GCGACCCAGA AACTGCTGGC GCGGCTTGGC CTCTCGATCG GCGACATCGA CGTGGTTGAG
CTGAACGAAG CCTTCGCGGC GCAGGGCCTG GCGGTGCTGC GCGACCTGGG CCTGCCCGAC
GACGGCGAGC ACGTGAACCC CAACGGCGGC GCCATCGCCC TGGGTCATCC GCTGGGCATG
AGCGGCGCCC GGCTGGGCCT GACCCTGGTG GAGGAGCTCC ACCGGCGCGG CGCGCGGTAC
GGCCTAGCGA CCATGTGCAT CGGCGTGGGC CAGGGCATCG CGATGGTGGT CGAGCGAGTC
TAG
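
The GC content reported for this gene can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T bases, with whitespace and any other characters ignored. The helper name `gc_content` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among the A, C, G, T characters of seq.

    Whitespace, line breaks and other symbols are skipped, so a blocked
    sequence can be pasted in as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied with its block spacing.
first_row = "ATGACCGACG CCTTCATCTG CGACGCCATC CGCACGCCGA TCGGCCGCTA CGGCGGGGCG"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of `first_row` gives the whole-gene value for comparison with the GC content field.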
 
Protein sequence
MTDAFICDAI RTPIGRYGGA LSSVRADDLA ALSIRALIAR NPGVDWGALD DVVLGCANQA 
GEDNRNVARM AALLAGLPAT APGSTVNRLC GSGLDALGVA ARAIKAGEAH LMIAGGVESM
SRAPFVMGKA DSAFSRNAEI FDTTIGWRFV NPAMRKAYGV DSMPETAENV ADAWKVTRAD
QDAFALRSQA RAAAAQASGR FDVEIAPVTL PHRKGDPVVV SRDEHPRATT IETLASLKPI
VRPDGTITAG NASGVNDGAA ALIVASEAAA KAHGLTPRAR ILGVAAAGVE PRVMGIGPGP
ATQKLLARLG LSIGDIDVVE LNEAFAAQGL AVLRDLGLPD DGEHVNPNGG AIALGHPLGM
SGARLGLTLV EELHRRGARY GLATMCIGVG QGIAMVVERV
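
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may initiate translation. Since this gene starts with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative and do not come from any particular library.

```python
from itertools import product

# Codon table in TCAG order; codon assignments of the standard code,
# assumed here to be shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is dropped.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACCGACG CCTTCATCTG CGACGCCATC"))  # MTDAFICDAI
print(translate("GAGCGAGTC TAG"))                     # ERV
```

The two calls translate the first 30 bases and the final four codons of the gene sequence, reproducing the first ten and last three residues of the protein sequence.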