Gene Caul_4301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4301 
Symbol 
ID5901762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4674298 
End bp4675404 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content74% 
IMG OID641564819 
ProductPHB depolymerase family esterase 
Protein accessionYP_001685919 
Protein GI167648256 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3509] Poly(3-hydroxybutyrate) depolymerase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01840] esterase, PHB depolymerase family 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCGC GCCTGCGCCG GGCCGCGACA ATGACCCCTT CGACGACAGC GGGGAACGCG 
GCCCTGGTCG AGACCGCCGC GTTCGGGGAA AACCCCGGCG CGCTGCGCAT GCTGTCCTAT
GCCCCCAGCG GCCTGGCCGC CAACGCGCCG CTTGTCGTGG TGCTGCACGG CTGCACCCAG
ACCGCCGAGG GCCACGCCGT CAACGGCGGC TGGATCGCCC TGGCCGACCG CCTGGGCTTC
GCCGTCCTGG CCCCTGAGCA GGTCGCGGCC AACAACCCCA ACCGCTGCTT CACCTGGTTC
GAGCCGGGCG ACACCACGCG GGGACGGGGC GAGGCGGCCT CGATCGCCGC CATGGTCTTG
GCCGCGATCC GGGTCCACCG CTGCGATCCG CGACGGGTGT TCGTCACCGG ACTGTCGGCC
GGCGGCGCGA TGACGGCCGT AATGCTGGCG GCCTATCCCG AACTGTTCGC CGGCGGGGCG
ATCGTCGCCG GCCTGGCCTA CGGCGTGGCG CGCGGCACGC CCGACGCCCT GCGCCTCATG
GGACGCGGCG ACGGCCGCGC CGCGCCCGTG CTGGGGGATC TGGTTCAGCG TGGCGGCGCC
CCGCCCCTAC GGTTGGCGAT CTGGCACGGC GATGCGGACT ACACCGTCAA CGCCGCCAAC
GCCCAGGATC TGGCCCGGCA ATGGACCACC GCCACCGGCC TGGCCGAAGC GCCCGGCGAG
GTCGCCCGGC AGGGGAACCG GACGCGTTCC GTCTGGCGAG ACGAGGCCGG CGGCTCGATC
GTCGAACTGA ACATCGTCCA TGGTCTGGGC CATGGGACGC CGCTCTCGAC CAAGGGCGAA
GGCGACGTGG GCAAGCCCGC GCCCTACATG CTGGAGGCGG GGCTTTCCTC CACCTTGGAG
ATCGCCGCCT TCTGGGGCCT CTCCAAGGGC GAAGCGGCGA CGGCCCGAGC CGCCCCGCCC
GCCGACGAGG CTGAAACGAC GCCGTCGGCG GACGCCAAGC CGACCGGCGT CGCCGCCCAG
GTTCTGGACG CGGTGTCCGG CCATGTGCCG TCCCAGGTGC GCGACGTCAT CGCCAAGGCC
CTGCGCTCGG CCGGCTTGAT GCGGTGA
 
Protein sequence
MLARLRRAAT MTPSTTAGNA ALVETAAFGE NPGALRMLSY APSGLAANAP LVVVLHGCTQ 
TAEGHAVNGG WIALADRLGF AVLAPEQVAA NNPNRCFTWF EPGDTTRGRG EAASIAAMVL
AAIRVHRCDP RRVFVTGLSA GGAMTAVMLA AYPELFAGGA IVAGLAYGVA RGTPDALRLM
GRGDGRAAPV LGDLVQRGGA PPLRLAIWHG DADYTVNAAN AQDLARQWTT ATGLAEAPGE
VARQGNRTRS VWRDEAGGSI VELNIVHGLG HGTPLSTKGE GDVGKPAPYM LEAGLSSTLE
IAAFWGLSKG EAATARAAPP ADEAETTPSA DAKPTGVAAQ VLDAVSGHVP SQVRDVIAKA
LRSAGLMR