Gene Caul_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1847 
Symbol 
ID5899302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1967178 
End bp1968848 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content71% 
IMG OID641562337 
Productcarboxylesterase type B 
Protein accessionYP_001683474 
Protein GI167645811 
COG category[I] Lipid transport and metabolism 
COG ID[COG2272] Carboxylesterase type B 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.105867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0362661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCT TCATTCGCGA CAGAGGGATC GACCGGCGTC GGTTGCTGGC TGGTGGGGCC 
GCCGCCGCCG GCGCCCTGGC CGCGCCCTTG GTCGCGCGCG CCGCCGACGC GCCGTCCGAG
TCCGTGGAGA TCGCGATCGC CGAAGGGCGC CTGCGCGGCC TGCGCACCGG CGGGGTCGAC
GTCTACAAGG GCGTGCCCTA CGGGGCGAGC GTCTCCGGGG CTGGCCGGTT CAAGCCCGCC
AACCCCGTCG CGCCCTGGAC CGGAGTGCGG GACGCCACGC GGCTCGGCAC GCCGTCGCTG
CAGGATCCCG GCACCGTCTA CGGCGTGAAC GAGCCGGCGC CGGGCGAGGA TTGCCTGGTG
CTGAACCTCT GGACCCCGGC CGGCGGCGGT AAGGGCAAGC CTGTGATGTT CTACAGTCAC
GGCGGCGGCT ACACGACCGG TTCGGGCGGC AGCACGGCCC AGGACGGCTC GAACCTGGCG
CGCGAGCACG ACGTGGTGGT GGTGTCGACC AATCACCGAT TGGGCGTGCT GGGCTATCTC
TACATGGGCG AACTGGGCGG GGCCGAATAC GCCGGCTCGG GCAACCAGGG CCTGTCCGAC
ATCGTGCTGG CGCTCAAGTG GGTGGCTCGC AACATCGCCG CGTTCGGAGG CGACCCCGGC
AATGTGACGA TCTTCGGCGA GTCCGGCGGC GGGGCAAAGA CCTCCTGCCT CTACGCCATG
CCCTCGGTCG CGCCGCTGTT TTCCAAGGCG ATCATCCAGA GCGGGCCGGT GGTGCGGGTG
ACGACCCCGG ACGTCGCCGC CCAGACGACA CGCATGTTCC TCGAACAACT GGGGATCGCC
CCGGCCGACT GGCGCAAGGT GCTCGATGTT CCCGCGCCAC AGATCCTGGC CGCGCAGAAG
GCGCTGAACG CCAAGGTGAA GAGCGACAGC GGCGGCTGGC GGGGGATCCA GTCGCTGACG
CCCGGCACGT ACGGGCCGAT CGTCGATGGC GGCCTGCTGC CGCGCCATCC GTTCGATCCC
GCCGCCCCGG CCAGCGCCGC CGACAAGCCG CTGATGATCG GCTGGCTTGA CAGCGAAGCG
GCCTTTTTCG CCTGGACGGG CAAGGACGTC GAGGCTTTCA GGCTGGACGA GGCGGGGCTG
AAGGCGCGGC TGTCGGGCAA GTTCGGCGAC AAGGCCCAGA CATTAATCGA CACCTACCGG
GCCGACCGTC CCGGCGCGAC GCCGAGCGAC ATCTATCTGG CGGCGGCCAG CTTCCACGCC
ATGGGCGCCG GCTCGGTGGT CACCGCCGAG CGCAAGGCCA CGCAGGGCAG GGCGCCGGTC
TATGTCTACA ACATCGCCTA TCGCTCGAAC CGCAAGATGG ACGGGACCGA CATCGAGCTG
GGGGCCATGC ACGCCAGCGA CATCCCCCTG GTGTTCAACA CCGTCGCCTC GCCCACCACC
CTGGCCGGAG ACCGCGCCGA CCGTTTCGCG GCGGCGCGCA ACGTCAGCAC GATGTGGGCC
AATTTCGCGC GGACCGGACG GCCGGCGGCG CCGGGCCAGC CGGTCTGGCC CGCCTACGAC
CTGAAGACGC GACAGACGAT GGTGCTCGAT GTCGGCTGCA GCGTGGTTCC AGACCGGTTC
GGGGCGGAGC GGAAGCTGTG GGCGCGGTTG GATCCGCCGA CCGCGGGATG A
 
Protein sequence
MTSFIRDRGI DRRRLLAGGA AAAGALAAPL VARAADAPSE SVEIAIAEGR LRGLRTGGVD 
VYKGVPYGAS VSGAGRFKPA NPVAPWTGVR DATRLGTPSL QDPGTVYGVN EPAPGEDCLV
LNLWTPAGGG KGKPVMFYSH GGGYTTGSGG STAQDGSNLA REHDVVVVST NHRLGVLGYL
YMGELGGAEY AGSGNQGLSD IVLALKWVAR NIAAFGGDPG NVTIFGESGG GAKTSCLYAM
PSVAPLFSKA IIQSGPVVRV TTPDVAAQTT RMFLEQLGIA PADWRKVLDV PAPQILAAQK
ALNAKVKSDS GGWRGIQSLT PGTYGPIVDG GLLPRHPFDP AAPASAADKP LMIGWLDSEA
AFFAWTGKDV EAFRLDEAGL KARLSGKFGD KAQTLIDTYR ADRPGATPSD IYLAAASFHA
MGAGSVVTAE RKATQGRAPV YVYNIAYRSN RKMDGTDIEL GAMHASDIPL VFNTVASPTT
LAGDRADRFA AARNVSTMWA NFARTGRPAA PGQPVWPAYD LKTRQTMVLD VGCSVVPDRF
GAERKLWARL DPPTAG