Gene Caul_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1039 
Symbol 
ID5898494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1097322 
End bp1098926 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content71% 
IMG OID641561521 
Productcarboxylesterase type B 
Protein accessionYP_001682667 
Protein GI167645004 
COG category[I] Lipid transport and metabolism 
COG ID[COG2272] Carboxylesterase type B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.43547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCACC TGAATGTCGA TCGTCGCGGC CTGATCGCCG GCGCGGGCGC GGCCCTCGCC 
CTGGCCTCGG CCCCCGCCCT GGCCGCCAAG CCCGGTCCGG TCGTAACCAC CACCGCCGGC
AAGGTGCGCG GCGCCGTCGG TAACGGCGTC CAGGTCTTCA AGGGCCTGCG CTATGGAGCC
GACACCGGCG GCGCGGGGCG CTTCATGCCG CCCCAGCCGC CCAAGCCCTG GACCGGCGTC
GCCGACGCCC TGGCCTATGG CGCGGCCTGC CCGCAGGGCA AGGGCGAGGA CGGCGAGGCG
CTGAGCGAGG ACTGCCTGTT CCTCAATGTC TGGACGCCGG CGGCCGGCAA GACCTTGGCC
GACGGCGTCA AGCGGCCGGT GATGTTCTAC ATCCACGGCG GCGCCTACAA CACCGGCTCG
GGGGCCAGCC CCTGGTACGA CGGGACCAAG CTGGCCAAGC GCGGCGACGT GGTGGTGGTG
ACGGTCAACC ACCGGCTCAA CGCCTTCGGC TACCTCTATC TGGCCCGCCT GTTCGACGAT
CCGAGCGTGG CCGACAGCGG CAATGTCGGC CAGATGGACC TGGTCCTGGC CCTGCGATGG
GTGCGCAACA ACATCGCCGC ATTCGGCGGC GATCCCGGCA ATGTCATGCT GTTCGGCCAG
TCGGGCGGCG GCGCCAAGAT CGCCACCCTG ATGGCCATGC CGACCGCCAA GGGCCTGTTC
CACCGCGCCG CCACCATGAG CGGCCAGCAG GTCACCGCCG CCGGCCCGTT CAACGCCACC
AGGCGCGCCA AGGGCTTCAT CGACAAGCTG GGGATCAAGG ACCTGGCCGC CCTGCGCGCC
CTGCCCGCCG AGACCTTCCT GGCGGGCCTG AAGGCCGTCG ACCCGATCGC CGGCGGCGGC
GGGGTCTATA TGGGTCCGGT GCTGGACGAG CGGTCGCTGA CCCGTCACCC GTTCTTCCCC
GACGCCGCGC CGCAGAGCCT GGACGTGCCG ATGATGGTCG GCAACACCCA TGACGAGACG
CGCGGCTTCG TCGGCTACGA CAAGAGCCTG CTGGACATGA CCTGGGACCA GGTGATCGCC
AAGCTGCCCA GCCAGTTCAA CGCCCGCATC GACATCGACC CGGCCACAGT GGTGGCGGCC
TATCGCAAGA TCTATCCGAG CTACTCGCCG TCCGACGTCT ATTTCGCGGC CAGCACGGCC
GGCCGCTCCT GGAAGGCGGC GATCATCCAG GACGAGGAAC GCGCCCAGGC CGGCGCTCCG
GCCTGGGCCT ACCAGCTGGA CTGGCGCGCG CCCAAGGACG GCGGCGCCTG GGGCGCGCCC
CATACGCTGG ACATGCAGCT GGTGTTCGGC AACTTCGACG CGCCCGGCGT GATCACCGGC
TCGGGTCCGG ACGCCGTGGC GCTGCACGAG CGGCTGGCGG ATGCGTTCAT CGCCTTCGCG
CGGACGGGGA ACCCCAACTG CGCGGCGATC CCGAAGTGGG AACCCTACAC CCTGCCCCGC
CGCCAGACCC TGGTGTTCGA CAACACCACC CGGATGGAAG ACGACCCGCG CGGCGCCGAG
CGCGAGCTGT TCAACAAGGT CCCGTTCACG CAGTTCGGGA CCTGA
 
Protein sequence
MDHLNVDRRG LIAGAGAALA LASAPALAAK PGPVVTTTAG KVRGAVGNGV QVFKGLRYGA 
DTGGAGRFMP PQPPKPWTGV ADALAYGAAC PQGKGEDGEA LSEDCLFLNV WTPAAGKTLA
DGVKRPVMFY IHGGAYNTGS GASPWYDGTK LAKRGDVVVV TVNHRLNAFG YLYLARLFDD
PSVADSGNVG QMDLVLALRW VRNNIAAFGG DPGNVMLFGQ SGGGAKIATL MAMPTAKGLF
HRAATMSGQQ VTAAGPFNAT RRAKGFIDKL GIKDLAALRA LPAETFLAGL KAVDPIAGGG
GVYMGPVLDE RSLTRHPFFP DAAPQSLDVP MMVGNTHDET RGFVGYDKSL LDMTWDQVIA
KLPSQFNARI DIDPATVVAA YRKIYPSYSP SDVYFAASTA GRSWKAAIIQ DEERAQAGAP
AWAYQLDWRA PKDGGAWGAP HTLDMQLVFG NFDAPGVITG SGPDAVALHE RLADAFIAFA
RTGNPNCAAI PKWEPYTLPR RQTLVFDNTT RMEDDPRGAE RELFNKVPFT QFGT