Gene Caul_3290 details


Gene Information

Locus tag: Caul_3290
Symbol:
ID: 5900745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3561450
End bp: 3563486
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 68%
IMG OID: 641563796
Product: putative D-(-)-3-hydroxybutyrate oligomer hydrolase
Protein accession: YP_001684915
Protein GI: 167647252
COG category:
COG ID:
TIGRFAM ID:
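
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3561450
END_BP = 3563486

def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1

def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Length in aa, assuming the CDS is a whole number of codons."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    codons = gene_len // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2037
print(protein_length(length_bp))  # 678
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.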


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG CCAAGTCTGC TTCGCGGCTA CGCCGCTGGA CGAGACTGGC CGGTTTGGCC 
GTATCGACCC TGGCGGTCGC GGCGAGCACT TCCTTTGTCG ACGCCGCGCC GGCAAGCTTG
GCCAAACCTC CAACCTGGCT CGAACTTGGA CCCAGGACCC GGCACGACGG GCCGGCTGAC
GATCTGACGG GACTGGCCGC CAAGCCAGGC CCCTATGCCG ATCCGCTGCA TCCGACAGCC
GCCGAGCTGC GCCGCCAGAC CATGGCTTCG ACGTTCGATA GTGCGGCCGG CTGGGGTCGC
CTACTGGGGC CGACCATCAA TGTCGCCACG GGCCAACCTT ATCCCGACGG CGGGCGCGTC
TCGGGCGAGG AGCAACTGGC CTATGCCCGC GTCGATGGCG CGGTCAGCGC CATGCTGCTG
CAGACCCCTC GAAGCCTGTC GCGGACCCAG CGGTGCATCG TCGCGGTTCC GGTCGCGGGC
TCGTTCAGCC TGTACCACGA CATCAATACT CTGGGTTACT GGGGGCTGCA GCACGGCTGC
GCGGTCGTCT ATACCGACAA GGGCCATGGC GACGGCGTTC ATGATCTGGA CACCGACACC
GTCAACCTGA TCGACGGCAC GCGCGCCGCC GCCAAGGTCG CGGGTAAGGC CTCGCACTTC
ACCGCCGATC TCAGCGAAGC CCGTCGCCGG GTGTTCGTGA AGCAGTGGCC TCATCGCATC
GCGTTCAAGG CGTCCCACTC GAAGCGCAAC CCGGAGAGCC GTTGGGGCGA AGATGTTCTC
AACGCCATCC GCTTTGCCTT CTGGCGGCTG AACGCCGACG ACGGCGGCGG CTGGACCCCG
GCCAACACCC TGGTCATCGT CACCGGCGCC AGCAATGGCG GCGGCGCGAC CCTGTACGCC
GGCGAGGCAG ACACCGACGG GCTGATCGAC GCCGTGGTCG CGGCCGAACC ACAGGTGCAA
TTGCGGCCTG ATCCGGCGGT CTCCGTCGCG CGAGGCAAGA CGGTCCGGCA GGGCGTCGGC
CGAACCCTGC TGGACTACTT CACCTATTTC AATCTCTACG GTCCCTGCGC GGTGCTGGCC
ACGCCCGACG CACCGTGGGC CGGCCAGGTG ACACGAGCGA GTGAGCGCTG CGGGTCTCTG
CGCTTGGCGG GACTTTTGAG CGCCGAAGCT GTTCCGGCCC AGGCCGCCGA GGCGCTGGCT
AGGTTGCGCG ACTACGGGTT CGACCCCGAG ACCGACATCC TGCACGCGGC CAGCTATATG
ATCGGCCCCG ACGCGACCGC CGCCAAATAC GCCAACGACC ATGGCCGATT TGGCGTGGAG
GATCGGCTTT GTGGCTACAG TTTCGCGTCG GTCGACGCGC AGGGCGTTCC TCGCGCCGTT
CCGCCGGCCG AGCTGGCTGA AATCTTCGCC AAGGCGGCTG GCGGCGCGCC ATCGGGCAGC
ATCGATATCG TCAATGACGA TGATCCCCGC GGACCACGCC GCAACAGCCT GTCGATGTCG
CCGAGCAACG GTCGTCTGGA CTACAATTTC GACGGCGCGC GGTGTCTTCG CGAGCTGGCC
GTGGGGAGCT CGGCCAACGC CAGGCGCGTT CAGGCGGGCA TCGCCGAGTT CGCGGCCAAG
GGCGATCTGC ACGGCAAGCC GGCGATCATC GTCCACGGCC GCGCCGACGA CCGGGTTCCC
GCGGCGTTCA GTTCAAGGCC GTATCTGGGT CTCAACAGCC TCAGGGAAGG GGCCGGCAGC
CAACTACGCT ATATCGAGGT CGAGCACGCC GAACATTTCG GGTTCAGCGC CCCGGGTTTC
GACACGCGCT TCGTTCCGTT GACCTACTAT CATCTGCAGG CGCTCGATTG GATGTGGGGA
CGCCTGACGA CCGGCGCGGC GCTGCCCGCC AGCCAGATCG TGCGCACGAC GCCGCGCGGC
GGCGCGCCCG GTTCGGCGCC GCCTCTGAGC CTGGCCAACA TCCCACCGCT GGTGGAAAAG
CCCGCCGCAG CCGACACGAT TTCAGTTCAT CGCGGCTATG TGGCTTTGCC CGATTGA
 
Protein sequence
MADAKSASRL RRWTRLAGLA VSTLAVAAST SFVDAAPASL AKPPTWLELG PRTRHDGPAD 
DLTGLAAKPG PYADPLHPTA AELRRQTMAS TFDSAAGWGR LLGPTINVAT GQPYPDGGRV
SGEEQLAYAR VDGAVSAMLL QTPRSLSRTQ RCIVAVPVAG SFSLYHDINT LGYWGLQHGC
AVVYTDKGHG DGVHDLDTDT VNLIDGTRAA AKVAGKASHF TADLSEARRR VFVKQWPHRI
AFKASHSKRN PESRWGEDVL NAIRFAFWRL NADDGGGWTP ANTLVIVTGA SNGGGATLYA
GEADTDGLID AVVAAEPQVQ LRPDPAVSVA RGKTVRQGVG RTLLDYFTYF NLYGPCAVLA
TPDAPWAGQV TRASERCGSL RLAGLLSAEA VPAQAAEALA RLRDYGFDPE TDILHAASYM
IGPDATAAKY ANDHGRFGVE DRLCGYSFAS VDAQGVPRAV PPAELAEIFA KAAGGAPSGS
IDIVNDDDPR GPRRNSLSMS PSNGRLDYNF DGARCLRELA VGSSANARRV QAGIAEFAAK
GDLHGKPAII VHGRADDRVP AAFSSRPYLG LNSLREGAGS QLRYIEVEHA EHFGFSAPGF
DTRFVPLTYY HLQALDWMWG RLTTGAALPA SQIVRTTPRG GAPGSAPPLS LANIPPLVEK
PAAADTISVH RGYVALPD
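
The sequences above are printed in 10-character blocks separated by spaces and line breaks, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with one common way to compute a GC percentage (G plus C bases over total bases). The helper names are hypothetical, and the record does not state how its own GC content field was calculated.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()

def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Example on the first 10-base block of the gene sequence only.
first_block = "ATGGCTGACG "
print(clean_sequence(first_block))  # ATGGCTGACG
print(gc_percent(first_block))      # 60.0
```

Passing the full gene sequence text to `gc_percent` would give the value for the whole gene, which can then be compared with the GC content field.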