Gene Caul_1137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1137 
Symbol 
ID5898592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1206797 
End bp1207849 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content65% 
IMG OID641561619 
Productvanillate monooxygenase 
Protein accessionYP_001682765 
Protein GI167645102 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.684708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000140047 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTCCGCCC AGCCGACTTT CCCGTTGAAC GCCTGGTACG CCGCCGGCTG GGATTCCGAG 
ATCAAGCGCG AGCTGCTGCC CCGGACGATC TGCAACAAGA AGATCGTCTT GTTCCGTAAG
GAAAATGGGC AAGCGGTCTG CCTGGAGGAC GCCTGTTGGC ATCGCCTGTT GCCGCTGTCG
ATGGGGCGGC TCAAGGGCGA CGACGTACAG TGCGGCTATC ACGGCCTTGT ATTCAATGAG
CACGGCCGCT GTGTTCGCAT GCCCTCGCAG GAGACGATCA ACCCATCGGC CTGCGTGCGG
AGTTTCCCCC TGGTCGAACG GCACCGGTTC GTGTGGATCT GGCCCGGCGA CCCGGCGCTG
GCGGATCCGG CCCTGGTTCC CGATCTGCAC TGGAACCACG ACCCGGCCTG GGCCGGCGAT
GGCAAGGTGA TCCACGCCAA GTGCGACTAC CGGCTGATCG TCGACAATCT GATGGACCTC
ACTCACGAGA CCTATATCCA CGGATCGAGC ATCGGCAACG ATGCGGTGGC CGAGGCGCCG
TTCGAGGTCA CCACCGGCGA CAAGACCGCC ATGGTCACGC GCTGGATGAT CGATATCGAG
CCGCCGCCGT TCTGGCGCCA ACAGCTGGGC AAGCCGGGTA ACGTCGACCG CTGGCAGATC
ATCCGTTTCG AGGCGCCCTG CACGGTGGCC ATCGATGTCG GGGTGGCCCC GACCGGGACC
GGCGCGCCGC GGGGCGACCG CTCGCAGGGC GTCAGCATGG TGGTGATCAA CACCATCACC
CCGGCTACGG ACAAGACTTG TCACTACTTC TGGGCCAATG TGCGCGACTA TCAGCTGGGC
GAGCAGAAGG TGACCACCCA GATCCGTGAG GCGATCACCA AGGTGTTCGC CGAGGACGAG
GTTATCGTCG AGGCCCAGCA GCGGGCGATC GACGACCATC CCGACCACGT GTTCTACAAC
CTCAACATCG ACGCCGGCGC CATGTGGGCC AGGCGGCTGA TCGATCGGAT GGTCGCCGCC
GAGGCTCCGC CCGTCGCGAT CGCGGCGGAG TAG
 
Protein sequence
MSAQPTFPLN AWYAAGWDSE IKRELLPRTI CNKKIVLFRK ENGQAVCLED ACWHRLLPLS 
MGRLKGDDVQ CGYHGLVFNE HGRCVRMPSQ ETINPSACVR SFPLVERHRF VWIWPGDPAL
ADPALVPDLH WNHDPAWAGD GKVIHAKCDY RLIVDNLMDL THETYIHGSS IGNDAVAEAP
FEVTTGDKTA MVTRWMIDIE PPPFWRQQLG KPGNVDRWQI IRFEAPCTVA IDVGVAPTGT
GAPRGDRSQG VSMVVINTIT PATDKTCHYF WANVRDYQLG EQKVTTQIRE AITKVFAEDE
VIVEAQQRAI DDHPDHVFYN LNIDAGAMWA RRLIDRMVAA EAPPVAIAAE