Gene Caul_4092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4092 
Symbol 
ID5901554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4443489 
End bp4444982 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content69% 
IMG OID641564612 
Producttryptophan halogenase 
Protein accessionYP_001685714 
Protein GI167648051 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.683477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA ACAGGATCAA GTCCGTGGTC GTGGTCGGCG GCGGCACGGC CGGCTGGATG 
AGCGCGGCGC TGCTGGCCCG CGCCCTGGGC GGGACGGTCG ACATCAGGCT GGTCGAATCC
GAAGAGATCG GCACGGTGGG CGTCGGCGAG GCCACGATCC CGCAGATCCG TAACTTCAAC
GCCTTCCTCG GCCTGGACGA GAACGCCTTC CTCGCCGCGA CGCAAGGCAC GATCAAGCTG
GGCATCGAAT TCATCGACTG GCGCGCGCCC GGCCAGTCCT ACATCCATGC GTTCGGCGAG
ATCGGCCGGC AGTTGGGCGC GGTCCCCTTC CACCACTATT GGCTGGCCGG CCGCCTGAAG
GGCGACGATC ACCCGTTGTG GGACTATTCG CTGAACGCCC AGGCCGCCAA GGCCGGCCGC
TTCGGCTGCG CCGCCGGCGC GCCGCCGACC GAGGCGCTGA CCTACGCCTT CCAGTTCGAC
GCCGCCCTCT ATGCCGGCCA TCTGCGCGCC TATGCCGAAC ACCACGGGGT GGCGCGCACC
GAAGGGCGGA TCCTGGGCGC GAACCTGCGC GGCGTCGACG GCCTGGTGGA GTCGGTGACG
CTGGAGAGCG GCGAGGTCGT GGCCGGGGAC TTCTTCATCG ACTGCTCGGG CTTTCGCGGC
GTGCTGATCG AGCAGGCGCT GCGGACGGGC TATGAGGACT GGTCGTCCTA CCTGCCGTGC
GACCGCGCCA TCGCCGTACC GACCGCCAAT GTCGGCCCGC CGCGCCCCTA CACCCAGGCC
TTCGCGCGTT CGGCGGGCTG GCAATGGCGC ATCCCGCTGC AGCATCGCAC CGGCAACGGC
CACGTCTTCT GCAGTCGTTT CATCAGCGAG GACGAAGCGG TCGGCCAGTT GATGGCCAAT
CTCGAGGGCG AAGCCCTGGC AGAGCCGCGC ACCCTGAAAT TCGTCACCGG GCGCCGCAAG
GTGTTCTGGA GCAGAAACGT CCTGGCCCTG GGCCTGTCCA GCGGCTTCAT GGAGCCGCTG
GAATCGACCA GCATCCACCT GATCCAGTCA GGCCTGTCGC GCCTGCTCAA CCTCTTCCCC
GACAAGGCCT TCGCCCAGCG CGACATCGAC GAGTACAACC GCCAGGCCGG GCTGGAATTC
GAGCGCATCC GCGATTTCCT GGTGCTGCAC TACTGGGCCA ACCAGCGCGA CGAGCCATTC
TGGCGCGCCT GCCGCGAGAT GGCGGTTCCG CCCGAACTGA CCCGCAAGGT CGAGCTCTTC
CGCGCCCGCG GCCGACTGTT CCGCGAGCCG GAGGATCTGT TCCTCGAAGC CAGCTGGCTG
CAGGTTCTGG TCGGCCAGGG CGTGCTGCCG GAGCGCTGCC ACCCGATGAC CGGAATGATC
ACCGACCCGC AGCTACAGGG CTTCCTGGCG GACCTGCGCA AGATCACCGC CGACTGCGCC
GCCGCCCTGC CCGCCCATGC CGACTTCATC CGCCAGCACG CCGCCGCCCG CTGA
 
Protein sequence
MTDNRIKSVV VVGGGTAGWM SAALLARALG GTVDIRLVES EEIGTVGVGE ATIPQIRNFN 
AFLGLDENAF LAATQGTIKL GIEFIDWRAP GQSYIHAFGE IGRQLGAVPF HHYWLAGRLK
GDDHPLWDYS LNAQAAKAGR FGCAAGAPPT EALTYAFQFD AALYAGHLRA YAEHHGVART
EGRILGANLR GVDGLVESVT LESGEVVAGD FFIDCSGFRG VLIEQALRTG YEDWSSYLPC
DRAIAVPTAN VGPPRPYTQA FARSAGWQWR IPLQHRTGNG HVFCSRFISE DEAVGQLMAN
LEGEALAEPR TLKFVTGRRK VFWSRNVLAL GLSSGFMEPL ESTSIHLIQS GLSRLLNLFP
DKAFAQRDID EYNRQAGLEF ERIRDFLVLH YWANQRDEPF WRACREMAVP PELTRKVELF
RARGRLFREP EDLFLEASWL QVLVGQGVLP ERCHPMTGMI TDPQLQGFLA DLRKITADCA
AALPAHADFI RQHAAAR