Gene Caul_0296 details

Gene Information

Locus tag: Caul_0296
Symbol:
ID: 5897570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 331678
End bp: 333252
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 68%
IMG OID: 641560780
Product: tryptophan halogenase
Protein accession: YP_001681931
Protein GI: 167644268
COG category:
COG ID:
TIGRFAM ID:
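
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 331678
END_BP = 333252
GENE_LENGTH_BP = 1575
PROTEIN_LENGTH_AA = 524


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon is included."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates span 1575 bp, and 1575 bp corresponds to 525 codons, that is 524 amino acids plus the stop codon, which agrees with the listed lengths.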


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.628963
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGAAAC CGATCGATGA GGTGCTGATC GTCGGCGGCG GGACGGCCGG CTGGATCACC 
GCCGCCTATC TCGCGCGCAA GCTGGGCGCG GCGCGGCCAG ACGGGGTCAG GATCACCCTG
ATCGAGTCCA GCGAGATCGG CATCATCGGG GTGGGCGAGG GCACGATCCC CACCATCCAG
ACGACAATGC GCGAGATCGG GATCGACGAA GCCCGGTTCA TGCGCGGGGC CGGCGCGACC
TTCAAGCAGG GCATCAAGTT CGTCGACTGG ACCACGGCGC CGGTCGGCGG CGCGCACAAT
CACTACTACC ACTCGTTCAG CCGGCCCCAC ACGCTGGGCG GCCTGGACCT GGCCCCCTAC
TGGATGCTGG GCTGCGCGGG CGACGTGTCG TTCTCCGAGG CGGTGACCCT GCAGGACACG
GTCTGCGAGG CCGGCAAGGG TCCCAAGCTG ATCGACGACC CGCAGTATTC CAGCCCGCTC
GGCTACGCCT ATCACTTCGA CGCCGGCAAG CTGGCGACGC TGATGCGCGA CGTCGGCAAG
GCGCTGGGCG TGCGCCACCT GATCGGTAAT GTCGAAGGCG CGCGCCTGGA CGAGTCCGGC
GCCATCGCGG CGATCGTCAC CCGTGAGCAT GGCGAACTGA CCGCCGGTCT CTACATCGAT
TGCAGCGGCT TCTCGGCCAA GCTGATCGGC GAGGCGATGG GCGTTCCGTT CGTCGACGAC
AGCGACGTGC TGTTCGTCAA CCGCGCCGTG GCCATCCAGG TCCCCTACGA CCGGCCCGAC
GCGCCGGTGG CGACGACCAC CCTTTCCACC GCCCACGAAG CCGGCTGGAC CTGGGATATC
GCCCTGCCCG AGCGGCGGGG CGTCGGCTAT GTCTATTCCA ACAACCACAC CAGCGACGAC
CGCGCCGAGG AGATTCTGCG CGCCTATGTC GGTCCGGCGG CCGAGGGGCT GAACGCCCGC
CAGCTGAAGC TGCCGATCGG CCATCGCCAG AAGCCGTGGG TCAAGAACTG CGTCGCCATC
GGCCTGTCCG GCGGCTTCCT GGAGCCGCTG GAGGCGACCG GCATCATGCT GATCGAGGCG
GCGGCCTGGA TGCTGGGCCG GCTGTTTCCC CGGCCGGGCG AGTTGGAGCC GACCGCCGCC
CTGTTCAACG AGGCTATGGG CCTGCGCTAC AAGGGCGTGC TGGACTTCAT CAAGCTGCAC
TACTGCCTGA CGCAGCGCAC CGACAACGAC TTCTGGATCG ACAACACCCG GCCCGAAAGC
ATACCGGACT CGCTGCACGC CCGGCTGGAG ATGTGGAAGA CCCGCGCGCC CGACCCGTTC
GACTTCGGCA CCGTCCATGA CAGTTTCGAG GTCTTCAACT ACCAGTATGT TCTGTACGGC
ATGGGCTTCA AGACGGACCT CTCGGCCAAT CTCTCGGCCT ATCCCCACCT GGAGGCGGCG
CGGCGCGAGT TCGCGCGGCT GAAGAGCGCC GCGGGACGGG CGGCGGCGGC CATGCCTGAC
CATCGGACTT TGCTGGATGA GATCTACCGC GGCGGCTTCC GGTCTCCCAC GCCCCAAGGA
CTGGCGGCGC GATGA
 
Protein sequence
MTKPIDEVLI VGGGTAGWIT AAYLARKLGA ARPDGVRITL IESSEIGIIG VGEGTIPTIQ 
TTMREIGIDE ARFMRGAGAT FKQGIKFVDW TTAPVGGAHN HYYHSFSRPH TLGGLDLAPY
WMLGCAGDVS FSEAVTLQDT VCEAGKGPKL IDDPQYSSPL GYAYHFDAGK LATLMRDVGK
ALGVRHLIGN VEGARLDESG AIAAIVTREH GELTAGLYID CSGFSAKLIG EAMGVPFVDD
SDVLFVNRAV AIQVPYDRPD APVATTTLST AHEAGWTWDI ALPERRGVGY VYSNNHTSDD
RAEEILRAYV GPAAEGLNAR QLKLPIGHRQ KPWVKNCVAI GLSGGFLEPL EATGIMLIEA
AAWMLGRLFP RPGELEPTAA LFNEAMGLRY KGVLDFIKLH YCLTQRTDND FWIDNTRPES
IPDSLHARLE MWKTRAPDPF DFGTVHDSFE VFNYQYVLYG MGFKTDLSAN LSAYPHLEAA
RREFARLKSA AGRAAAAMPD HRTLLDEIYR GGFRSPTPQG LAAR