Gene Information |
Locus tag | Caul_0296 |
Symbol | |
ID | 5897570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 331678 |
End bp | 333252 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641560780 |
Product | tryptophan halogenase |
Protein accession | YP_001681931 |
Protein GI | 167644268 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
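The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 331678
end_bp = 333252

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in exactly one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1575 0 524
```

Under those assumptions the result agrees with the listed Gene Length (1575 bp) and Protein Length (524 aa).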
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.628963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGAAAC CGATCGATGA GGTGCTGATC GTCGGCGGCG GGACGGCCGG CTGGATCACC GCCGCCTATC TCGCGCGCAA GCTGGGCGCG GCGCGGCCAG ACGGGGTCAG GATCACCCTG ATCGAGTCCA GCGAGATCGG CATCATCGGG GTGGGCGAGG GCACGATCCC CACCATCCAG ACGACAATGC GCGAGATCGG GATCGACGAA GCCCGGTTCA TGCGCGGGGC CGGCGCGACC TTCAAGCAGG GCATCAAGTT CGTCGACTGG ACCACGGCGC CGGTCGGCGG CGCGCACAAT CACTACTACC ACTCGTTCAG CCGGCCCCAC ACGCTGGGCG GCCTGGACCT GGCCCCCTAC TGGATGCTGG GCTGCGCGGG CGACGTGTCG TTCTCCGAGG CGGTGACCCT GCAGGACACG GTCTGCGAGG CCGGCAAGGG TCCCAAGCTG ATCGACGACC CGCAGTATTC CAGCCCGCTC GGCTACGCCT ATCACTTCGA CGCCGGCAAG CTGGCGACGC TGATGCGCGA CGTCGGCAAG GCGCTGGGCG TGCGCCACCT GATCGGTAAT GTCGAAGGCG CGCGCCTGGA CGAGTCCGGC GCCATCGCGG CGATCGTCAC CCGTGAGCAT GGCGAACTGA CCGCCGGTCT CTACATCGAT TGCAGCGGCT TCTCGGCCAA GCTGATCGGC GAGGCGATGG GCGTTCCGTT CGTCGACGAC AGCGACGTGC TGTTCGTCAA CCGCGCCGTG GCCATCCAGG TCCCCTACGA CCGGCCCGAC GCGCCGGTGG CGACGACCAC CCTTTCCACC GCCCACGAAG CCGGCTGGAC CTGGGATATC GCCCTGCCCG AGCGGCGGGG CGTCGGCTAT GTCTATTCCA ACAACCACAC CAGCGACGAC CGCGCCGAGG AGATTCTGCG CGCCTATGTC GGTCCGGCGG CCGAGGGGCT GAACGCCCGC CAGCTGAAGC TGCCGATCGG CCATCGCCAG AAGCCGTGGG TCAAGAACTG CGTCGCCATC GGCCTGTCCG GCGGCTTCCT GGAGCCGCTG GAGGCGACCG GCATCATGCT GATCGAGGCG GCGGCCTGGA TGCTGGGCCG GCTGTTTCCC CGGCCGGGCG AGTTGGAGCC GACCGCCGCC CTGTTCAACG AGGCTATGGG CCTGCGCTAC AAGGGCGTGC TGGACTTCAT CAAGCTGCAC TACTGCCTGA CGCAGCGCAC CGACAACGAC TTCTGGATCG ACAACACCCG GCCCGAAAGC ATACCGGACT CGCTGCACGC CCGGCTGGAG ATGTGGAAGA CCCGCGCGCC CGACCCGTTC GACTTCGGCA CCGTCCATGA CAGTTTCGAG GTCTTCAACT ACCAGTATGT TCTGTACGGC ATGGGCTTCA AGACGGACCT CTCGGCCAAT CTCTCGGCCT ATCCCCACCT GGAGGCGGCG CGGCGCGAGT TCGCGCGGCT GAAGAGCGCC GCGGGACGGG CGGCGGCGGC CATGCCTGAC CATCGGACTT TGCTGGATGA GATCTACCGC GGCGGCTTCC GGTCTCCCAC GCCCCAAGGA CTGGCGGCGC GATGA
|
Protein sequence | MTKPIDEVLI VGGGTAGWIT AAYLARKLGA ARPDGVRITL IESSEIGIIG VGEGTIPTIQ TTMREIGIDE ARFMRGAGAT FKQGIKFVDW TTAPVGGAHN HYYHSFSRPH TLGGLDLAPY WMLGCAGDVS FSEAVTLQDT VCEAGKGPKL IDDPQYSSPL GYAYHFDAGK LATLMRDVGK ALGVRHLIGN VEGARLDESG AIAAIVTREH GELTAGLYID CSGFSAKLIG EAMGVPFVDD SDVLFVNRAV AIQVPYDRPD APVATTTLST AHEAGWTWDI ALPERRGVGY VYSNNHTSDD RAEEILRAYV GPAAEGLNAR QLKLPIGHRQ KPWVKNCVAI GLSGGFLEPL EATGIMLIEA AAWMLGRLFP RPGELEPTAA LFNEAMGLRY KGVLDFIKLH YCLTQRTDND FWIDNTRPES IPDSLHARLE MWKTRAPDPF DFGTVHDSFE VFNYQYVLYG MGFKTDLSAN LSAYPHLEAA RREFARLKSA AGRAAAAMPD HRTLLDEIYR GGFRSPTPQG LAAR
|
| |
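The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first three blocks of the gene sequence are used here to keep the example short.

```python
def clean_sequence(spaced: str) -> str:
    """Remove the block spacing (and any newlines) and upper-case the result."""
    return "".join(spaced.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the Gene sequence field, copied from the record.
prefix = clean_sequence("GTGACGAAAC CGATCGATGA GGTGCTGATC")

print(len(prefix), round(gc_percent(prefix), 1))  # prints: 30 53.3
```

Passing the complete Gene sequence field to the same two functions should give a value comparable to the GC content field (68%). The figure for this 30-base prefix is lower simply because it covers only a small part of the gene.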