Gene Information |
Locus tag | Caul_1839 |
Symbol | |
ID | 5899294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1953899 |
End bp | 1955404 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562329 |
Product | tryptophan halogenase |
Protein accession | YP_001683466 |
Protein GI | 167645803 |
COG category | |
COG ID | |
TIGRFAM ID | |
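The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1953899
END_BP = 1955404
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values reported in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values listed here, 1955404 - 1953899 + 1 gives 1506 bp, and 1506 / 3 - 1 gives 501 aa, matching the reported gene and protein lengths.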
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.703806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00201458 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGACC AACGCCTGCG CAAGATCGTC ATCGTCGGAG GCGGGACCGC CGGCTGGATG ACGGCGGCGG CGCTCGGCCG CTTCCTGAAG GACGGCCACA CCCAGGTGAC CCTGATCGAG TCCGAGGAAA TCGGCACGAT CGGGGTCGGC GAGTCGACCA TCCCGCAGAT CAACATCTTC AACCGCATGC TGGGCCTGGA CGAGAACGAG TTCGTCCGCC GCACCAAGGC GACCTTCAAG CTGGCCATCG AGTTCGTCGA CTGGAAACGG ATCGGCCACG CCTATTATCA CCCGTTCGGA CCCTACGGCG TCGACATGGA CGGGGTGTCT TTTCATGCCT ACTGGCTGCG GCTGAAGGCC ATGGGCGAGG CCGCCGATCT GGGCGAATAC TCCCTGCAGG CCCTGGCGGC GGCGCAGGGC AAGTTCATGC GGGCCAATCA CCAGCCCAAC TCGCCCCTGG GCAGCATCGC CCACGCCTAT CACATCGACG CCGGCCTCTA CGCCCGCTTC CTGCGCGACT ACGCCGAGGA TCACGGGATC CGCCGCCAGG AAGGCAAGAT CGTCGAGGTC CACCAGCGCG CCGTGGACGG CTTCATCGAG GCCGTGACCC TGCAGAGCGG TCAGCGCGTC GAGGGCGATC TGTTCATCGA CTGCTCGGGC TTCCGCGGCC TGCTGATCGA ACAGACCCTG AAGACCGGCT ACGAGGACTG GTCCAACTGG CTGCTCAACG ACCGCGCCGT GGCCGTGCCC TGCGAGCCGG CGGGCGCGCG CGCGCCGGTC ACCCGCGCCA CCGCCCGGCC AGCCGGCTGG CAGTGGCGCA TCCCGCTGCA GCATCGCCTG GGCAACGGCT ACGCCTATTC CAGCGAGCAC ATCAGCGAGG ACGAGGCCAC GGCCTACCTG CTCGCTAACC TCGACGGCGC GCCGCTGCGC GATCCGTTCA CCCTGCGCTT CAAGGCCGGG CGGCGAAAGA AGAGCTGGAA CAAGAACGTC GTCGCCATAG GCCTGTCGGC CGGGTTCATG GAGCCGCTGG AAAGCCAGAG CATCCACCTG ATCCAGGTGG GGATCTCGCG CCTGCTGGCC ATGTTCCCCG ACAAGCGGTT CGAGCAGCCC GACATCGACC GCTACAACAG GGTGATGCAG TTCGAATACG AGAAGATCCG CGACTTCCTG ATCCTGCACT TCCACGCCAC CCAGCGGAAC GACACGCCCT ACTGGGACTA TCTGCGGGAA ATGCCGATCC CGGACTACCT GGCCGACAAG ATCGCGGTGT TCGAGAGCTA CGGCCGGGTG TTCCGCGAGA ATGAGGAACT GTTCAACGAC ACCAGCTGGT TCGCGGTGAT GATCGGTCAG GGTCTGGAGC CGCGCGGCCA CGACCCGATG GCCGACGTAA TGTCCGACGA CGAGTTGCGC GCCAAGATGA AGGGCATCCA CGGTGTTATC GCCAAGTCGG CCGAGGTCAT GCCCGACCAC ATGACGTTCA TCGCCGAAAA CTGCGCGGCT CAATAA
|
Protein sequence | MTDQRLRKIV IVGGGTAGWM TAAALGRFLK DGHTQVTLIE SEEIGTIGVG ESTIPQINIF NRMLGLDENE FVRRTKATFK LAIEFVDWKR IGHAYYHPFG PYGVDMDGVS FHAYWLRLKA MGEAADLGEY SLQALAAAQG KFMRANHQPN SPLGSIAHAY HIDAGLYARF LRDYAEDHGI RRQEGKIVEV HQRAVDGFIE AVTLQSGQRV EGDLFIDCSG FRGLLIEQTL KTGYEDWSNW LLNDRAVAVP CEPAGARAPV TRATARPAGW QWRIPLQHRL GNGYAYSSEH ISEDEATAYL LANLDGAPLR DPFTLRFKAG RRKKSWNKNV VAIGLSAGFM EPLESQSIHL IQVGISRLLA MFPDKRFEQP DIDRYNRVMQ FEYEKIRDFL ILHFHATQRN DTPYWDYLRE MPIPDYLADK IAVFESYGRV FRENEELFND TSWFAVMIGQ GLEPRGHDPM ADVMSDDELR AKMKGIHGVI AKSAEVMPDH MTFIAENCAA Q
|
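The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the gene sequence has been pasted in as a plain string; the helper names are hypothetical. It strips the block spacing, computes the GC fraction, and applies a rough completeness check for a coding sequence. The check for an `ATG` start is a simplification, since translation table 11 also permits alternative start codons.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (simplification), terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Toy input for illustration only; substitute the full gene sequence from the record.
    toy = clean_sequence("ATGAAA GGCTAA")
    print(len(toy), round(gc_fraction(toy), 3), looks_like_complete_cds(toy))
```

Applied to the full gene sequence, the cleaned length can be compared with the reported gene length of 1506 bp and the GC fraction with the reported GC content of 65%.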
| |