Gene Information |
Locus tag | Caul_2406 |
Symbol | |
ID | 5899861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2622912 |
End bp | 2624456 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562897 |
Product | tryptophan halogenase |
Protein accession | YP_001684031 |
Protein GI | 167646368 |
COG category | |
COG ID | |
TIGRFAM ID | |
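The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2622912
end_bp = 2624456
gene_length_bp = 1545
protein_length_aa = 514


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the record: 1545 bp and 514 aa.
print(span_length(start_bp, end_bp))
print(expected_protein_length(gene_length_bp))
```

Under these assumptions the span is 2624456 - 2622912 + 1 = 1545 bp, and 1545 / 3 = 515 codons, one of which is the stop codon, leaving 514 residues.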
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.699378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.260972 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCGCC CCGTAAAAAA TATCGTCATC GTGGGCGGCG GGACCGCCGG TTGGCTCACC GCGGGCCTGA TCGCGGCCAA GCACAAGGCC CGCCAAGCGA CGGGCTTCAC CGTCACCCTG GTGGAATCGC CCAACACGCC CATCATCGGC GTCGGAGAAG GCACATGGCC GACCCTGCGC ACGAGCCTGG ACAAGATCGG CGTGTCGGAG ACCGATTTCT TCCGGGAGTG CGATGCGGCC TTCAAGCAGG GCGCGAAATT CGCGCGCTGG ACCACGGGCG CGGCCGACGA CGCCTATTAT CACCCGCTCA TGCTGCCGCA GAGCTTTTCG CAGGTGAACC TTGTTCCCCA CTGGCTGGTC GGCGGGGCGG GGCGAAGTTT CTGTGACGCG GTCACGCCGC AAGGGCGGCT CTGCGACGAG GGTCTGGCCC CCAAGACCAT CACCGCCGCG CCATATCAGG GCGCCGCCAA CTACGCCTAT CATCTGGACG CGGGCAAGTT CGCGCCGTTC CTGCAGCGCC ACTGCTGCGA CAAGCTGGGC GTCCGCCATG TCCTGGCCGA CGTCGAAAGC GTAGCCATGA CCGAGGACGG GGATATTCGC GGCGTCGTCA CCGAACAGCA CGGCGAGATT CAGGGTGATC TCTTCGTCGA TTGCACCGGC TTTCGCGCCC TCCTGCTTGG CGAGACGCTC GGCGTGCCGT TCCGCGGCTG CGGCGATGTC CTATTCTGCG ACACGGCGCT GGCCATCCAG GTTCCCTACG AGACCGAGAC CAGCCCCATC TCCAGTCACA CGATCTCCAC GGCCCAGTCG GCCGGATGGA TCTGGGATAT CGGCTTGCCC ACGCGCCGCG GCGTTGGCCA CGTTTATTCC AGCCGCCACA TCAGCGATGA GCACGCCGAG CGCGAGTTGC GGGCCTATAT CGGTCCGGCC GGCCACAACC TTCCGGCCAG GAAGATTGCG ATCCGCTCGG GCCATCGCGA GACGTTCTGG AAACGCAACT GCGTGGCCGT GGGACTCGCC GCGGGATTTC TCGAACCGCT CGAAGCGTCC GCGATCGTCC TGATCGAACT ATCGGCCAAA CTGATCGCCG AGCAGATGCC CGCCTGCCGC GAAGTGATGG ACATCGTCGC GGCGCGCTTC AACGCCACCA CGCATTATCG CTGGGGCCGC ATTATCGATT TCCTGAAGCT GCATTACGTT CTGAGCCAAC GGTCGGACAG CGCCTTCTGG CGAGACAATC GCGCGCGCGA AACCATCCCC GATCGGCTGG CCGACCTGCT CTTGCTGTGG CGTCATCAAC CGCCCTGGCT ACACGACGAG TTCGACCGCG CCGACGAGAT CTTTCCGGCG GCCAGCTACC AATACGTGCT CTATGGCATG GGCTTTCGCA CGCAGATCGA ACCAGAGTCC CTGGCGGACG AGCGAGCGAT CGCCGAGCGG GCCTGGCGGG AGACGGCGGC TCAAACCGAG AGGCTGCGCG CCACCCTGCC CCACCATCGC GACCTGATCC GCAAGATTGT CGAACACGGC TTGCAGCCCG TATGA
|
Protein sequence | MVRPVKNIVI VGGGTAGWLT AGLIAAKHKA RQATGFTVTL VESPNTPIIG VGEGTWPTLR TSLDKIGVSE TDFFRECDAA FKQGAKFARW TTGAADDAYY HPLMLPQSFS QVNLVPHWLV GGAGRSFCDA VTPQGRLCDE GLAPKTITAA PYQGAANYAY HLDAGKFAPF LQRHCCDKLG VRHVLADVES VAMTEDGDIR GVVTEQHGEI QGDLFVDCTG FRALLLGETL GVPFRGCGDV LFCDTALAIQ VPYETETSPI SSHTISTAQS AGWIWDIGLP TRRGVGHVYS SRHISDEHAE RELRAYIGPA GHNLPARKIA IRSGHRETFW KRNCVAVGLA AGFLEPLEAS AIVLIELSAK LIAEQMPACR EVMDIVAARF NATTHYRWGR IIDFLKLHYV LSQRSDSAFW RDNRARETIP DRLADLLLLW RHQPPWLHDE FDRADEIFPA ASYQYVLYGM GFRTQIEPES LADERAIAER AWRETAAQTE RLRATLPHHR DLIRKIVEHG LQPV
|
| |
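The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned and checked against the GC content and protein fields follows; the helper names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons), and it assumes the gene sequence is already given in coding orientation, which is consistent with it beginning with ATG.

```python
from itertools import product

# Codon table shared by NCBI translation tables 1 and 11 (TCAG ordering).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq):
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGGTCCGCC CCGTAAAAAA TATCGTCATC"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MVRPVKNIVI`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence fields.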