Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4092 |
Symbol | |
ID | 5901554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4443489 |
End bp | 4444982 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564612 |
Product | tryptophan halogenase |
Protein accession | YP_001685714 |
Protein GI | 167648051 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.683477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA ACAGGATCAA GTCCGTGGTC GTGGTCGGCG GCGGCACGGC CGGCTGGATG AGCGCGGCGC TGCTGGCCCG CGCCCTGGGC GGGACGGTCG ACATCAGGCT GGTCGAATCC GAAGAGATCG GCACGGTGGG CGTCGGCGAG GCCACGATCC CGCAGATCCG TAACTTCAAC GCCTTCCTCG GCCTGGACGA GAACGCCTTC CTCGCCGCGA CGCAAGGCAC GATCAAGCTG GGCATCGAAT TCATCGACTG GCGCGCGCCC GGCCAGTCCT ACATCCATGC GTTCGGCGAG ATCGGCCGGC AGTTGGGCGC GGTCCCCTTC CACCACTATT GGCTGGCCGG CCGCCTGAAG GGCGACGATC ACCCGTTGTG GGACTATTCG CTGAACGCCC AGGCCGCCAA GGCCGGCCGC TTCGGCTGCG CCGCCGGCGC GCCGCCGACC GAGGCGCTGA CCTACGCCTT CCAGTTCGAC GCCGCCCTCT ATGCCGGCCA TCTGCGCGCC TATGCCGAAC ACCACGGGGT GGCGCGCACC GAAGGGCGGA TCCTGGGCGC GAACCTGCGC GGCGTCGACG GCCTGGTGGA GTCGGTGACG CTGGAGAGCG GCGAGGTCGT GGCCGGGGAC TTCTTCATCG ACTGCTCGGG CTTTCGCGGC GTGCTGATCG AGCAGGCGCT GCGGACGGGC TATGAGGACT GGTCGTCCTA CCTGCCGTGC GACCGCGCCA TCGCCGTACC GACCGCCAAT GTCGGCCCGC CGCGCCCCTA CACCCAGGCC TTCGCGCGTT CGGCGGGCTG GCAATGGCGC ATCCCGCTGC AGCATCGCAC CGGCAACGGC CACGTCTTCT GCAGTCGTTT CATCAGCGAG GACGAAGCGG TCGGCCAGTT GATGGCCAAT CTCGAGGGCG AAGCCCTGGC AGAGCCGCGC ACCCTGAAAT TCGTCACCGG GCGCCGCAAG GTGTTCTGGA GCAGAAACGT CCTGGCCCTG GGCCTGTCCA GCGGCTTCAT GGAGCCGCTG GAATCGACCA GCATCCACCT GATCCAGTCA GGCCTGTCGC GCCTGCTCAA CCTCTTCCCC GACAAGGCCT TCGCCCAGCG CGACATCGAC GAGTACAACC GCCAGGCCGG GCTGGAATTC GAGCGCATCC GCGATTTCCT GGTGCTGCAC TACTGGGCCA ACCAGCGCGA CGAGCCATTC TGGCGCGCCT GCCGCGAGAT GGCGGTTCCG CCCGAACTGA CCCGCAAGGT CGAGCTCTTC CGCGCCCGCG GCCGACTGTT CCGCGAGCCG GAGGATCTGT TCCTCGAAGC CAGCTGGCTG CAGGTTCTGG TCGGCCAGGG CGTGCTGCCG GAGCGCTGCC ACCCGATGAC CGGAATGATC ACCGACCCGC AGCTACAGGG CTTCCTGGCG GACCTGCGCA AGATCACCGC CGACTGCGCC GCCGCCCTGC CCGCCCATGC CGACTTCATC CGCCAGCACG CCGCCGCCCG CTGA
|
Protein sequence | MTDNRIKSVV VVGGGTAGWM SAALLARALG GTVDIRLVES EEIGTVGVGE ATIPQIRNFN AFLGLDENAF LAATQGTIKL GIEFIDWRAP GQSYIHAFGE IGRQLGAVPF HHYWLAGRLK GDDHPLWDYS LNAQAAKAGR FGCAAGAPPT EALTYAFQFD AALYAGHLRA YAEHHGVART EGRILGANLR GVDGLVESVT LESGEVVAGD FFIDCSGFRG VLIEQALRTG YEDWSSYLPC DRAIAVPTAN VGPPRPYTQA FARSAGWQWR IPLQHRTGNG HVFCSRFISE DEAVGQLMAN LEGEALAEPR TLKFVTGRRK VFWSRNVLAL GLSSGFMEPL ESTSIHLIQS GLSRLLNLFP DKAFAQRDID EYNRQAGLEF ERIRDFLVLH YWANQRDEPF WRACREMAVP PELTRKVELF RARGRLFREP EDLFLEASWL QVLVGQGVLP ERCHPMTGMI TDPQLQGFLA DLRKITADCA AALPAHADFI RQHAAAR
|
| |