Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0297 |
Symbol | |
ID | 5897571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 333249 |
End bp | 334847 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641560781 |
Product | tryptophan halogenase |
Protein accession | YP_001681932 |
Protein GI | 167644269 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.427874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG AGCCCACGAT TGACCGTATC CTGATCGTCG GCGGCGGCAC GGCCGGCTGG ATGACCGCCG CCTACCTGGC CCGCCGCCTG GGGGCGATGC GGCCGGACGG GGTCCAGATC ACCCTGATCG AGTCCAGCGA GATCGGCATC ATCGGCGTGG GCGAGGGGAC CTTCCCGACC ATCCAGAACA CCATGCGGAC GATCGGCGTC GACGAGGCGC GGTTCATGCG CGAGGCCGGG GCGGCCTTCA AGCAAGGCAT CAAGTTCGTC GACTGGAAGA CCGCGCCCAA GGACGGCGTC CACAGTCACT ACTACCACCC CTTCGCCCCT CCCCGGTTGC TGAACGGCGG CATGGACCTG GCGCCCTACT GGCTGATGGG CGAGGCGGGA AACATCCCGT TCTCAGATGC GGTGACGTTG CAGGACAAGG TCTGCGACGC CATGCGCGGC CCCAAGCGGC GCGACGATCC GCAGTACGGC GGGCCGATGG CCTATGCCTA CCATTTCGAC GCCGGCAAGC TGGCCAACCT CCTGCGCGAC GTCGGCAAGG CCACGGGCGT TAAGCACCTG CTGGGCAACG TCCAGGCGGT CAACAAGACG GAAGACGGAT CGATCGCCTC GGTCACCATC CGCGAGCACG GCGACCTGAC CGCCGACCTC TATATCGACT GCACCGGTTT CGCCGGAGCG CTGATCGGCG AGGCCATGGG CTCGGCCTGG ATCGACAAGA ACGATGTGCT GTTCGTCGAC CGCGCCCTGG CCCTGCAGGT CCCCTATGAC CGGCCGGACG CTCCGGTGGC CTCCACCACG CTCTCGACCG CCCACGAGGC CGGCTGGACC TGGGACATCG GCCTGCCCGA CCGCCGGGGC ACGGGCTATG TTTATTCCAG TCGTCATACT ACGGACGATC GGGCCGAGCA GATCCTTCTC GGCTATGTCG GCAAGGCGGG CGAAGGCTTG AACCCGCGCC TGCTGAAGCT GAAGGTCGGC CACCGGGCCC AGCACTGGGT CAAGAACTGC GTGGCCGTGG GCCTGTCGGG CGGCTTCCTG GAGCCGCTGG AATCCACCGG CATCGTGCTG ATCGAGGCGG CCGCCTATAT GCTGGCCCGC AACCTGCCCC GCCGGGGCGG CATGGCGGCG GCGGCGCGCC AGTTCAATAC CGCGATGACC GACCGCTACC TGCGGGCCAT CGACTTCATC AAGCTGCACT ACTGCCTCAG CCAGCGCGCC GACAACAGCT TCTGGACCGA CAACGCCGAC CCCGCCTCGA TCCCCCAGAC GCTGCAGGAT CACCTGGCGA TGTGGAAACA TCGCCCGCCC AACGTCTTCG ACTTCCCGAA CCTCCACGAG TCGTTCAAGT CCTTCAATTA CCAGTACATC CTGTACGGCA TGGGGTACGA GACGAAGGTC GATCCGGCCG CCCACGTCCA TGGCGACCTG GCCCGGGCCG ACTTCGCGCG CGTGCGGGAG GCCGGCGTCC GCGCCGCCGC CAGCCTGCCC GACCATCGCG CCCTGCTGAC CGAGGTCTAC GCCCACGGCT TCAAGACCAA GACCCCCGAC GCCGCCTCGG CGGAGGCCGC CGAGGGGCTG CGCCGGTGA
|
Protein sequence | MSDEPTIDRI LIVGGGTAGW MTAAYLARRL GAMRPDGVQI TLIESSEIGI IGVGEGTFPT IQNTMRTIGV DEARFMREAG AAFKQGIKFV DWKTAPKDGV HSHYYHPFAP PRLLNGGMDL APYWLMGEAG NIPFSDAVTL QDKVCDAMRG PKRRDDPQYG GPMAYAYHFD AGKLANLLRD VGKATGVKHL LGNVQAVNKT EDGSIASVTI REHGDLTADL YIDCTGFAGA LIGEAMGSAW IDKNDVLFVD RALALQVPYD RPDAPVASTT LSTAHEAGWT WDIGLPDRRG TGYVYSSRHT TDDRAEQILL GYVGKAGEGL NPRLLKLKVG HRAQHWVKNC VAVGLSGGFL EPLESTGIVL IEAAAYMLAR NLPRRGGMAA AARQFNTAMT DRYLRAIDFI KLHYCLSQRA DNSFWTDNAD PASIPQTLQD HLAMWKHRPP NVFDFPNLHE SFKSFNYQYI LYGMGYETKV DPAAHVHGDL ARADFARVRE AGVRAAASLP DHRALLTEVY AHGFKTKTPD AASAEAAEGL RR
|
| |