Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0530 |
Symbol | |
ID | 8331857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 614461 |
End bp | 615936 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953687 |
Product | tryptophan halogenase |
Protein accession | YP_003111314 |
Protein GI | 256389750 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.680136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.208224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAGC AGAACACAGT GAAAACTGCG AACGACGTAG ACAACCCGGA CAACGCCGAG AAGACCCGCG TGCTGGTCAT CGGCGGCGGC CCCGGCGGCT CGACCGCCGC CACCCTGCTG GCCCGGCAGG GCATCGAGGT GACGCTGCTG GAGAGCGCCA TCTTCCCCCG CTACCACATC GGCGAGTCGA TCCTGCCTTC CGTACTGCCG GTGCTGGACC TGCTCGGAGT CCGCGAGGAG GTCGACAACC ACGGCTTCGT GCGCAAGGAC GGCGCCTATT TCGAGTGGGG CCCGGAGAAC TGGGACCTGA ACTTCGACCA CCTGTCCGGC GCCAGCCGGC ACAGCTACCA GGTGATCCGG TCCGAGTTCG ACCACATGCT GCTGAAGAAC GCCCAGGCCA AGGGCGTTGA CGTGCGCGAG GGCGTCAAGG TCACGGAGAT CCTGTTCCAC GGCGACCGCC CCGTCGCGGC CCGCTGGTCA GCCTCGGACA ACTCCGGGGC CGCCGGCACC ATCGCGTTCG ACTTCCTGGT CGACGCCTCG GGCCGGGCCG GGGTGATGGC GACCAAGTAC CTGAAGAACC GCCGATACCA CGAGGCCTTC AAGAACGTCG CGGTCTGGTC CTACTGGCGC GACGTCAAGC CGCTGGAGGT CGGGCCGAAG GGCGCCATCG CGGTCTGCTC GGTGCCCTAC GGCTGGTTCT GGGCCATTCC GCTGCACGAC GGCACGACCT CGATCGGCCT GGTGGCCAAG CGCACCACAT TCTCCGACGA GCGCGAGCGG CTCGGCAGCA TCGAGGCGGT CTACGCCGAC GCGATGACCC AGGCCCCGCG GATCCTGGAG ATGACCCAGG GCGCGAACAA GATCGAGGGT TACAAGGTCG AGCAGGACTA CTCCTACGTC TCCGAGCGCA AGTCCGGCCC CGGCTACGTC CTGGTCGGCG ACGCGGCCGC CTTCCTGGAC CCGCTGCTGT CCACCGGGGT GCACCTGTCG ACGTTCAGCG CGCTGCTCGC GGCCGCCTCG GTCTCCGCGG TGCTGGACGG CGAGCTGGCC GAGCAGGAGG CGGTGGACTT CTACGAGCGG GCCTACCACC AGGCCTACGA GCGGCTGCTG GTGGTCGTCT CCTTCTTCTA CAACAGCTAC AACCGGCAGA CCCAGTTCTT CGAGGCCGAC AAGCTGACCC GGCGCGAGAG GCACATGCTG AACCTCTACG AGTCTTTCCT GCACATCGTC ACCGGCATCG AGGACCTGGA CGACTCGATC GATGGCGGCG AGGCGCTGGA GGAGGTCGCG CAGCAGATCG CGACCCAGAA GAAGATCGAC GCCGGGCACA ACGAGGCGAT GAACTCGCTG CCGGACTCGC CGCGGCAGGC CGTCGGCGGG TTGTACCTGG AACTGGAGCC CCGGCTACGC ATCCGCCGCA CATCCGAGGC GCCCGCCGGC CCGGAGCCGG TGCGGGAAGC GAGGGCCGGG TTGTGA
|
Protein sequence | MTEQNTVKTA NDVDNPDNAE KTRVLVIGGG PGGSTAATLL ARQGIEVTLL ESAIFPRYHI GESILPSVLP VLDLLGVREE VDNHGFVRKD GAYFEWGPEN WDLNFDHLSG ASRHSYQVIR SEFDHMLLKN AQAKGVDVRE GVKVTEILFH GDRPVAARWS ASDNSGAAGT IAFDFLVDAS GRAGVMATKY LKNRRYHEAF KNVAVWSYWR DVKPLEVGPK GAIAVCSVPY GWFWAIPLHD GTTSIGLVAK RTTFSDERER LGSIEAVYAD AMTQAPRILE MTQGANKIEG YKVEQDYSYV SERKSGPGYV LVGDAAAFLD PLLSTGVHLS TFSALLAAAS VSAVLDGELA EQEAVDFYER AYHQAYERLL VVVSFFYNSY NRQTQFFEAD KLTRRERHML NLYESFLHIV TGIEDLDDSI DGGEALEEVA QQIATQKKID AGHNEAMNSL PDSPRQAVGG LYLELEPRLR IRRTSEAPAG PEPVREARAG L
|
| |