Gene Information |
Locus tag | Caci_2724 |
Symbol | |
ID | 8334073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3119401 |
End bp | 3120921 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644955874 |
Product | tryptophan halogenase |
Protein accession | YP_003113480 |
Protein GI | 256391916 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.656683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGCT CTGACAGAAG GATCCTGACA AAAAGGATGG TGTTTTCGGA GATGACCGAG GTAATCGTCG TCGGCGGCGG TCCGGCCGGA TCGACCGCAG CCGCCCTACT GGCCAAGAAC GGAGTTTCGG TCACGCTCCT GGAACGCGAG GCGTTCCCCC GCTACCACGT CGGCGAGTCG ATCACGTACT CGTGCCGGGG CGTGCTGGAC TACATCGGCG CGCTGGAGAA GATCGAAGCC CGCGGCTACA CCCGCAAGAC CGGCGTGCTG CTGCGCTGGG GACAGGAGCT GGACTGGGCG ATCGACTGGA CCGCGCAGTT CGGGCCGGAC GTGCGCTCCT GGCAGGTGGA CCGGGAGGAC TTCGACCAGG TGCTGCTGCA GCACGCGGCC GAGTGCGGCG CCCAGGTCCT GGAGCAGGCG CAGGTCAAGC GGGTGGTGTT CGAGGACGGC CGCGCCGTCG GCGTCGAGTG GACGCCCCAG GGCCACAGCG AGCCGCAGAT CACCCGCGCG GACCTGGTCC TCGACGCCTC GGGCCGGGCC GGTCTGATCA GCGCCCAGCA CTTCCGCGAC CGCCGCGCCA CCGAGATCTT CCGCAACGTC GCGATCTGGG GCTACTGGGA CGGCGGCGAG CTGCTGCCGG ACAGTCCGTC CGGGAGCATC AACGTCATCT CCTCGCCCGA GGGCTGGTAC TGGGTCATCC CGCTGAGCGG GAACCGGTTC AGCGTCGGCT ACGTCACGCA CAAATCAGTG TTCGTCGAGC GGCGCAAGGA CTACGACACC CTGGACGACA TGCTCGCGGC GGTGGTTGCC GAGTCGCCGA CGGTGAGCGA GGCGATGGCC AAGGGCACGG TCCGCCCGGG CGCCCGGGTC GAGCAGGACT TCTCCTACGC GGCCGACAGC TTCTGCGGCC CGGGCCACTT CCTGGTCGGC GACGCGGCCT GCTTCCTGGA CCCGCTGCTG TCCACCGGCG TGCACCTGGC CATCTACAGC GGACTGCTCG CGGCCGCCTC GGTGCTCTCG ATCGAGAACG GCGACGTCAC CGAGACCGAG GCCTACGCGT TCTACGAGTC CCGCTACCGC AACTCCTACG AGCGGCTGTT CACGCTGGTC GCGGGCTTCT ACCAGAAGCA CGCCGGCAAG GACCGCTACT TCGAGCTGGC CAAGGCCCTG ACCCGGGAGC ACAAGGGGCT GGAGGGCAGC GCCGACCTGG CGTTCGGCGA GATCACCTCC GGCATCACCG ACCTGCGCGA GGCCAAGGAC GACAGCGGGC TCGGCGACCG GCCGATCCGC GAGTCCGTCG CCGAGGCGGC CTCCCGGCGC TCGAAGGTCC AGGACCTGCT CAGCGCGACC GAGCAGGCGC AGCAGCGGGC CGAGGCCGGG CTGCCCAACA CCGGCGGCGA CCGCGGCCGC TCGCGGGTGC AGATCGACGC CGACGACCTC TACGACGCCG CGACCGGGCT GCACCTGGTC ATGGAACCGC GGCTGGGGAT CCAGCGGGCG GCGGTGCCGG CGGCCGGCTG A
|
Protein sequence | MSRSDRRILT KRMVFSEMTE VIVVGGGPAG STAAALLAKN GVSVTLLERE AFPRYHVGES ITYSCRGVLD YIGALEKIEA RGYTRKTGVL LRWGQELDWA IDWTAQFGPD VRSWQVDRED FDQVLLQHAA ECGAQVLEQA QVKRVVFEDG RAVGVEWTPQ GHSEPQITRA DLVLDASGRA GLISAQHFRD RRATEIFRNV AIWGYWDGGE LLPDSPSGSI NVISSPEGWY WVIPLSGNRF SVGYVTHKSV FVERRKDYDT LDDMLAAVVA ESPTVSEAMA KGTVRPGARV EQDFSYAADS FCGPGHFLVG DAACFLDPLL STGVHLAIYS GLLAAASVLS IENGDVTETE AYAFYESRYR NSYERLFTLV AGFYQKHAGK DRYFELAKAL TREHKGLEGS ADLAFGEITS GITDLREAKD DSGLGDRPIR ESVAEAASRR SKVQDLLSAT EQAQQRAEAG LPNTGGDRGR SRVQIDADDL YDAATGLHLV MEPRLGIQRA AVPAAG
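The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the helper names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above but are not stated by the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    return (bases.count("G") + bases.count("C")) / len(bases)


# Values taken from the record (Start bp, End bp).
length_bp = gene_length(3119401, 3120921)
print(length_bp)                  # 1521, matching "Gene Length"
print(protein_length(length_bp))  # 506, matching "Protein Length"
```

Passing the full "Gene sequence" string to `gc_fraction` gives the value to compare against the reported GC content of 70%.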