Gene Information |
Locus tag | Sde_2391 |
Symbol | |
ID | 3967911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 3032944 |
End bp | 3034524 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637921482 |
Product | tryptophan halogenase, putative |
Protein accession | YP_527863 |
Protein GI | 90022036 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000100477 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACAA GACCAGCCGA TCAAACTGCT CCGCTAAAAA ATATTGTTAT AGTGGGTGGC GGTACCGCAG GCTGGCTAAC CGCAGGGCGA CTGGCAGCAC AATTTAATAC AGGCGCAGAT GCAGTTAAAA ATACGAACAA TATTCGCGTT ACACTTATTG AATCGCCCAA CATACCTACC GTTGGTGTAG GTGAAGGTAC TTGGCCCACA ATGCGATCAA CCTTAATTAA AATGGGCATT CGCGAAACAG ATTTTTTAAC CCAATGCGAC GCAACCTTTA AACAAGGCGC AAAATTTGCG CGCTGGACAA CAGGCAAGCA AGACGATTTT TACTACCACC CGCTTATGCT GCCGCAAGGT TTTGGCAAAA CAGACCTCGC ACAACACTGG CAAACCGTTA AACAAGTAAC CGGCCAATCT TTTTCAGAAG CAGTTTGCAT TCAACAAGCT ATTTGTGAAA AGGGACTCGC CCCCAAAACC ATTCGCGCCC CCGAATTTAA CGGTGCGGCC AATTACGCCT ACCACCTCAA CGCGGGTAAA TTTGCCACCT TTTTACAAAA ACACTGCACA CAAAACTTAG GCGTTAATCA TATTCTGGAT GACGTAAGCG GTGTTAACAT CGCAGACAAC GGCGATATAG CCAGTGTAAT AACCAAAGCA AACGGCAATA TTGAAGGCGA TTTATTTGTA GACTGCACTG GTTTTAACGC GCTGCTAGTA GGCAAGCACT ACCAAGTACC GTTTAAAGAC TGTAGCGATG TACTTTTCAT AGACAGTGCT TTAGCCGTAC AACTCCCCTA CTCCAAAGCA GACTCCCCCA TTGCCTCGCA CACTATTTCT ACCGCACAAG ATGCCGGCTG GATATGGGAT ATAGGTCTTA CCCACAGACG CGGTATTGGC CACGTGTATT CAAGCAGGCA CACTAGCGAA AGCGATGCGC TACAAGCGCT GGCAACTTAC ACTCAAACAG ATTGCGACAA GCTAGATGTA AGAAAAATAC CCATTAAATC GGGCCACCGC GAAAAGTTCT GGGTAAATAA TTGCGTTGCA GTGGGCTTAG CTGCGGGGTT TCTCGAGCCA CTAGAAGCCT CTGCACTTGT GCTGGTTGAG CTTTCAGCAC AAATGATAGC CGAGCAACTA CCGGCTAACA GAGCAACCAT GAATATAGTG GCAAAACGTT TTAACGAAAC CTTTCTTTAC CGCTGGGATA AAATTATCGA CTTTTTAAAA TTGCACTATT GCATTAGCCA GCGCACAGAC ACCGCCTTTT GGCGCGACAA CTGCGACCCA GCAACCATTC CACAAAGCTT GCAAGATTTA CTAGCGCTTT GGCAACATCG CGCCCCAAGC GACCTAGACT TTACCAGTAA CAACGAAGTA TTCCCTGCTG CTAGCTACCA ATATGTCCTG TACGGCATGG GGTTTAATAC CCAATTTAGC AATACAGGCT TATATAACGC AGCAGTGGCA GACGCGCACT TTATGCGCAA GCAATTGAAC GAGGACGAAG CCCTTAAGGC ATTGCCAAGC AACCGAGAAC TATTAGAAAA AATTGCCCAA TTCGGCTTAC AGCCGGTATA A
|
Protein sequence | MQTRPADQTA PLKNIVIVGG GTAGWLTAGR LAAQFNTGAD AVKNTNNIRV TLIESPNIPT VGVGEGTWPT MRSTLIKMGI RETDFLTQCD ATFKQGAKFA RWTTGKQDDF YYHPLMLPQG FGKTDLAQHW QTVKQVTGQS FSEAVCIQQA ICEKGLAPKT IRAPEFNGAA NYAYHLNAGK FATFLQKHCT QNLGVNHILD DVSGVNIADN GDIASVITKA NGNIEGDLFV DCTGFNALLV GKHYQVPFKD CSDVLFIDSA LAVQLPYSKA DSPIASHTIS TAQDAGWIWD IGLTHRRGIG HVYSSRHTSE SDALQALATY TQTDCDKLDV RKIPIKSGHR EKFWVNNCVA VGLAAGFLEP LEASALVLVE LSAQMIAEQL PANRATMNIV AKRFNETFLY RWDKIIDFLK LHYCISQRTD TAFWRDNCDP ATIPQSLQDL LALWQHRAPS DLDFTSNNEV FPAASYQYVL YGMGFNTQFS NTGLYNAAVA DAHFMRKQLN EDEALKALPS NRELLEKIAQ FGLQPV
|
| |
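The coordinate, length, and sequence fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (both assumptions are consistent with the values listed above, but the record does not state them).

```python
def coordinate_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(grouped: str) -> str:
    """Remove the 10-character grouping spaces used in the Sequence rows."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq.upper() if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Values copied from the Gene Information rows of this record.
    gene_length = coordinate_length(3032944, 3034524)
    print("Gene length from coordinates:", gene_length)
    print("Expected protein length:", expected_protein_length(gene_length))
```

To check the GC content field, paste the full Gene sequence value into `clean_sequence` and pass the result to `gc_percent`; the cleaned string length should also match the Gene Length field.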