Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_1079 |
Symbol | |
ID | 4843891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | - |
Start bp | 1240411 |
End bp | 1241937 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640118302 |
Product | tryptophan halogenase |
Protein accession | YP_001049471 |
Protein GI | 126173322 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.488063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATT CTCCGCTAAA AAGTATCACT ATTGTCGGTG GCGGTAGCAC AGGTTGGATG ACGGCTCTCT ACTTGAGTAA GTTATATAAC CATTCGTCTA ATATCATTGA TATTAGACTT ATCGAAAGTA AAGAAATTGG TATTATTGGC GTTGGTGAAG CTACCGTTCA TAGTCTACGA TTCTTCTTTG CAGCTATGGG ATTAGATGAA AATGAACTAC TTAGAGAAAC TAACGCAACA TTAAAGACCG GCATATTATT TCGTAACTGG ATGAAACCAG TCGATGGTAA AATGCATGAA TATTTCCATC CATTTGAGCA ACAAAAACCA TCTGGAAAAA TTGATAATGC AAGTGCTTGG ATTTTAAATA ATCAGATTAA TGAATCAAAC AGCACTCCCT TTGCTGAAGC AACAAGTCTT TCATTTAGCT TAATGCAAAA TAACTTAAGC CCTAAAACGC ATTACTTGAA TCAATATCAA GGTATTGTCC CCTATGGTTA CCACTTAGAC GCGACGCTCA TGGCGGCCTT TTTAAAGCGT AAAGCCACCC AAGCAGGTGT AGAGCATATT GAAGCAAATG TAACGGACGT TCTGGTTAAT GATGGCAACA TCACTGAGGT TGTAACTGAC CTAGGCTCAT TCAAGAGCGA CATATTTATC GACTGCACAG GCTTTAAAGG GCTTTTAATT CAAAGCCTAA AGAATGATAA CTGGCAATCA TTCGAAACTG AATTACCTTG CAATAAAGCC GTTGCAATGC AGCGGGAATA TCTGCCTCAC CAGCTACCAA GAGCCTATAC GACAGCAACA GCGCTTAGTC ATGGTTGGGT TTGGGAAATT GACCTAACGA ATCGTCAAGG AACGGGTTAT GTCTACGATG GCAATAGTCT AACGAGGGAG CAAGCTGAGG CTGAATTAAA AGCTTATCTT GGCGATGAAC AAGAGATCAT AAGAACTGTT CACTTAGATA TGAAGATAGG TTGTAGACGT GAATTTTGGG TTGGAAATTG TATTGCAATG GGGCTCGCAG GTGGATTTAT TGAGCCCTTA GAATCGACAG GCTTGCATCT TATAAACTTG GGGGCAAGAT TGCTAGCGAC GCATTTAACG TCATCAAATC CCTCCCAAGC AATAAGGACG TCATATAATA CCGCAATGAA AGGCTTGTAT ACCGATCTTA GACAGTTTAT CGTACTTCAT TACTGCTTAA CTGATAGAGA TGATAATGAA TTTTGGCAAA AAGCCGCACA AAGCTCAGCA TTCATCCCTG CTCTAGAACA AAAAATGACA CTATGGAAAG ACAAGGTTTG TGAGTATGTC GATCTTGCCA ATGGATATAG TTCAGTTTTT ACAGATGAGA ATTATCGCTA CGTACTTTAT GGTATGCAGC ATATTCCTGA AATCCATATA CCTTGTCCCA AAACTGAAAT ATCAAGAATA CTCGACAACT TAAAAACACA ACAGTTTAGA GCCAAAGAGC AATCTTTATC ACATCAAGAA TTTTTAGCAA AAATTAAAAA ATTTTAA
|
Protein sequence | MNNSPLKSIT IVGGGSTGWM TALYLSKLYN HSSNIIDIRL IESKEIGIIG VGEATVHSLR FFFAAMGLDE NELLRETNAT LKTGILFRNW MKPVDGKMHE YFHPFEQQKP SGKIDNASAW ILNNQINESN STPFAEATSL SFSLMQNNLS PKTHYLNQYQ GIVPYGYHLD ATLMAAFLKR KATQAGVEHI EANVTDVLVN DGNITEVVTD LGSFKSDIFI DCTGFKGLLI QSLKNDNWQS FETELPCNKA VAMQREYLPH QLPRAYTTAT ALSHGWVWEI DLTNRQGTGY VYDGNSLTRE QAEAELKAYL GDEQEIIRTV HLDMKIGCRR EFWVGNCIAM GLAGGFIEPL ESTGLHLINL GARLLATHLT SSNPSQAIRT SYNTAMKGLY TDLRQFIVLH YCLTDRDDNE FWQKAAQSSA FIPALEQKMT LWKDKVCEYV DLANGYSSVF TDENYRYVLY GMQHIPEIHI PCPKTEISRI LDNLKTQQFR AKEQSLSHQE FLAKIKKF
|
| |