Gene Information |
Locus tag | Ssed_1209 |
Symbol | |
ID | 5614016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | + |
Start bp | 1438522 |
End bp | 1440048 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640932058 |
Product | tryptophan halogenase |
Protein accession | YP_001472948 |
Protein GI | 157374348 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.311625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTA AGAGAGTTGT AATTGTTGGC GGCGGAACTG CTGGCTGGTT AGCAGCCAAT CACCTGGGTA AAGCTGCATT AAATAATGAG CTGTTATCGG TAACCTTAAT TGAATCACCC GACATTCCAA CTATCGGTGT GGGGGAGGGA ACGGTCCCAG CAATAAGGCA GTCTTTGCAA AGCTTTGGTA TTAGTGAAAC AGAGTTTATT AAACGTTGTG ACGTGACGTT TAAACAGTCG ATAAAGTTTG CCAACTGGCT TGATAAAGAG GTACATGGTC AACACAACTT CTATCATCAC CTATTTGATA TGCCAAGCTC CTTAGGGAAG GAGTTAACCC AAGCATGGTT ATCAAAAAAG GGCAGGACTT ACGCGGAGAC CATTTCACCA CAACATGCTG TTTGTGAGGC TTACAAAGGT CCTAAAACAA TTAGTGATGC TGAATATAAG GGACATTTGG GTTATGCCTA TCACCTAAAT GCCGCTAAGT TTGCCAAGCT GTTAGGTGAG AATGCGCAAG AGCGCTTTAA TGTTCAGCAT ATAAGTGCAA ATGTGCAAGA AGTGATTTTG GGTGAAGATG GGGAGATTAA GTCTCTGGTT ACCGACAGCG TAGGGACACT CGATTTTGAC TTTTATATTG ACTGTAGTGG CTTCGCTTCC TTATTAATTG ATAAAGCACT CAAGGTTCCC TTTGTAGATA AAGCAGAGCA GTTGTTCGTC GATAAAGCGA TAACGGTACA AGTGCCAACA GATCCAGCCA GCGCTATTCC CCCGTTCACT ATTGCAACTG CACATCAGGC CGGGTGGATC TGGGATATTG CGTTGAGTAA CCGCAGAGGA GTGGGGTTCG TCTATTCAAG TAAATATATG GATGATGAGA CTGCTGTCTG TAAACTGGAC CATTATCTCG GTGGCAAGTT GTCGGAGCAT AGCTATCGAA CTATACCTAT GAAGGTGGGT TATAGAGAGC GTTTCTGGGA GAAAAACTGT GTTGCATTGG GACTCGCTCA AGGCTTTTTA GAGCCCTTGG AGGCAACCTC AATCCTACTG ACTGATTTCT CCGCAGGATT TTTGGCCAAT AGATTCCCAA CTTCCGCAGC GCAACTGGAT GGAATGAGGC AACAGTTTAA TCAGGTGATG GGTTATGCCT GGGAGCGAGT GGTTGATTTT ATCAAGCTGC ATTATTGCTT GTCTGATCGA ATTGATTCTC AGTTTTGGAT AGATAATGGA GACCCGGGTA CCATGTCAGA TGAACTGAGT AAGCGTTTAT CGCTCTGGAG TAGCTTTATC CCTAATCGAG AAGACTTCTT CAGTAAGTTT GAGGTATTTG ATCTCGAGAA CTACCTGTAT GTTTTATACG GTATGAATTT ACCAACTCAG GTAGCGTTAA GCCGTGCAAG TGATGAGAGC AATGCCAGGA TGCATAGCGA TAAGATTGCC CGGATAGCGG ATCAGTTAGT CAATGAACTT CCAAATCATA GAGAGCTCTT AGAGAAGATA AATCGCTATG GCTTGCAAGA GGTATAG
|
Protein sequence | MDIKRVVIVG GGTAGWLAAN HLGKAALNNE LLSVTLIESP DIPTIGVGEG TVPAIRQSLQ SFGISETEFI KRCDVTFKQS IKFANWLDKE VHGQHNFYHH LFDMPSSLGK ELTQAWLSKK GRTYAETISP QHAVCEAYKG PKTISDAEYK GHLGYAYHLN AAKFAKLLGE NAQERFNVQH ISANVQEVIL GEDGEIKSLV TDSVGTLDFD FYIDCSGFAS LLIDKALKVP FVDKAEQLFV DKAITVQVPT DPASAIPPFT IATAHQAGWI WDIALSNRRG VGFVYSSKYM DDETAVCKLD HYLGGKLSEH SYRTIPMKVG YRERFWEKNC VALGLAQGFL EPLEATSILL TDFSAGFLAN RFPTSAAQLD GMRQQFNQVM GYAWERVVDF IKLHYCLSDR IDSQFWIDNG DPGTMSDELS KRLSLWSSFI PNREDFFSKF EVFDLENYLY VLYGMNLPTQ VALSRASDES NARMHSDKIA RIADQLVNEL PNHRELLEKI NRYGLQEV
|
| |
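The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch spells out. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed above but are not stated by the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the gene length includes one terminal stop codon,
    which does not code for an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content as a percentage; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record above.
length_bp = gene_length(1438522, 1440048)   # 1527
length_aa = protein_length(length_bp)       # 508
```

`gc_percent` accepts the gene sequence exactly as printed above, in space-separated blocks of ten bases, so the GC content field can be checked by pasting the sequence in as a string.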