Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3181 |
Symbol | |
ID | 7085794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 3769848 |
End bp | 3771359 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643462065 |
Product | tryptophan halogenase |
Protein accession | YP_002359089 |
Protein GI | 217974338 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.025127 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAG CCATAAAAAG AGTCGTGATA GCGGGTGGCG GAACAGCCGG ATGGATGGCA GCTGCGGCGC TCACTAAGTT GATGGGCAAA CACTTAGAAA TTGTGCTGGT CGAATCCGAT GAAATAGGTA CGGTCGGTGT GGGCGAAGCG ACGATCCCCA CATTACACAT TTTTCATCGA TTACTCGGCC TCAAAGAACA AGAGGTCATG GCAGCGACAA ATGCGACCTT TAAACTCGGG ATCTCCTTTG AAAATTGGCA GGATATCAAT AAAAACTACC TGCATTCCTT TGGTTTTTTA GGTAAAGATT GTTGGGCCTG TGGTTTTCAA CATTTTTGGC TGAAAGGCAA ACAACTCGGC ATGGTGAGTG AAATAGGCGA CTACTGCGCC GAACATCTCG CCGCCCGTCA AGGACGATTT GCCGTACTGC CAAACCAAGA TTACAACCAT GCTTACCACA TGGACGCCAG TCTCTATGCC AAGTACCTTC GAAAAATGGC CGAACAACAC GGCATTCACC GTATTGAAGG AAAAATAAAA CAGGTATTAC AGCATCATGA TAGCGGCAAT ATCAAAGCGC TCGTCTTAGA GAATGACGAA ACAATCGAAG GGGATTTGTT TATCGATTGC ACCGGCTTTC GCGCACTGCT GATCGAGCAA ACGCTTAATA CTGGCTTTGA GGATTGGAGT CATTATCTCC CCTGCGACAG TGCGATTGCG GTGCAAACCC AGTCCGTGGG TGCCCCTATT CCTTATACTC GCTCGATTGC CCGCGACTCG GGTTGGCAGT GGCGTATTCC ACTGCAAAAT CGCACGGGCA ATGGCCTGGT CTTTTGCAGC AAATTTATCT CCGATGAGGA TGCCACTGAA TTGCTCCTCG CTAATCTTGA AGGGGAACCG CTGAATAAGC CTAGGGTCAT TAAGTTTAAA ACCGGCACCC GTCGCCTGCA TTGGCATAAA AACTGTGTTG CCGTCGGGCT GTCCGGCGGC TTTTTAGAGC CACTCGAATC CACCAGCATT CATTTAATCC AACGTAGTAT TATCAGATTG ATGCAACTAT TTCCGTCGGC GGGCATAGTG CAATCCGATA TTGATGAGTT TAATCAACAA ACTAAACTCG AAATGGACAA TATCCGCGAC TTTATCATCT TGCATTACAA GGCCACGGAA CGCGAAGACA GCCGTTTCTG GCGTTATTGC AAAAATATGG ACATTCCAGC GTCGCTCAAA CATCGAATCG AAATGTTTGC CGACAGCGGC AAAGTCTATA AATACGGTAG TGAACTCTTT GGCGAAAGCT CATGGATCCA AGTGATGATG GGCCAAGGCA TAATGCCAAA ACACTATCAC CCGATAGTCG ATGTGATGGA AGAGCCTGAA CTCGAAGCCT TCTTAAACAA CATTAAATCA ACCGTTAAGC GTAAGGTCGA AAGCCTACCT GCCCACATCG ATTTTATTCA GCACTATTGC CCAGCTGAGG TTCTAGCCAA CACTAAACCT ATGGCAATGT GA
|
Protein sequence | MTTAIKRVVI AGGGTAGWMA AAALTKLMGK HLEIVLVESD EIGTVGVGEA TIPTLHIFHR LLGLKEQEVM AATNATFKLG ISFENWQDIN KNYLHSFGFL GKDCWACGFQ HFWLKGKQLG MVSEIGDYCA EHLAARQGRF AVLPNQDYNH AYHMDASLYA KYLRKMAEQH GIHRIEGKIK QVLQHHDSGN IKALVLENDE TIEGDLFIDC TGFRALLIEQ TLNTGFEDWS HYLPCDSAIA VQTQSVGAPI PYTRSIARDS GWQWRIPLQN RTGNGLVFCS KFISDEDATE LLLANLEGEP LNKPRVIKFK TGTRRLHWHK NCVAVGLSGG FLEPLESTSI HLIQRSIIRL MQLFPSAGIV QSDIDEFNQQ TKLEMDNIRD FIILHYKATE REDSRFWRYC KNMDIPASLK HRIEMFADSG KVYKYGSELF GESSWIQVMM GQGIMPKHYH PIVDVMEEPE LEAFLNNIKS TVKRKVESLP AHIDFIQHYC PAEVLANTKP MAM
|
| |