Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3178 |
Symbol | |
ID | 7085791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 3766382 |
End bp | 3768013 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643462062 |
Product | tryptophan halogenase |
Protein accession | YP_002359086 |
Protein GI | 217974335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0464622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAAG CAACTCACAC AGCCATTAAT AACATTGTCA TTGTCGGTGG CGGCACAGCC GGTTGGATAA CCGCTGCCCT TTTGGCCGCC GAGCATAATG TCGATAAAGG TCAGCTTGCG CACTCGCCTA AACTCAATAT CACATTAATA GAATCCCCCG ATGTGGCCAC TATTGGCGTC GGCGAAGGCA CTTGGCCTTC GATGCGTAAC ACCTTAGATA AAATCGGGAT CAGCGAAACC GAATTTATCC GCCAATGCGA CGCGAGCTTC AAACAGGGCT CACGCTTTAT TCACTGGCAG CACGATGATA CTGAACATAC TAACCCATTC TGTTCCAACC AATATTTACA TCCCTTTAGC CTACCCCATG GTCATCAAGA ATTAGACTTG TGCCCTTTTT GGTTGCCTCA CAGTGACAAG GTCAGTTTTG CTCAGGCGGT ATCCAACCAA GATGCGCTGA CTCAGTTAGG GCTAGCGCCT AAAACCATTG CGACGCCTGA ATATCATTTT CAAAACAATT ACGGTTATCA CTTAGATGCG GGGAAATTTA GCCAACTGTT AATGCAGCAT TGCACCGAAA AACTGGGGAT AAAGTACATT CGTGATCATG TCACTCAGGT TAAAAGTCTT GCCAATGGCG ATATCGAAAG CCTTGCGACC AAAGAACATG GCGTGATCTT AGGCGATATG TTTGTCGATT GTTCCGGGAC GAAATCATTG CTGCTGGGCG AGCATTTTAA CGTGCCCTTC CTCTGCCAAA AAGCTGTGCT GTTTAACGAC AGCGCCTTAG CATTACAAGT CCCCTATGCC GAAGAAAATA GCCCTATCGC CTCATGCACA CTCTCAACGG CGCAGCCCAA TGGCTGGATC TGGGATATAG GTTTACCGAC TCGCAAAGGC GTGGGTTATG TCTATTCATC GGCCCATTGT AGCGATGACG AGGCTGAACG AACCTTAAGA GCCTATTTAA CCAATGACAC ATCAACTAAT CCTCAATCGG CATCCAACAA CGCAGTTGAT AGTCGTAAAC AAGAATGTCG AAAGCTAAAT ATCAACCCCG GCTACCATGC AAAATGCTGG CAAAACAACT GTATCGCCAT CGGCATGGCG GCAGGCTTTA TTGAACCTTT AGAAGCATCG GCATTGGCTT TAGTGGAATG GACGGCGAAT ACCTTAGCCA CGCAACTGCC AACGCATCGC GGCGTAATGG ACACGATTGC CCTGCGAGTG AATGAACGCT ACGAGCGCCA CTGGCAGCAA ATCATCGACT TTTTGAAACT GCATTATGTC GTCAGTCGAC GCGAAGTGGA TGGCTACTGG CGCGATCACC GGGAGGCCGC ATCCATCCCT GAAAGATTAC AGCAGCAACT CGACCTATGG CGTTATCAAG CGCCAAGCAG CCACGATATT AGCTACAAAG AACCCTTATT TCCGGCGGCG AGTTTCCAAT ATGTGCTTTA TGGTATGGGC TTTAATACTG CCCTACCGAC TCATATCAAA CCGTCGCAAC AACAAGTTGC CCAAAGACTT TTTAGCGAGA ATCAGCAAAA GATCCATGGA TTGAGCCAAA GCCTTCCGAG CAATCGCGAC CTATTAAACA AGGTCAGGCA ATTTGGATTC CCTAAGATTT AG
|
Protein sequence | MQQATHTAIN NIVIVGGGTA GWITAALLAA EHNVDKGQLA HSPKLNITLI ESPDVATIGV GEGTWPSMRN TLDKIGISET EFIRQCDASF KQGSRFIHWQ HDDTEHTNPF CSNQYLHPFS LPHGHQELDL CPFWLPHSDK VSFAQAVSNQ DALTQLGLAP KTIATPEYHF QNNYGYHLDA GKFSQLLMQH CTEKLGIKYI RDHVTQVKSL ANGDIESLAT KEHGVILGDM FVDCSGTKSL LLGEHFNVPF LCQKAVLFND SALALQVPYA EENSPIASCT LSTAQPNGWI WDIGLPTRKG VGYVYSSAHC SDDEAERTLR AYLTNDTSTN PQSASNNAVD SRKQECRKLN INPGYHAKCW QNNCIAIGMA AGFIEPLEAS ALALVEWTAN TLATQLPTHR GVMDTIALRV NERYERHWQQ IIDFLKLHYV VSRREVDGYW RDHREAASIP ERLQQQLDLW RYQAPSSHDI SYKEPLFPAA SFQYVLYGMG FNTALPTHIK PSQQQVAQRL FSENQQKIHG LSQSLPSNRD LLNKVRQFGF PKI
|
| |