Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_1212 |
Symbol | |
ID | 5752939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 1438554 |
End bp | 1440185 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641287481 |
Product | tryptophan halogenase |
Protein accession | YP_001553647 |
Protein GI | 160874331 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.381407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAAG CAACTCACAC AGCCATTAAT AACATTGTCA TTGTCGGTGG CGGCACAGCC GGTTGGTTAA CCGCTGCTCT TTTGGCCGCC GAGCATAATG TCGATAAAGG CCAGCTTGCG CACTCGCCTA AACTCAATAT CACATTAATA GAATCCCCCG ATGTGGCCAC GATCGGCGTC GGCGAAGGCA CTTGGCCTTC GATGCGTAAC ACCTTAGATA AAATCGGCAT CAGCGAAACC GAATTTATTC GCCAATGCGA CGCGAGCTTC AAACAGGGCT CACGCTTTAT TCACTGGCAG CACGATGATA CTGAACATAC TAAACCATTC GATTCTAACC AATATTTACA TCCCTTTAGC CTACCCCATG GCCATCAAGA GTTAGACTTG TGTCCCTTTT GGTTGCCTCA CAGTGACAAG GTCAGTTTTG CTCAGGCGGT ATCCAACCAA GATGCGCTGA CTCAGTTAGG GCTAGCGCCT AAAACCATTG CGACACCGGA ATATCATTTT CAAAACAATT ACGGCTATCA CTTAGATGCG GGTAAATTTA GCCAACTGTT AATGCTGCAT TGCACCGAAA AACTGGGGGT CAAGTACATT AGAGATCATG TCACTCAGGT TCACAGTCAT GCCAATGGCG ATATAGAAAG CCTTGCGACC AAAAAACATG GCGTGATCTT AGGTGATATG TTTGTCGATT GTTCCGGCAC TAAATCATTG CTGCTGGGCG AGCATTTTAA CGTGCCCTTC CACAGCCAAA AAGCTGTGCT GTTTAACGAC AGCGCCTTAG CAATACAAGT CCCCTATGCC GAAGAAAACA GCCCTATCGC CTCATGCACA CTCTCAACGG CGCAGCCCAA TGGCTGGATC TGGGATATAG GTTTACCGAC TCGCAAAGGC ATGGGTTATG TCTATTCATC AGCCCATTGT AGCGATGACG AGGCTGAACA AACCTTAAGA GCCTATTTAA CCAATGACAC ATCAACTAAT CCTCAATCGG CATCCAGCAA CGCGCTTGAT AGTCGTAAAC AAGAATGCCG AAAGCTAAAT ATCAACCCCG GCTATCATGC AAAATGCTGG CAAAACAACT GTATTGCCAT CGGCATGGCG GCGGGCTTTA TTGAGCCGCT AGAAGCATCG GCATTAGCCT TAGTGGAATG GACGGCGAAT ACCTTAGCCA CGCAACTGCC AACGCATCGC GGCGTAATGG ACACGATTGC CCAGCGAGTG AATGAACGCT ACGAGCGCCA CTGGCAGCAA ATCATCGACT TTTTGAAACT GCATTATGTC GTGAGTCGAC GCGAAGTGGA TGGCTACTGG CGCGATCACC GGGAGGCCGC ATCCATTCCA GAAAGATTAC AGCAGCAACT CGACTTATGG CGTTATCAAG CGCCAAGCAG TCACGATATT AGCTACAAAG AACCCTTATT TCCGGCGGCG AGTTTCCAAT ATGTGCTTTA TGGTATGGGC TTTAATACTA CCCTGCCCAC CCATATCAAG CCGTCGCAAC AGCAAGTTGC CCAAAGACTT TTTAGCGAGA ATCAACAAAA GGTCCATGCA TTGAGCCAAA GCCTTCCGAG CAATCGCGAC CTATTAAACA AGGTCAGGCA ATTTGGTTTC CCTAAGATTT AG
|
Protein sequence | MQQATHTAIN NIVIVGGGTA GWLTAALLAA EHNVDKGQLA HSPKLNITLI ESPDVATIGV GEGTWPSMRN TLDKIGISET EFIRQCDASF KQGSRFIHWQ HDDTEHTKPF DSNQYLHPFS LPHGHQELDL CPFWLPHSDK VSFAQAVSNQ DALTQLGLAP KTIATPEYHF QNNYGYHLDA GKFSQLLMLH CTEKLGVKYI RDHVTQVHSH ANGDIESLAT KKHGVILGDM FVDCSGTKSL LLGEHFNVPF HSQKAVLFND SALAIQVPYA EENSPIASCT LSTAQPNGWI WDIGLPTRKG MGYVYSSAHC SDDEAEQTLR AYLTNDTSTN PQSASSNALD SRKQECRKLN INPGYHAKCW QNNCIAIGMA AGFIEPLEAS ALALVEWTAN TLATQLPTHR GVMDTIAQRV NERYERHWQQ IIDFLKLHYV VSRREVDGYW RDHREAASIP ERLQQQLDLW RYQAPSSHDI SYKEPLFPAA SFQYVLYGMG FNTTLPTHIK PSQQQVAQRL FSENQQKVHA LSQSLPSNRD LLNKVRQFGF PKI
|
| |