Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_1436 |
Symbol | |
ID | 5753165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 1722989 |
End bp | 1724488 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641287706 |
Product | tryptophan halogenase |
Protein accession | YP_001553870 |
Protein GI | 160874554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.333804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTGA AACAAGCAAT CAGAAAAATC ATTATCTTAG GTGGCGGTAC CGCAGGCTGG ATGGCCGCGG CCGCCCTCGC GAATAACCCC GTATTTGCCG CCATTGAACT CTGTTTAGTG GAATCGGACA ACATAGGCAC TATCGGCGTT GGCGAAGGCT CTACGCCACA CCTAAAACGC TTTATGGACA ACTTAGGCAT TAGCGAAAAA GATTGGATGG AGCAATGCCA CGCCAGCTAT AAAACCGGCA TCGACTTTAT CAATTGGAAT GGTGACGGGC AGCAATACTT CCACCCTTTC TATTTTCAAA TGGATGTGAA ACCCGCTGAA GTTTTCTTTA TCAACGCCAA TGCAAGACGC CGAGGGCATG GCAATGCCGT TAAACCCGAT GCGTTTTTCT CTAGCGGCGT ACTGGCCAAA CACAATCTCA GCCCAAGGCC GAATAAAACA CTGCCCTGCG CTAACGAATA CGGCTATCAC TTCGATGCAA CTGAGCTGGC CAATTACTTA AAAGATTATG CCTGCCAACG TGGCGTACGG CAGATCATTG CTGATGTGGT CGAGGTTTCA ACCTCCCCAA ACCAGCAGAT TGAGACACTG ATCCTGGCAA ATGGCGAACG ACTGAGCGCC GACTTTTTCA TCGATGCCAG CGGTTTTAGC GCTAAGTTAA TCCATAAAGC CCTAGGCGTG CCCTTTCAAT CCTTCGCCGA GGAGTTGCTC AACGACAGCG CCGTCACTGT ACCTGCATTA ATGGACTCCA CTCAAACGCA GCAGGCAAAG TACCATACCC GCGCCACAGC GCTCAGTGCA GGCTGGCTAT GGCAAATCCC CCTCACCCAC AGGCTTGGCA ATGGCTATGT TTATAGCAGT CGCCACCTCA GTGCCAATGC CGCGGCTAAG GAATTATTGC ACAGTGTAAA TTTACCCGAG TCGACCCAAG TGCGTTTTCT CAAGCTGCGG GTCGGCGTCA GTGACAAGGC TTGGCACAAC AATGTACTGG CGATAGGGCT TGCACAAAGC TTTATCGAGC CACTCGAAGC CACGTCCATC ATGATGACCC AATTCACCCT TGAAAGATTT ATGTCGTTAT TCGAGCGTTA TCAATTAAAT AAGCAGGCAG AAACCTTAAG TCGGCAAACA TTAAATCAAG CAGTAATGCA GTTAGTGCTC GGCATAAAGG ATTACATTCA GGCCCATTAT GTCACCAGTC AGCGCAGCGA ACCCTATTGG ATGGCAGCGA GAAAAGTCGC TATTTCCCAA AGGCTGACGC AACTATTAAA AGCTTGGTAT CAAGGGGAAG ACTTTGATCT GCTGCTTTAC CAATACGATC AGCAGCTCGC CTACTTTCGA CCCTCTTGGT ATGCGCTGTT AGCGGGAATG GATTACCGCG ACCCCAAACT GAAGCGCCCA TTTGAGCCGA TTTCAGCGGA CATCACCGCC CAGGCAATAA GCTACTGCCA AACCTTAGTG GAGCAGTATT TTCAGCCTAA GCATTCGTGA
|
Protein sequence | MKVKQAIRKI IILGGGTAGW MAAAALANNP VFAAIELCLV ESDNIGTIGV GEGSTPHLKR FMDNLGISEK DWMEQCHASY KTGIDFINWN GDGQQYFHPF YFQMDVKPAE VFFINANARR RGHGNAVKPD AFFSSGVLAK HNLSPRPNKT LPCANEYGYH FDATELANYL KDYACQRGVR QIIADVVEVS TSPNQQIETL ILANGERLSA DFFIDASGFS AKLIHKALGV PFQSFAEELL NDSAVTVPAL MDSTQTQQAK YHTRATALSA GWLWQIPLTH RLGNGYVYSS RHLSANAAAK ELLHSVNLPE STQVRFLKLR VGVSDKAWHN NVLAIGLAQS FIEPLEATSI MMTQFTLERF MSLFERYQLN KQAETLSRQT LNQAVMQLVL GIKDYIQAHY VTSQRSEPYW MAARKVAISQ RLTQLLKAWY QGEDFDLLLY QYDQQLAYFR PSWYALLAGM DYRDPKLKRP FEPISADITA QAISYCQTLV EQYFQPKHS
|
| |