Gene Information |
Locus tag | BCZK4928 |
Symbol | |
ID | 3026672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 5018526 |
End bp | 5020454 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637549161 |
Product | sulfatase |
Protein accession | YP_086498 |
Protein GI | 52140333 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
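The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5018526
END_BP = 5020454


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, expected_protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1929 bp) and Protein Length (642 aa) fields of the record.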
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00117274 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAGAAA CCTTGAAATC ACAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA GCCGTATGGT TGAAGACATA TCTTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTGGT CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAATTTTGTT TTAACAATTA TTCTTGTTGG TAACGTAATG TTCTACGGAT TCTACAATGA CTTCGTTACT TTACCCGTAC TAGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA TTTAACTACA AAATCATCCT TGCATTTGCT GATATTATCG TATTCTTCAT TTTATTGAAG AAGATGAAGA ATTTTGCACC GACAGAACGT GTAGCACGCC CAATGCGTTC CCTATACTTC GTGTCAACAA TTGCTATTTT CTTCGCAAAC TTAGGATTGG CAGAAGCTGA GCGTCCTGAA CTATTAACAC GTTCATTCGA CCGCGTTATG CTCGTTAAAA ACTTAGGTTT ATATGTACAC CAAGTGTATG ATCTTGGCTT ACAAGCAAAA TCAAGTTCAC AAAAAGCATT TGCTGACGGT AGTAAGTTAC AGGAAACAGA GAACTACGTA AAAACAACGC AAAGCAAACC AGATCCAAAT ATGTTTGGTA CTGCAAAAGG GAAAAACGTA ATTGTTGTCT CTCTTGAGTC ATTACAAACA TTCTTAATTG GTGCAACAGT TAACGGACAA GAAGTTACAC CATTCTTAAA CCAATTTACG AAAGAAAGTT ATTACTTCGA TAACTTCTTC CATCAAACTG GTCAAGGAAA AACATCTGAC GCTGAATTCT TAGTAGATAC TTCCATGTAT CCACTAGACC GTGGTGCTGT ATTCTTCACA CACGGTAACA ACGAATATAC AGCAACTCCA GAAATTTTAC GCGAGCAAGG ATATCATACA TCTGTATTCC ACGCGAACAA CGCAACGTTC TGGAACCGTA ACATTATGTA TCCGGCACTT GGTTATGACC GTTACTACAA CGAGCTTGAC TACAAGATTA CGCCAGAAAC AAAATTAAAT TGGGGATTAA AAGATATCGA ATACTTCGAT CAATCTGTCG ATATGTTAAA AGAAGTGAAG CAACCGTTCT ACACTCGCTT CCTTACGTTA ACAAACCATT ACCCATTCAC TTATGATGAA AGCACAAAAT TAATCGATGA ATACAATTCT GGTGATGGCG TATTTGACCG TTACATGGTA ACTGCTCGCT ATTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAGGGT ATTTACGACA ATTCAATTAT CGTATTCTAC GGTGATCACT ACGGTATTTC TGAAAACCAT AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATTA CTGCATTTGA CCATATGAAC TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCA AAACCAACTG GTGAAATTGA CATTAAACCA ACAATTCTAA ACTTACTTGG TATAGATTCT ACGAATCAAA TTCAATTTGG TCATGATGTA TTCTCACCAG AAAATAAAGG ATTTGTTGTT CTTCGTGACG GTAGCTTCGT TACAGATAAG TACATGTATA CGAATAGTAC ATTCTACGAC CGTGCTACTG GCGAAGTTGT ACAATTACCA AAAGAAGAAT CTCAACCACT CATTGATCGT 
GCCCAAAATG AATTGAACAT GTCTGACAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT GAAAGCAACA AGACAAAAAC TGGTGAAGTA AAGACAGCTA TTAAAGAAGA AAAGAAGAGC GCTGAGTAA
Protein sequence | MKETLKSQFQ NVRFTVFVAL AVWLKTYLIT RTSFDLKLES FMQEFILFLS PLAASLLLVG LALFAKGKKR NYIALGINFV LTIILVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL FNYKIILAFA DIIVFFILLK KMKNFAPTER VARPMRSLYF VSTIAIFFAN LGLAEAERPE LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD AEFLVDTSMY PLDRGAVFFT HGNNEYTATP EILREQGYHT SVFHANNATF WNRNIMYPAL GYDRYYNELD YKITPETKLN WGLKDIEYFD QSVDMLKEVK QPFYTRFLTL TNHYPFTYDE STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSIIVFY GDHYGISENH NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGIDS TNQIQFGHDV FSPENKGFVV LRDGSFVTDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR AQNELNMSDK IIEGDLLRFS ESNKTKTGEV KTAIKEEKKS AE
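The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it uses only the first 30 bases of the gene sequence; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order ('*' marks a stop codon).
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAAGAAA CCTTGAAATC ACAATTTCAA"))
```

The printed residues correspond to the first block of the protein sequence above. Passing the full gene sequence would also yield a trailing `*` for the terminal TAA stop codon, which the protein sequence field omits.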