Gene Information |
Locus tag | BCAH820_5321 |
Symbol | |
ID | 7188018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | + |
Start bp | 5016743 |
End bp | 5018671 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643558731 |
Product | sulfatase |
Protein accession | YP_002454241 |
Protein GI | 218906407 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
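
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5016743
END_BP = 5018671
GENE_LENGTH_BP = 1929
PROTEIN_LENGTH_AA = 642


def cds_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    print(cds_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length agree with the start and end positions.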
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 3.10214e-55 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGAAA CCTTGAAATC ACAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA GCCGTATGGT TGAAGACATA TCTTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTGGT CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAATTTTGTT TTAACAATTA TTCTTGTAGG TAACGTAATG TTCTACGGAT TCTATAATGA CTTCGTTACT TTACCCGTAC TAGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA TTTAACTACA AAATCATCCT TGCATTTGCT GATATTATCG TATTCTTCGT TTTATTGAAG AAGATGAAGA ATTTTGCACC GACAGAACGT GTAGCACGCC CAATGCGTTC CCTATACTTC GTGTCAACAA TTGCTATTTT CTTCGCAAAC TTAGGACTGG CAGAAGCTGA GCGTCCTGAA CTATTAACAC GTTCATTCGA CCGCGTTATG CTCGTTAAAA ACTTAGGTTT ATATGTACAC CAAGTGTATG ACCTTGGCTT ACAAGCAAAA TCAAGTTCAC AAAAAGCATT TGCTGACGGT AGTAAGTTAC AGGAAACAGA GAACTACGTA AAAACAACGC AAAGCAAACC AGATCCAAAT ATGTTTGGTA CTGCAAAAGG GAAAAACGTA ATTGTCGTCT CTCTTGAGTC ATTACAAACA TTCTTAATTG GTGCAACAGT TAACGGACAA GAAGTTACAC CATTCTTAAA CCAATTTACG AAAGAAAGTT ATTACTTCGA TAACTTCTTC CATCAAACTG GTCAAGGAAA AACATCTGAC GCTGAATTCT TAGTAGATAC TTCCATGTAT CCACTAGACC GTGGTGCTGT ATTCTTCACA CACGGTAACA ACGAATACAC AGCAACTCCA GAAATTTTAC GTGAGCAAGG ATATCACACA TCTGTATTCC ACGCGAACAA TGCAACGTTC TGGAACCGTA ACATTATGTA TCCGGCACTT GGTTATGACC GTTACTACAA CGAGCTTGAC TACAAGATTA CGCCAGAAAC AAAATTAAAT TGGGGATTAA AAGATATCGA ATACTTCGAT CAATCTATCG ATATGTTAAA AGAAGTGAAG CAACCGTTCT ACACTCGCTT CCTTACGTTA ACAAACCATT ACCCATTCAC TTATGATGAA AGCACAAAAT TAATCGATGA ATACAATTCT GGTGATGGCG TATTTGACCG TTACATGGTA ACTGCTCGCT ATTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAGGGT ATTTACGACA ACTCAATTAT CGTATTCTAC GGTGATCACT ACGGTATTTC TGAAAACCAT AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATTA CTGCATTTGA CCATATGAAC TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCA AAACCAACTG GTGAAATTGA CATTAAACCA ACAATTCTAA ACTTACTTGG TATAGATTCT ACGAATCAAA TTCAATTTGG TCATGATGTA TTCTCACCAG AAAATAAAGG ATTTGTTGTT CTTCGTGACG GTAGCTTCGT TACAGATAAG TACATGTATA CGAACAGTAC ATTCTACGAC CGTGCTACTG GCGAAGTTGT ACAATTACCA AAAGAAGAAT CTCAACCACT CATTGATCGT 
GCTCAAAATG AATTGAACAT GTCTGACAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT GAAAGCAACA AGACAAAAAC TGGTGAAGTA AAGACAGCTA TTAAAGAAGA AAAGAAGAGC GCTGAGTAA
|
Protein sequence | MKETLKSQFQ NVRFTVFVAL AVWLKTYLIT RTSFDLKLES FMQEFILFLS PLAASLLLVG LALFAKGKKR NYIALGINFV LTIILVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL FNYKIILAFA DIIVFFVLLK KMKNFAPTER VARPMRSLYF VSTIAIFFAN LGLAEAERPE LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD AEFLVDTSMY PLDRGAVFFT HGNNEYTATP EILREQGYHT SVFHANNATF WNRNIMYPAL GYDRYYNELD YKITPETKLN WGLKDIEYFD QSIDMLKEVK QPFYTRFLTL TNHYPFTYDE STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSIIVFY GDHYGISENH NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGIDS TNQIQFGHDV FSPENKGFVV LRDGSFVTDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR AQNELNMSDK IIEGDLLRFS ESNKTKTGEV KTAIKEEKKS AE
|
| |
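
The Gene sequence field is printed in 10-base blocks separated by spaces, so it needs to be cleaned before any analysis. The sketch below shows one way to do that and to recompute a GC fraction comparable to the GC content field (37%). The helper names are hypothetical, and the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, as a value between 0 and 1."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Toy input only; paste the full Gene sequence field here to check the record.
    toy = clean_sequence("ATGGCT GAGTAA")
    print(len(toy), gc_fraction(toy), ends_with_stop(toy))
```

To check the record itself, pass the full Gene sequence value to `clean_sequence`, then compare `len(...)` with the Gene Length field and `gc_fraction(...)` with the GC content field.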