Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B5606 |
Symbol | |
ID | 7184838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 5097049 |
End bp | 5098977 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643553120 |
Product | sulfatase |
Protein accession | YP_002448761 |
Protein GI | 218900350 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000063545 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 7.68096e-25 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAGAAA CCTTGAAATC GCAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA GCCTTATGGC TGAAAACTTA CATTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTGGT CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAACTTTGTC TTAACAATCG TTCTTGTTGG TAACGTAATG TTCTACGGAT TCTACAATGA CTTCGTTACT TTACCAGTAC TCGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA TTTAACTACA AAATTATTCT TGCATTCGCA GACATTATCG TATTCTTCGT TCTATTGAAG AAAATGAAAA ACTTTGCACC TACAGAGCGT GTAGCACGCC CGATGCGTTC TCTATACTTC GTATCAACAG TTGCTATTTT CTTCGCAAAC TTAGGACTTG CAGAAGCTGA ACGCCCAGAA CTATTAACAC GTTCATTCGA CCGCGTTATG CTTGTTAAAA ACTTAGGATT ATATGTACAC CAAGTATACG ATCTTGGCTT ACAAGCGAAA TCAAGTTCAC AAAAAGCATT CGCTGACGGT AGTAAATTAC AAGAAACAGA AAACTACGTG AAAACAACAC AAAGTAAACC AGATCCAAAT ATGTTCGGTA CTGCAAAAGG GAAAAACGTA ATTGTTGTCT CTCTTGAGTC ATTACAAACA TTCTTAATTG GCGCAACAGT TAACGGACAA GAAGTTACAC CGTTCTTAAA CCAATTTACG AAAGAAAGTT ATTACTTTGA TAACTTCTTC CATCAAACTG GGCAAGGTAA AACATCTGAT GCTGAATTCC TAGTAGATAC TTCTTTATAT CCACTAGACC GCGGTGCTGT ATTCTTCACA CACGGTAACA ATGAATATAC AGCAACTCCA GAAATTTTAC GTCAGCAAGG GTATCACACG TCGGTATTCC ACGCAAACAA CGCAACATTC TGGAACCGTA ACATTATGTA TCCAGCACTT GGTTATGATC GTTACTACAA CGAGCTTGAT TACAAGATTA CGCCGGAAAC AAAATTAAAC TGGGGATTAA AAGATATCGA ATACTTCGAT CAATCTATTG ATATGTTGAA AGAAGTGAAG CAACCATTCT ATACTCGCTT CCTTACTTTA ACGAATCATT ACCCATTCAC TTACGATGAC AGTACAAAAT TAATCGACGA ATATAATTCT GGTGATGGAG TATTTGACCG TTACATGGTA ACTGCTCGTT ACTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAAGGT ATTTACGACA ACTCAGTAAT CGTATTCTAC GGTGACCACT ACGGTATTTC TGAAAACCAT AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATCA CTGCATTTGA CCATATGAAC TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCG AAACCAACTG GTGAAATTGA TATTAAACCA ACTATTCTAA ACTTACTTGG TGTAGATTCT ACAAATGACG TTCGATTCGG TCATGATGTA TTCTCACCTG ATAATAAAGG ATTTGTTGTT CTTCGTGACG GTAGCTTCAT TACAGATAAA TACATGTACA CAAACAGTAC ATTCTACGAC CGTGCTACTG GCGAAGTTGT ACAATTACCG AAAGAAGAAT CTCAACCACT AATTGATCGT GCCCAAAATG AATTGCACAT GTCTGATAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT GAAAGTAATA AGATCAAAAC TGGTGAAGTA AAAACAGCTA TTAAAGAAGA AAAGAAGAGC GCTGAGTAA
|
Protein sequence | MKETLKSQFQ NVRFTVFVAL ALWLKTYIIT RTSFDLKLES FMQEFILFLS PLAASLLLVG LALFAKGKKR NYIALGINFV LTIVLVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL FNYKIILAFA DIIVFFVLLK KMKNFAPTER VARPMRSLYF VSTVAIFFAN LGLAEAERPE LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD AEFLVDTSLY PLDRGAVFFT HGNNEYTATP EILRQQGYHT SVFHANNATF WNRNIMYPAL GYDRYYNELD YKITPETKLN WGLKDIEYFD QSIDMLKEVK QPFYTRFLTL TNHYPFTYDD STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSVIVFY GDHYGISENH NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGVDS TNDVRFGHDV FSPDNKGFVV LRDGSFITDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR AQNELHMSDK IIEGDLLRFS ESNKIKTGEV KTAIKEEKKS AE
|
| |