Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH187_A3804 |
Symbol | |
ID | 7074274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH187 |
Kingdom | Bacteria |
Replicon accession | NC_011658 |
Strand | + |
Start bp | 3549962 |
End bp | 3551881 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643452235 |
Product | sulfatase |
Protein accession | YP_002339746 |
Protein GI | 217961178 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00040285 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA GCAGTATGGT TAAAAACATA TCTTATTACA AGATTCAGTT TTGATTTAAA AATTGAGTCT TCTACACAAG AGCTGATTTT ATTTATTAGC CCTCTCGCTG CTTCATTAGC TTTTGTCGGA TGCGCATTAT TTGCAACCGG TGAAAAGCGA AATTATATCG CGCTATGTAT TAATTTCTTA TTAACAATTG TACTTGTTGG GAATGTAATG TTCTATGATT TTTATAGTGA TTTCGTTACG TTACCAGTAC TTGGACAAAC ATCAAACTTT GGCCAATTAG GCGGTAGTAT TATAGAAATA TTGAACTACA AAATTATACT CGCATTCGCA GACATTATTT TCTTCTTTAT TTTATTAAAG AAGAAGGCAC TAGTCTTCCA AACGGGGCGC GTAACTCATT CGGCACGCTT GTTATATTTT CTCTTAACGA TTGGTGTATT TTTTGCAAAT CTACATCTTG CAGAAAAAGA GCGACCTGAA TTACTAACGA GATCATTCGA CCGAGTCATG CTTGTTAAAA ATTTAGGACT ATACACGCAT CAAGTATATG ATTTAACACT ACAAGTAAAA GCAGGATCAC AAAAGGCACT TGCTGATAGT AGTAAATTAC AAGAAACAGA AAACTACGTG AAGGCAAACG AAAGTCAGCC GAATCCTAAC ATGTTTGGTG TAGCGAAAGG AAAAAACGTA ATTGTTGTTA CTCTTGAATC CTTACAAACC TTTTTAATAG GCGCAACAGT TAACGGACAA GAAGTTACAC CATTTTTGAA TGAATTCATA AATGAAAGCT ATTACTTTGA TAACTTTTTC CACCAAACTG GGCAAGGAAA AACATCCGAT TCTGAATTTC TAATTGATAC GTCGTTATAT CCATTAAATC GAGGAGCTGT ATTCTTCACA CACGGTAACA ATGATTATAC TGCGACTCCA GAAATTTTAC GTCAGCAAGG TTATTTCACT TCTGTATTCC ATGCGAATAA CGCAACATTT TGGAATCGTA ATATCATGTA CTCTGCTCTC GGTTATGATC GTTACTATAA TGAACTTGAT TACAAAATTA CGCCCGAAAC TAATTTAAAT TGGGGATTAA AAGATATCGA ATACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT CAACCATTTT ATGCTCGTTT CCTTACTTTA ACGAACCATT ATCCATTTAC GTATGATGAA GATACAAAAT TCATTGAACC ATATAACTCT GGTAATGGCG TATTTGATCG TTATATCGTA ACAGCACGTT ACTTAGACGA ATCAATTAAA AAATTTATTG AGCGTTTAAA AGCCGAGGGA ATGTATGATG ATTCTATTAT TGTATTATAC GGTGATCATT ATGGCATTTC TGAAAAACAT AATCGTGCAA TGGCACAGTT TTTAGAAAAA GATCAAATAA CAGAATTTGA TACTTTAAAT TTACAACGTA CACCTTTATA TATTCATGTC CCTGGACAAA CAGAAGGTCA AACGATTTCA AAGCCTACGG GACAAATCGA TATGAAACCT ACTATTCTCA ATTTATTAGG GATTGATACT ACGAACGATA TTCGTTTTGG TCATGATATG TTTTCAGATG AATATACCGG TTTTGTTGTT TTACGTGATG GTAGCTTCAT TACAGACAAA TATGCATACA AAAACAACAC TTTCTATGAC CGCATAACAG GTGAAATTGT AGATTTACCA AAAAAAGAAG CTCAAGCACT CATTAACCGT GCACAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GAGATTTACT ACGTTTCTCA GAAAGTAATA AAATTAAAAC TGGCGAAGTA CAAACTAAAA TTAAAGAAAC TGAAAAATAA
|
Protein sequence | MKNKINLQMQ NISFVLIIAL AVWLKTYLIT RFSFDLKIES STQELILFIS PLAASLAFVG CALFATGEKR NYIALCINFL LTIVLVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI LNYKIILAFA DIIFFFILLK KKALVFQTGR VTHSARLLYF LLTIGVFFAN LHLAEKERPE LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANESQPNPN MFGVAKGKNV IVVTLESLQT FLIGATVNGQ EVTPFLNEFI NESYYFDNFF HQTGQGKTSD SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE DTKFIEPYNS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH NRAMAQFLEK DQITEFDTLN LQRTPLYIHV PGQTEGQTIS KPTGQIDMKP TILNLLGIDT TNDIRFGHDM FSDEYTGFVV LRDGSFITDK YAYKNNTFYD RITGEIVDLP KKEAQALINR AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK
|
| |