Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCE_3797 |
Symbol | |
ID | 2751716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | + |
Start bp | 3540755 |
End bp | 3542674 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637280598 |
Product | sulfatase |
Protein accession | NP_980094 |
Protein GI | 42782847 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000542687 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA GCAGTATGGT TAAAAACATA TCTTATTACA AGGTTCAGTT TTGATTTAAA AATTGAATCT TCTACACAAG AGCTTATTTT GTTTATTAGT CCACTCGCTA CATCATTAGC TTTTGTCGGA TTAGCATTAT TTGCAACCGG TGAAAAAAGA AATTATATCG CGCTATGTAT CAACTTCGTA TTAACAATCG TACTCGTTGG AAATGTAATG TTCTATGATT TTTATAGCGA TTTCGTTACG TTACCAGTAC TTGGACAAAC ATCAAACTTC GGCCAATTAG GCGGCAGTAT TATAGAAATA TTGAACTACA AAATTATACT CGCATTCGTA GACATTATTT TCTTCTTTAT TTTATTGAAG AAAAAATCAA TAGTCTTCCA AACAGGACGT GTAACTCATC CGGCACGCTT GTTATATTTT CTTTTAACGA TTGGTGTATT TTTTGCTAAT CTACATCTTG CAGAAAAAGA GCGACCTGAA CTATTAACAA GATCATTTGA CCGAGTTATG CTTGTTAAAA ATTTAGGACT ATACACGCAT CAAGTTTATG ATTTAACACT GCAAGTAAAA GCAGGATCAC AAAAGGCACT TGCTGATAGT AGCAAATTAC AAGAAACTGA AAACTACGTG AAGGCAAACC AAAGTGAGCC GAACCCTAAT ATGTTTGGTG TAGCGAAAGG AAAAAACGTA ATTGTTGTCA CTCTTGAATC CCTGCAAACC TTTTTAATAG GAGCAACAGT TAACGGACAA GAAGTTACAC CATTTTTGAA TGAATTCATA AATGAAAGCT ATTACTTTGA TAACTTTTTC CACCAAACTG GACAAGGAAA AACATCCGAT TCTGAATTTC TAATCGATAC GTCATTGTAT CCATTAAATC GAGGAGCTGT ATTCTTCACA CACGGTAACA ACGATTATAC TGCGACTCCA GAAATTCTAC GTCAGCAAGG TTATTTCACT TCTGTATTCC ATGCGAATAA CGCCACATTT TGGAATCGTA ACATTATGTA CTCTGCTCTC GGTTATGATC GTTACTATAA TGAACTTGAT TACAAAATTA CGCCCGAAAC AAATTTAAAT TGGGGCTTAA AAGATATCGA ATACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT CAACCATTTT ACGCTCGTTT TCTTACTTTA ACGAACCATT ATCCATTTAC GTATGATGAA GATACACAAT TCATTGAACC ATATAACTCT GGTAATGGCG TATTTGATCG TTATATTGTA ACAGCACGTT ACTTAGACGA ATCAATAAAA AAATTTATTG AGCGTTTAAA AGCTGAGGGA ATGTATGATG ATTCTATTAT TGTTTTATAT GGTGATCATT ATGGCATTTC TGAAAAACAT AATCGCGCGA TGGCACAGTT TTTAGAAAAA GATCAAATAA CAGAATTTGA TACTTTAAAT TTACAACGCA CACCTTTATA TATTCATATC CCTGGACAAA CAGAAGGTCG AACGATTTCA AAGCCTACGG GACAAATCGA TATGAAACCT ACTATTCTCA ATTTATTAGG AATTGATACT ACAAACGATA TTCGTTTTGG CCATGATATG TTTTCGGATG AATATGCCGG TTTTGTTGTT TTACGTGATG GTAGCTTCAT TACAGACAAG TATGCATACA AAAACAATAC TTTCTATGAC CGCATAACAG GTGAAATTGT AGACTTACCA AAAGAAGAAG CAGAAGCACT CATTAAACGT GCACAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GCGATTTATT ACGTTTCTCA GAAAGTAATA AAATTAAAAC TGGTGAAGTA CAAACTAAAA TTAAAGAGAC TGAAAAATAA
|
Protein sequence | MKNKINLQMQ NISFVLIIAL AVWLKTYLIT RFSFDLKIES STQELILFIS PLATSLAFVG LALFATGEKR NYIALCINFV LTIVLVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI LNYKIILAFV DIIFFFILLK KKSIVFQTGR VTHPARLLYF LLTIGVFFAN LHLAEKERPE LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANQSEPNPN MFGVAKGKNV IVVTLESLQT FLIGATVNGQ EVTPFLNEFI NESYYFDNFF HQTGQGKTSD SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE DTQFIEPYNS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH NRAMAQFLEK DQITEFDTLN LQRTPLYIHI PGQTEGRTIS KPTGQIDMKP TILNLLGIDT TNDIRFGHDM FSDEYAGFVV LRDGSFITDK YAYKNNTFYD RITGEIVDLP KEEAEALIKR AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK
|
| |