Gene BCG9842_B5606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5606 
Symbol 
ID7184838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp5097049 
End bp5098977 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content37% 
IMG OID643553120 
Productsulfatase 
Protein accessionYP_002448761 
Protein GI218900350 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000063545 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value7.68096e-25 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAGAAA CCTTGAAATC GCAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA 
GCCTTATGGC TGAAAACTTA CATTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT
TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTGGT
CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAACTTTGTC
TTAACAATCG TTCTTGTTGG TAACGTAATG TTCTACGGAT TCTACAATGA CTTCGTTACT
TTACCAGTAC TCGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA
TTTAACTACA AAATTATTCT TGCATTCGCA GACATTATCG TATTCTTCGT TCTATTGAAG
AAAATGAAAA ACTTTGCACC TACAGAGCGT GTAGCACGCC CGATGCGTTC TCTATACTTC
GTATCAACAG TTGCTATTTT CTTCGCAAAC TTAGGACTTG CAGAAGCTGA ACGCCCAGAA
CTATTAACAC GTTCATTCGA CCGCGTTATG CTTGTTAAAA ACTTAGGATT ATATGTACAC
CAAGTATACG ATCTTGGCTT ACAAGCGAAA TCAAGTTCAC AAAAAGCATT CGCTGACGGT
AGTAAATTAC AAGAAACAGA AAACTACGTG AAAACAACAC AAAGTAAACC AGATCCAAAT
ATGTTCGGTA CTGCAAAAGG GAAAAACGTA ATTGTTGTCT CTCTTGAGTC ATTACAAACA
TTCTTAATTG GCGCAACAGT TAACGGACAA GAAGTTACAC CGTTCTTAAA CCAATTTACG
AAAGAAAGTT ATTACTTTGA TAACTTCTTC CATCAAACTG GGCAAGGTAA AACATCTGAT
GCTGAATTCC TAGTAGATAC TTCTTTATAT CCACTAGACC GCGGTGCTGT ATTCTTCACA
CACGGTAACA ATGAATATAC AGCAACTCCA GAAATTTTAC GTCAGCAAGG GTATCACACG
TCGGTATTCC ACGCAAACAA CGCAACATTC TGGAACCGTA ACATTATGTA TCCAGCACTT
GGTTATGATC GTTACTACAA CGAGCTTGAT TACAAGATTA CGCCGGAAAC AAAATTAAAC
TGGGGATTAA AAGATATCGA ATACTTCGAT CAATCTATTG ATATGTTGAA AGAAGTGAAG
CAACCATTCT ATACTCGCTT CCTTACTTTA ACGAATCATT ACCCATTCAC TTACGATGAC
AGTACAAAAT TAATCGACGA ATATAATTCT GGTGATGGAG TATTTGACCG TTACATGGTA
ACTGCTCGTT ACTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAAGGT
ATTTACGACA ACTCAGTAAT CGTATTCTAC GGTGACCACT ACGGTATTTC TGAAAACCAT
AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATCA CTGCATTTGA CCATATGAAC
TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCG
AAACCAACTG GTGAAATTGA TATTAAACCA ACTATTCTAA ACTTACTTGG TGTAGATTCT
ACAAATGACG TTCGATTCGG TCATGATGTA TTCTCACCTG ATAATAAAGG ATTTGTTGTT
CTTCGTGACG GTAGCTTCAT TACAGATAAA TACATGTACA CAAACAGTAC ATTCTACGAC
CGTGCTACTG GCGAAGTTGT ACAATTACCG AAAGAAGAAT CTCAACCACT AATTGATCGT
GCCCAAAATG AATTGCACAT GTCTGATAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT
GAAAGTAATA AGATCAAAAC TGGTGAAGTA AAAACAGCTA TTAAAGAAGA AAAGAAGAGC
GCTGAGTAA
 
Protein sequence
MKETLKSQFQ NVRFTVFVAL ALWLKTYIIT RTSFDLKLES FMQEFILFLS PLAASLLLVG 
LALFAKGKKR NYIALGINFV LTIVLVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL
FNYKIILAFA DIIVFFVLLK KMKNFAPTER VARPMRSLYF VSTVAIFFAN LGLAEAERPE
LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN
MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD
AEFLVDTSLY PLDRGAVFFT HGNNEYTATP EILRQQGYHT SVFHANNATF WNRNIMYPAL
GYDRYYNELD YKITPETKLN WGLKDIEYFD QSIDMLKEVK QPFYTRFLTL TNHYPFTYDD
STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSVIVFY GDHYGISENH
NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGVDS
TNDVRFGHDV FSPDNKGFVV LRDGSFITDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR
AQNELHMSDK IIEGDLLRFS ESNKIKTGEV KTAIKEEKKS AE