Gene BCZK4928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4928 
Symbol 
ID3026672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp5018526 
End bp5020454 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content36% 
IMG OID637549161 
Productsulfatase 
Protein accessionYP_086498 
Protein GI52140333 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00117274 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA CCTTGAAATC ACAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA 
GCCGTATGGT TGAAGACATA TCTTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT
TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTGGT
CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAATTTTGTT
TTAACAATTA TTCTTGTTGG TAACGTAATG TTCTACGGAT TCTACAATGA CTTCGTTACT
TTACCCGTAC TAGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA
TTTAACTACA AAATCATCCT TGCATTTGCT GATATTATCG TATTCTTCAT TTTATTGAAG
AAGATGAAGA ATTTTGCACC GACAGAACGT GTAGCACGCC CAATGCGTTC CCTATACTTC
GTGTCAACAA TTGCTATTTT CTTCGCAAAC TTAGGATTGG CAGAAGCTGA GCGTCCTGAA
CTATTAACAC GTTCATTCGA CCGCGTTATG CTCGTTAAAA ACTTAGGTTT ATATGTACAC
CAAGTGTATG ATCTTGGCTT ACAAGCAAAA TCAAGTTCAC AAAAAGCATT TGCTGACGGT
AGTAAGTTAC AGGAAACAGA GAACTACGTA AAAACAACGC AAAGCAAACC AGATCCAAAT
ATGTTTGGTA CTGCAAAAGG GAAAAACGTA ATTGTTGTCT CTCTTGAGTC ATTACAAACA
TTCTTAATTG GTGCAACAGT TAACGGACAA GAAGTTACAC CATTCTTAAA CCAATTTACG
AAAGAAAGTT ATTACTTCGA TAACTTCTTC CATCAAACTG GTCAAGGAAA AACATCTGAC
GCTGAATTCT TAGTAGATAC TTCCATGTAT CCACTAGACC GTGGTGCTGT ATTCTTCACA
CACGGTAACA ACGAATATAC AGCAACTCCA GAAATTTTAC GCGAGCAAGG ATATCATACA
TCTGTATTCC ACGCGAACAA CGCAACGTTC TGGAACCGTA ACATTATGTA TCCGGCACTT
GGTTATGACC GTTACTACAA CGAGCTTGAC TACAAGATTA CGCCAGAAAC AAAATTAAAT
TGGGGATTAA AAGATATCGA ATACTTCGAT CAATCTGTCG ATATGTTAAA AGAAGTGAAG
CAACCGTTCT ACACTCGCTT CCTTACGTTA ACAAACCATT ACCCATTCAC TTATGATGAA
AGCACAAAAT TAATCGATGA ATACAATTCT GGTGATGGCG TATTTGACCG TTACATGGTA
ACTGCTCGCT ATTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAGGGT
ATTTACGACA ATTCAATTAT CGTATTCTAC GGTGATCACT ACGGTATTTC TGAAAACCAT
AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATTA CTGCATTTGA CCATATGAAC
TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCA
AAACCAACTG GTGAAATTGA CATTAAACCA ACAATTCTAA ACTTACTTGG TATAGATTCT
ACGAATCAAA TTCAATTTGG TCATGATGTA TTCTCACCAG AAAATAAAGG ATTTGTTGTT
CTTCGTGACG GTAGCTTCGT TACAGATAAG TACATGTATA CGAATAGTAC ATTCTACGAC
CGTGCTACTG GCGAAGTTGT ACAATTACCA AAAGAAGAAT CTCAACCACT CATTGATCGT
GCCCAAAATG AATTGAACAT GTCTGACAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT
GAAAGCAACA AGACAAAAAC TGGTGAAGTA AAGACAGCTA TTAAAGAAGA AAAGAAGAGC
GCTGAGTAA
 
Protein sequence
MKETLKSQFQ NVRFTVFVAL AVWLKTYLIT RTSFDLKLES FMQEFILFLS PLAASLLLVG 
LALFAKGKKR NYIALGINFV LTIILVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL
FNYKIILAFA DIIVFFILLK KMKNFAPTER VARPMRSLYF VSTIAIFFAN LGLAEAERPE
LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN
MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD
AEFLVDTSMY PLDRGAVFFT HGNNEYTATP EILREQGYHT SVFHANNATF WNRNIMYPAL
GYDRYYNELD YKITPETKLN WGLKDIEYFD QSVDMLKEVK QPFYTRFLTL TNHYPFTYDE
STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSIIVFY GDHYGISENH
NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGIDS
TNQIQFGHDV FSPENKGFVV LRDGSFVTDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR
AQNELNMSDK IIEGDLLRFS ESNKTKTGEV KTAIKEEKKS AE