Gene BCZK3517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK3517 
Symbol 
ID3024602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp3643154 
End bp3645073 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content33% 
IMG OID637547733 
Productsulfatase 
Protein accessionYP_085099 
Protein GI52141730 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA 
GCAGTATGGT TAAAAACATA TCTTATTACA AGGTTCAGTT TTGATTTGAA AATTGAGTCT
TCTACACAAG AGCTGATTTT ATTTATTAGC CCACTCGCTG CATCATTAGC TTTTGTCGGA
TTAGCATTAT TTGCAACCGG TGAAAAGCGA AATTATATCG CGCTATGTAT TAATTTCTTA
TTAACAATTG TACTTGTTGG GAATGTAATG TTCTATGATT TTTATAGTGA TTTCGTTACG
TTACCAGTAC TTGGACAAAC ATCAAACTTT GGCCAATTAG GCGGTAGTAT TATAGAAATA
TTGAACTACA AAATTATACT CGCATTCGCA GACATTATTT TCTTCTTTAT TTTATTAAAG
AAGAAGGCAC TAGTCTTCCA AACGGGGCGC GTAACTCATT CGGCACGCTT GTTATATTTT
CTCTTAACAA TTGGTGTATT TTTTGCAAAT CTACATCTTG CAGAAAAAGA GCGACCTGAA
TTACTAACGA GATCATTCGA CCGAGTCATG CTTGTTAAAA ATTTAGGACT ATACACGCAT
CAAGTATATG ATTTAACACT ACAAGTAAAA GCAGGATCAC AAAAGGCACT TGCTGATAGT
AGTAAATTAC AAGAAACAGA AAACTACGTA AAGGCAAACC AAAGTGAACC GAATCCTAAT
ATGTTTGGTG CAGCGAAAGG AAAAAACGTA ATTGTCGTTA CTCTTGAATC CTTGCAAACC
TTTTTAATAG GCGCATCAGT GAACGGTCAA GAAGTTACAC CATTCTTGAA TGAATTCATA
AATGAAAGCT ATTACTTTAA TAACTTTTTC CACCAAACTG GGCAAGGAAA AACGTCCGAT
TCTGAATTTC TAATTGATAC GTCGTTGTAT CCATTAAATC GAGGAGCTGT ATTTTTCACA
CACGGTAACA ATGATTATAC TGCGACGCCA GAAATTTTAC GTCAGCAAGG TTATTTCACT
TCTGTATTCC ATGCGAATAA TGCAACATTT TGGAATCGTA ACATTATGTA CTCTGCTCTC
GGTTATGATC GTTACTATAA TGAACTTGAT TACAAAATTA CGCCTGAAAC AAATTTAAAT
TGGGGCTTAA AAGATATCGA GTACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT
CAACCATTTT ACGCTCGTTT CCTTACTTTA ACAAACCATT ATCCATTTAC GTATGATGAA
GATACTAAAT TCATTGAACC ATATAACTCT GGTAATGGCG TGTTTGATCG TTATATCGTA
ACAGCACGTT ACTTAGACGA ATCCATTAAA AAATTTATTG AGCGTTTAAA AGCTGAGGGA
ATGTATGATG ATTCTATTAT TGTTTTATAT GGTGATCATT ATGGCATTTC TGAAAAACAT
AATCGTGCAA TGGCACAGTT TTTAGAAAAA GATCAAATAA CAGAATTTGA TACTCTGAAT
TTACAACGCA CACCTTTATA TATTCATATG CCTGGCCAAA CAGAAGGTGA AACGATTTCA
AAGCCTACAG GACAAATTGA TATAAAACCT ACTATTCTAA ATTTACTTGG GATTGATACT
ACGAACGATA TTCGTTTTGG TCATGATATG TTTTCAAATG AATATACTGG TTTTGTTGTT
TTACGTGATG GTAGCTTTAT TACAGACAAG TATGCATACA AAAACAATAT TTTCTATGAC
CGCATAACAG GTGAAATTGT AGATTTACCA AAAAATGAAG CACAAGCACT CATTAAGCGT
GCGCAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GCGATTTATT ACGTTTCTCA
GAAAGTAATA AGATTAAAAC TGGTGAAGTA CAAACTAAAA TTAAAGAAAC TGAAAAATAA
 
Protein sequence
MKNKINLQMQ NISFVLIIAL AVWLKTYLIT RFSFDLKIES STQELILFIS PLAASLAFVG 
LALFATGEKR NYIALCINFL LTIVLVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI
LNYKIILAFA DIIFFFILLK KKALVFQTGR VTHSARLLYF LLTIGVFFAN LHLAEKERPE
LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANQSEPNPN
MFGAAKGKNV IVVTLESLQT FLIGASVNGQ EVTPFLNEFI NESYYFNNFF HQTGQGKTSD
SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL
GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE
DTKFIEPYNS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH
NRAMAQFLEK DQITEFDTLN LQRTPLYIHM PGQTEGETIS KPTGQIDIKP TILNLLGIDT
TNDIRFGHDM FSNEYTGFVV LRDGSFITDK YAYKNNIFYD RITGEIVDLP KNEAQALIKR
AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK