Gene BCG9842_B1434 details


Gene Information

Locus tag: BCG9842_B1434
Symbol:
ID: 7182621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 3698158
End bp: 3700077
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 33%
IMG OID: 643551607
Product: sulfatase
Protein accession: YP_002447277
Protein GI: 218898866
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
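
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3698158, 3700077)
print(length)                              # 1920
print(expected_protein_length_aa(length))  # 639
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.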


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.511966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0230537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAATA AAATAAATTT ACAAATGCAA AATATAAGTT TTGTTATAAT AATGGCTTTA 
GCAGTATGGT TAAAAACATA TCTTATTACG CGATTCAGTT TTGATTTAAA AATTGAGTCT
TCAACGCAAG AGCTTATTTT GTTTATTAGC CCTCTAGCCG CATCATTAGC ATTTGTTGGA
TTAGCATTAT TTGCAACTGG TGAAAAGCGA AATTATATAG CGCTATGTAT TAATTTCTTA
TTAACGATCG TGCTTGTTGG AAATGTAATG TTCTATGATT TTTATAGTGA TTTCGTTACA
TTACCAGTAC TTGGACAAAC CTCAAACTTT GGCCAATTAG GCGGCAGTAT TATAGAGATA
TTAAACTACA AAATTATACT CGCATTCGTA GACATTGTTT TCTTCTTTAT TTTATTAAAA
AAGAAATCAT TGGTCTTTAA AACAGAACGT GTAACTCATT CTACACGCTT GTTATACTTT
CTTTTAACGA TTGGTGTATT CTTTGCGAAT CTACAGCTTG CAGAAAAAGA GCGCCCTGAA
TTATTAACGA GATCATTCGA CCGGGTAATG CTTGTCAAAA ATTTAGGCTT ATATACTCAC
CAAGTATATG ACTTAACACT GCAAGTAAAA GCTGGGTCAC AAAAAGCACT TGCTGATAGT
AGTAAATTAC AAGAAACTGA AAACTACGTA AAAGCAAACC AAAGCGAGCC AAACCCTAAT
ATGTTTGGTG CAGCGAAAGG AAAAAACGTA ATTGTCGTCA CTCTTGAATC CTTGCAGACC
TTCTTAATAG GCGCATCAGT CAATGGGCAA GAAGTTACAC CATTCCTAAA TGAATTCATA
AATGAAAGTT ATTACTTTGA TAACTTTTTC CATCAAACTG GTCAAGGGAA AACATCCGAT
TCTGAATTTC TAATCGATAC GTCGTTGTAT CCATTAAATC GAGGGGCTGT ATTCTTCACA
CACGGTAACA ATGATTATAC TGCGACTCCA GAAATTTTAC GTCAGCAAGG TTATTTCACT
TCTGTATTCC ATGCGAATAA CGCAACATTT TGGAATCGTA ATATTATGTA CTCCGCTCTT
GGTTATGATC GTTACTATAA TGAGCTTGAT TACAAAATTA CGCCAGAAAC AAATTTAAAT
TGGGGATTAA AAGATATCGA ATACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT
CAACCATTCT ATGCTCGTTT CCTTACTTTA ACAAACCATT ATCCATTCAC GTATGATGAA
GATACAAAAT TCATTGAACC ATACAACTCT GGTAATGGCG TATTCGATCG TTACATCGTA
ACTGCACGTT ACTTAGACGA ATCAATTAAA AAATTTATTG AGCGTTTAAA GGCCGAGGGA
ATGTATGATG ATTCTATTAT TGTGTTATAC GGTGATCATT ATGGCATTTC CGAAAAACAT
AATCGTGCAA TGGCACAGTT TTTAGACAAA GATCAAATAA CAGAATTTGA TACTTTAAAT
TTACAACGTA CACCTTTATA TATTCATATT CCTGGACAAA CAGAAGGTCA AACTATTTCA
AAGCCTACGG GACAAATCGA TATGAAACCT ACTATTCTAA ATTTATTAGG TGTTGGTTCT
ACGAATGATA TCCGTTTTGG CCATGATATG TTTTCAGATG AATATACTGG CTTTGTTGTT
TTACGCGATG GTAGCTTCGT TACAGATAAG TATGCATACA AAAACAACAC TTTCTACGAC
CGTATAACAG GGGAAATTGT AGATTTACCA AAAAAAGAAG CTCAAGCCCT CATTAAACGT
GCACAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GCGATTTATT ACGCTTCTCA
GAAAGTAATA AAATTAAAAC TGGCGAAGTA CAAACTAAAA TTAAAGAAAC AGAAAAATAA
 
Protein sequence
MKNKINLQMQ NISFVIIMAL AVWLKTYLIT RFSFDLKIES STQELILFIS PLAASLAFVG 
LALFATGEKR NYIALCINFL LTIVLVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI
LNYKIILAFV DIVFFFILLK KKSLVFKTER VTHSTRLLYF LLTIGVFFAN LQLAEKERPE
LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANQSEPNPN
MFGAAKGKNV IVVTLESLQT FLIGASVNGQ EVTPFLNEFI NESYYFDNFF HQTGQGKTSD
SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL
GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE
DTKFIEPYNS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH
NRAMAQFLDK DQITEFDTLN LQRTPLYIHI PGQTEGQTIS KPTGQIDMKP TILNLLGVGS
TNDIRFGHDM FSDEYTGFVV LRDGSFVTDK YAYKNNTFYD RITGEIVDLP KKEAQALIKR
AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK
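
The gene and protein sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It is an illustration only, not the method the database used. The helper names are hypothetical, and the codon table is written out by hand: translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows, which this sketch ignores.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = clean_sequence("ATGAAGAATA AAATAAATTT ACAAATGCAA")
print(translate(first_line))  # MKNKINLQMQ
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence, and `gc_fraction` should give a value close to the reported GC content of 33%.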