Gene BCAH820_5321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_5321 
Symbol 
ID7188018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp5016743 
End bp5018671 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content37% 
IMG OID643558731 
Productsulfatase 
Protein accessionYP_002454241 
Protein GI218906407 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.10214e-55 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAAA CCTTGAAATC ACAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA 
GCCGTATGGT TGAAGACATA TCTTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT
TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTGGT
CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAATTTTGTT
TTAACAATTA TTCTTGTAGG TAACGTAATG TTCTACGGAT TCTATAATGA CTTCGTTACT
TTACCCGTAC TAGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA
TTTAACTACA AAATCATCCT TGCATTTGCT GATATTATCG TATTCTTCGT TTTATTGAAG
AAGATGAAGA ATTTTGCACC GACAGAACGT GTAGCACGCC CAATGCGTTC CCTATACTTC
GTGTCAACAA TTGCTATTTT CTTCGCAAAC TTAGGACTGG CAGAAGCTGA GCGTCCTGAA
CTATTAACAC GTTCATTCGA CCGCGTTATG CTCGTTAAAA ACTTAGGTTT ATATGTACAC
CAAGTGTATG ACCTTGGCTT ACAAGCAAAA TCAAGTTCAC AAAAAGCATT TGCTGACGGT
AGTAAGTTAC AGGAAACAGA GAACTACGTA AAAACAACGC AAAGCAAACC AGATCCAAAT
ATGTTTGGTA CTGCAAAAGG GAAAAACGTA ATTGTCGTCT CTCTTGAGTC ATTACAAACA
TTCTTAATTG GTGCAACAGT TAACGGACAA GAAGTTACAC CATTCTTAAA CCAATTTACG
AAAGAAAGTT ATTACTTCGA TAACTTCTTC CATCAAACTG GTCAAGGAAA AACATCTGAC
GCTGAATTCT TAGTAGATAC TTCCATGTAT CCACTAGACC GTGGTGCTGT ATTCTTCACA
CACGGTAACA ACGAATACAC AGCAACTCCA GAAATTTTAC GTGAGCAAGG ATATCACACA
TCTGTATTCC ACGCGAACAA TGCAACGTTC TGGAACCGTA ACATTATGTA TCCGGCACTT
GGTTATGACC GTTACTACAA CGAGCTTGAC TACAAGATTA CGCCAGAAAC AAAATTAAAT
TGGGGATTAA AAGATATCGA ATACTTCGAT CAATCTATCG ATATGTTAAA AGAAGTGAAG
CAACCGTTCT ACACTCGCTT CCTTACGTTA ACAAACCATT ACCCATTCAC TTATGATGAA
AGCACAAAAT TAATCGATGA ATACAATTCT GGTGATGGCG TATTTGACCG TTACATGGTA
ACTGCTCGCT ATTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAGGGT
ATTTACGACA ACTCAATTAT CGTATTCTAC GGTGATCACT ACGGTATTTC TGAAAACCAT
AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATTA CTGCATTTGA CCATATGAAC
TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCA
AAACCAACTG GTGAAATTGA CATTAAACCA ACAATTCTAA ACTTACTTGG TATAGATTCT
ACGAATCAAA TTCAATTTGG TCATGATGTA TTCTCACCAG AAAATAAAGG ATTTGTTGTT
CTTCGTGACG GTAGCTTCGT TACAGATAAG TACATGTATA CGAACAGTAC ATTCTACGAC
CGTGCTACTG GCGAAGTTGT ACAATTACCA AAAGAAGAAT CTCAACCACT CATTGATCGT
GCTCAAAATG AATTGAACAT GTCTGACAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT
GAAAGCAACA AGACAAAAAC TGGTGAAGTA AAGACAGCTA TTAAAGAAGA AAAGAAGAGC
GCTGAGTAA
 
Protein sequence
MKETLKSQFQ NVRFTVFVAL AVWLKTYLIT RTSFDLKLES FMQEFILFLS PLAASLLLVG 
LALFAKGKKR NYIALGINFV LTIILVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL
FNYKIILAFA DIIVFFVLLK KMKNFAPTER VARPMRSLYF VSTIAIFFAN LGLAEAERPE
LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN
MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD
AEFLVDTSMY PLDRGAVFFT HGNNEYTATP EILREQGYHT SVFHANNATF WNRNIMYPAL
GYDRYYNELD YKITPETKLN WGLKDIEYFD QSIDMLKEVK QPFYTRFLTL TNHYPFTYDE
STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSIIVFY GDHYGISENH
NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGIDS
TNQIQFGHDV FSPENKGFVV LRDGSFVTDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR
AQNELNMSDK IIEGDLLRFS ESNKTKTGEV KTAIKEEKKS AE