Gene BCAH187_A5404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH187_A5404 
Symbol 
ID7075096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH187 
KingdomBacteria 
Replicon accessionNC_011658 
Strand
Start bp4988881 
End bp4990809 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content37% 
IMG OID643453802 
Productsulfatase 
Protein accessionYP_002341293 
Protein GI217962717 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000849395 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA CCTTGAAATC ACAATTTCAA AATGTGCGTT TCACTGCATT CGTAGCTTTA 
GCCTTATGGT TGAAGACATA TCTTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT
TTCATGCAAG AATTCATTTT ATTCCTGAGC CCATTAGCAG CATCATTACT GCTTGTTGGT
CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAATTTTGTC
TTAACAATCG TTCTTGTTGG TAACGTAATG TTCTACGGAT TCTACAATGA CTTCGTTACT
TTACCAGTAC TAGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT AAAAGAATTG
TTTAACTACA AAATTATCCT CGCATTTGCT GATATTATCG TATTCTTCAT TTTATTGAAG
AAGATGAAGA ATTTTGCACC GACAGAACGT GTAGCACGCC CAATGCGTTC CCTATACTTC
GTGTCAACAA TTGCTATTTT CTTCGCAAAC TTAGGATTGG CAGAAGCTGA GCGTCCAGAA
CTATTAACAC GTTCATTCGA CCGCGTTATG CTTGTTAAAA ACTTAGGTTT ATATGTACAC
CAAGTGTATG ATCTTGGCTT ACAAGCAAAA TCAAGTTCAC AAAAGGCATT TGCTGACGGT
AGTAAGTTAC AGGAAACAGA GAACTACGTA AAAACAACGC AAAGCAAACC AGATCCAAAT
ATGTTTGGTA CTGCAAAAGG GAAGAACGTA ATCGTTGTCT CTCTTGAGTC ATTACAAACA
TTCTTAATTG GTGCAACAGT TAACGGACAA GAAGTTACAC CATTCTTAAA CCAATTTACG
AAAGAAAGTT ATTACTTTGA TAACTTCTTC CATCAAACTG GGCAAGGAAA AACATCTGAT
GCTGAATTCT TAGTAGATAC TTCAATGTAT CCACTAGACC GTGGTGCTGT ATTCTTCACA
CACGGTAACA ACGAATACAC AGCAACTCCA GAAATTTTAC GTCAGCAAGG ATACTACACT
GCAGTATTCC ACGCAAACAA CGCAACATTC TGGAACCGTA ACATTATGTA TCCGGCACTT
GGTTATGATC GTTACTACAA CGAGCTTGAC TATAAGATTA CGCCAGAAAC GAAATTAAAC
TGGGGATTAA AAGATATCGA GTACTTTGAT CAATCTATCG ATATGTTAAA ACAAGTGAAG
CAACCGTTCT ATACTCGCTT CCTTACTTTA ACAAACCATT ACCCATTCAC TTATGATGAC
AATACAAAAT TAATTGATGA ATACAATTCT GGTGATGGCG TATTTGACCG TTACATGGTA
ACAGCTCGCT ACTTAGATGA AGCAATGAAA CACTTTATTG AACGTCTAAA AGCAGAAGGT
ATTTACGACA ACTCAGTTAT CGTATTCTAC GGTGACCACT ACGGTATTTC TGAAAACCAT
AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATTA CTGCGTTTGA CCATATGAAC
TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCA
AAACCAACTG GTGAAATTGA CATTAAACCA ACTATTCTAA ACTTACTTGG TGTAGATTCT
ACGAATGACG TTCGATTCGG TCATGACATA TTCTCACCTG ACAATAAAGG ATTCGTTGTT
CTTCGTGATG GTAGCTTCGT TACAGATAAG TACATGTACA CGAACAGTAC ATTCTACGAC
CGTGCTACTG GCGAAGTTGT ACAACTACCA AAAGAAGAAT CTCAACCACT TATTGATCGT
GCCCAAAATG AATTGAGCAT GTCTGACAAA ATCATTGAGG GTGACTTACT TCGCTTCTCT
GAAAGCAACA AGATCAAAAC TGGTGAAGTA AAGACAGCTA TTAAAGAAGA AAAGAAGAGC
GCTGAGTAA
 
Protein sequence
MKETLKSQFQ NVRFTAFVAL ALWLKTYLIT RTSFDLKLES FMQEFILFLS PLAASLLLVG 
LALFAKGKKR NYIALGINFV LTIVLVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL
FNYKIILAFA DIIVFFILLK KMKNFAPTER VARPMRSLYF VSTIAIFFAN LGLAEAERPE
LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN
MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD
AEFLVDTSMY PLDRGAVFFT HGNNEYTATP EILRQQGYYT AVFHANNATF WNRNIMYPAL
GYDRYYNELD YKITPETKLN WGLKDIEYFD QSIDMLKQVK QPFYTRFLTL TNHYPFTYDD
NTKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSVIVFY GDHYGISENH
NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGVDS
TNDVRFGHDI FSPDNKGFVV LRDGSFVTDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR
AQNELSMSDK IIEGDLLRFS ESNKIKTGEV KTAIKEEKKS AE