Gene BCAH187_A3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH187_A3804 
Symbol 
ID7074274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH187 
KingdomBacteria 
Replicon accessionNC_011658 
Strand
Start bp3549962 
End bp3551881 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content33% 
IMG OID643452235 
Productsulfatase 
Protein accessionYP_002339746 
Protein GI217961178 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00040285 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA 
GCAGTATGGT TAAAAACATA TCTTATTACA AGATTCAGTT TTGATTTAAA AATTGAGTCT
TCTACACAAG AGCTGATTTT ATTTATTAGC CCTCTCGCTG CTTCATTAGC TTTTGTCGGA
TGCGCATTAT TTGCAACCGG TGAAAAGCGA AATTATATCG CGCTATGTAT TAATTTCTTA
TTAACAATTG TACTTGTTGG GAATGTAATG TTCTATGATT TTTATAGTGA TTTCGTTACG
TTACCAGTAC TTGGACAAAC ATCAAACTTT GGCCAATTAG GCGGTAGTAT TATAGAAATA
TTGAACTACA AAATTATACT CGCATTCGCA GACATTATTT TCTTCTTTAT TTTATTAAAG
AAGAAGGCAC TAGTCTTCCA AACGGGGCGC GTAACTCATT CGGCACGCTT GTTATATTTT
CTCTTAACGA TTGGTGTATT TTTTGCAAAT CTACATCTTG CAGAAAAAGA GCGACCTGAA
TTACTAACGA GATCATTCGA CCGAGTCATG CTTGTTAAAA ATTTAGGACT ATACACGCAT
CAAGTATATG ATTTAACACT ACAAGTAAAA GCAGGATCAC AAAAGGCACT TGCTGATAGT
AGTAAATTAC AAGAAACAGA AAACTACGTG AAGGCAAACG AAAGTCAGCC GAATCCTAAC
ATGTTTGGTG TAGCGAAAGG AAAAAACGTA ATTGTTGTTA CTCTTGAATC CTTACAAACC
TTTTTAATAG GCGCAACAGT TAACGGACAA GAAGTTACAC CATTTTTGAA TGAATTCATA
AATGAAAGCT ATTACTTTGA TAACTTTTTC CACCAAACTG GGCAAGGAAA AACATCCGAT
TCTGAATTTC TAATTGATAC GTCGTTATAT CCATTAAATC GAGGAGCTGT ATTCTTCACA
CACGGTAACA ATGATTATAC TGCGACTCCA GAAATTTTAC GTCAGCAAGG TTATTTCACT
TCTGTATTCC ATGCGAATAA CGCAACATTT TGGAATCGTA ATATCATGTA CTCTGCTCTC
GGTTATGATC GTTACTATAA TGAACTTGAT TACAAAATTA CGCCCGAAAC TAATTTAAAT
TGGGGATTAA AAGATATCGA ATACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT
CAACCATTTT ATGCTCGTTT CCTTACTTTA ACGAACCATT ATCCATTTAC GTATGATGAA
GATACAAAAT TCATTGAACC ATATAACTCT GGTAATGGCG TATTTGATCG TTATATCGTA
ACAGCACGTT ACTTAGACGA ATCAATTAAA AAATTTATTG AGCGTTTAAA AGCCGAGGGA
ATGTATGATG ATTCTATTAT TGTATTATAC GGTGATCATT ATGGCATTTC TGAAAAACAT
AATCGTGCAA TGGCACAGTT TTTAGAAAAA GATCAAATAA CAGAATTTGA TACTTTAAAT
TTACAACGTA CACCTTTATA TATTCATGTC CCTGGACAAA CAGAAGGTCA AACGATTTCA
AAGCCTACGG GACAAATCGA TATGAAACCT ACTATTCTCA ATTTATTAGG GATTGATACT
ACGAACGATA TTCGTTTTGG TCATGATATG TTTTCAGATG AATATACCGG TTTTGTTGTT
TTACGTGATG GTAGCTTCAT TACAGACAAA TATGCATACA AAAACAACAC TTTCTATGAC
CGCATAACAG GTGAAATTGT AGATTTACCA AAAAAAGAAG CTCAAGCACT CATTAACCGT
GCACAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GAGATTTACT ACGTTTCTCA
GAAAGTAATA AAATTAAAAC TGGCGAAGTA CAAACTAAAA TTAAAGAAAC TGAAAAATAA
 
Protein sequence
MKNKINLQMQ NISFVLIIAL AVWLKTYLIT RFSFDLKIES STQELILFIS PLAASLAFVG 
CALFATGEKR NYIALCINFL LTIVLVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI
LNYKIILAFA DIIFFFILLK KKALVFQTGR VTHSARLLYF LLTIGVFFAN LHLAEKERPE
LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANESQPNPN
MFGVAKGKNV IVVTLESLQT FLIGATVNGQ EVTPFLNEFI NESYYFDNFF HQTGQGKTSD
SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL
GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE
DTKFIEPYNS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH
NRAMAQFLEK DQITEFDTLN LQRTPLYIHV PGQTEGQTIS KPTGQIDMKP TILNLLGIDT
TNDIRFGHDM FSDEYTGFVV LRDGSFITDK YAYKNNTFYD RITGEIVDLP KKEAQALINR
AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK