Gene BCG9842_B3874 details


Gene Information

Locus tag: BCG9842_B3874
Symbol:
ID: 7184382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 1352824
End bp: 1354710
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 33%
IMG OID: 643549189
Product: sulfatase
Protein accession: YP_002444859
Protein GI: 218896448
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
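
The start, end, gene length and protein length values above are consistent with each other. A minimal sketch of the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 1352824
end_bp = 1354710

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length in base pairs
print(protein_length, "aa")   # protein length in amino acids
```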


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.295207
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTACAAC ATCTGTTTCC AAAACTGCGA TTTGCACTCG TTGCAGTCGT TTTACTATGG 
ATTAAAACAT ATATTGTGTA CAAGCTAGCA TTTGATATTA AAATTGATAA TTTCTTTGAA
GAATTCATGC TTTTTATTAA TCCACTAGCT GCGTTACTTT TATTTTTCGG CTTAGCTTTA
CTTGCATCTA AGCACCGAAA CCGAATTATA ATTGGAATCA GTTTTATACT GTCATTTATT
TTATTTGGAA ACGCAATGTT TTATGGGTTC TATAACGATT TCGTTACTTT CCCGGTTTTA
TTCCAAACAA ACAATATGGC TGATTTAGGG ACAAGTATAA AAGAACTCTT TACGTACAAA
ACATTACTTT TATTTGCAGA TGCAATTATT TTAATGTTTA TTTCGCGTAA ATTCCCATCA
TTTGGCGACA AAACACCACT TTCCCGCTCA GAGAAGCGAA CTTTCTTTAG CGGTGTAACA
GCTTTATTAG CACTACAAAT TGTTGTATCA GTTATTTATA AACCACAAAT GTTCTCACGC
TCATTTGACC GTCAAACTGT TGTGAAAAAT TTAGGTTTAT ACACATATCA TCTATTTGAT
ATTACACTTC AATCCAAGTC TTCAGCTGAG CGTGTATTTG CAAGTGGCGA TGGATTTTCT
GAAATTAAGA ACTATACAGA CTCAAAAGAC AAGCAAGTTG ATAAAAACTT ATTTGGAGCT
GCAAAAGGTA AAAATGTAAT TTTAATTTCA ATGGAATCTA CACAAAGCTT TGTTATTAAT
AAAAAAATAA ATGGAAAAGA AATTACACCA TTTTTAAATG AATTTATTAA GGATAGCTTC
TATTTCGATA ACTTCTATCA TCAAACTGGA CAAGGTAAAA CTTCTGATGC TGAATTTATC
GTTGAAAACT CACTTTACCC ACTAGATCGT GGTTCTGTAT TCTTTACTCA TGCAACAAAT
GAATACACAG CTACACCAGA ACAATTAAAG AAATACGGAT ATTCTTCTGC CGTCTTCCAT
TCAAACGATA AAACGTTTTG GAATCGGGAT GTAATGTATC CTACACTTGG ATATGATCGT
TACTTTAATT TAAATGATTA CGTAGGAACG GAACAAATGT CTGTCGGTTG GGGATTAAAA
GATAAAGAGT TCTTTGAACA ATCTATTCCA AAGTTAAAAT CTTTACCGCA ACCGTTCTAT
ACAAAATTTA TTACATTAAC AAATCATTTT CCGTTTCTTC TAAATCCGGA AGACCAATAT
GTTGATGAAT TCAACTCAGA AAGTGGTGTT GTAAACCGCT ACTTCCCAAC TGTTCGTTAC
ACAGATGAAG CTCTTAAATT ATTTATTAAA CAATTAAAAG AAGAAGGACT GTATGATAAT
TCCGTTATTG TCATTTATGG TGATCATTAT GGTATTTCCG AAAACCATAA CGCAGCTATG
GCACAGTTCC TAGGAAAAGA TACTATTACA CCATTCGATT CTATGCAATT ACAACGCGTT
CCTCTTATTA TTCATGTGCC TGGTCAAGAA GGAAAAGTTG TTTCTAAAGT ATCTGGTCAA
ATTGATATTA AACCAACGCT CCTTCATTTA CTTGGTATTA AAACAAATAA ATCCGTTGAA
TTTGGAACTG ACTTATTTAT TAAAGAAAAA GACCCGCTTA TGGTAATGCG TGATGGTAGC
TTTGTTTCTG AAGAGTATGT TTATACAAAA AATATGTGCT ACAAAAGAAG TACGGGTGAA
GAAGCTGACA TGACGCTATG TCAGCCGTAT GTTGAAAAAG CAAAAACAGA ATTAAAACTC
TCCGATAAAT TAATTTATGG AGATTTATTA CGTTTCGATC CTAATAATCG ATATAAAACC
GGAACGATGA CAACGAAATT TGAATAG
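
The GC content reported in the gene information can be recomputed from the gene sequence. A small sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the function name `gc_content` is hypothetical and the example input is only the first line of the sequence, so paste the full sequence to compare against the reported value:

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to display the sequence in 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCTACAAC ATCTGTTTCC AAAACTGCGA TTTGCACTCG TTGCAGTCGT TTTACTATGG"
print(f"{gc_content(first_line):.0%}")
```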
 
Protein sequence
MLQHLFPKLR FALVAVVLLW IKTYIVYKLA FDIKIDNFFE EFMLFINPLA ALLLFFGLAL 
LASKHRNRII IGISFILSFI LFGNAMFYGF YNDFVTFPVL FQTNNMADLG TSIKELFTYK
TLLLFADAII LMFISRKFPS FGDKTPLSRS EKRTFFSGVT ALLALQIVVS VIYKPQMFSR
SFDRQTVVKN LGLYTYHLFD ITLQSKSSAE RVFASGDGFS EIKNYTDSKD KQVDKNLFGA
AKGKNVILIS MESTQSFVIN KKINGKEITP FLNEFIKDSF YFDNFYHQTG QGKTSDAEFI
VENSLYPLDR GSVFFTHATN EYTATPEQLK KYGYSSAVFH SNDKTFWNRD VMYPTLGYDR
YFNLNDYVGT EQMSVGWGLK DKEFFEQSIP KLKSLPQPFY TKFITLTNHF PFLLNPEDQY
VDEFNSESGV VNRYFPTVRY TDEALKLFIK QLKEEGLYDN SVIVIYGDHY GISENHNAAM
AQFLGKDTIT PFDSMQLQRV PLIIHVPGQE GKVVSKVSGQ IDIKPTLLHL LGIKTNKSVE
FGTDLFIKEK DPLMVMRDGS FVSEEYVYTK NMCYKRSTGE EADMTLCQPY VEKAKTELKL
SDKLIYGDLL RFDPNNRYKT GTMTTKFE
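
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard compact NCBI layout (bases ordered T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons. This gene starts with ATG, so no special start handling is included, which is a simplifying assumption. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the compact NCBI layout: first, second, third base each run over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove display spacing and line breaks before reading codons.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTACAAC ATCTGTTTCC AAAACTGCGA"))  # MLQHLFPKLR
```

The output matches the first block of the protein sequence; passing the full gene sequence should allow the same comparison over the whole protein.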