Gene BCG9842_B2289 details

Gene Information

Locus tag: BCG9842_B2289
Symbol:
ID: 7181784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 2857832
End bp: 2859805
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 35%
IMG OID: 643550758
Product: sulfatase
Protein accession: YP_002446428
Protein GI: 218898017
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
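
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2857832
end_bp = 2859805

def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1

def protein_length(gene_len: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon

length = gene_length(start_bp, end_bp)
print(length)                  # 1974
print(protein_length(length))  # 657
```

Under those assumptions both results agree with the reported Gene Length (1974 bp) and Protein Length (657 aa).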


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000748062
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.58706e-23
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACAGT TCTTATTAAA GAGCAAAAGT GTGTTAAGCA ATCATTTTGG ATTCTTTCTG 
TTTGCCGTTA TTTTATTGTG GCTCAAAACG TATGCAGCCT ATGTAACGGA ATTTAATTTA
GGAATTTCAA ACACAATCCA AAAATTCTTG CTGTTTTTTA ACCCGCTTAG TTCAGCAGTT
CTATTTTTAG GACTTGCATT ATTTGCAAAA GGGAAGCGAT CTTATATTTG GTTAATTGTT
ATCAACTTGT TATTGTCGAT TCTTTTATAT GCAAACGTAG TTTACTATCG CTTTTTCAGT
GACTTTATTA CGTTCCCAAC ATTAACACAA ACGAATAACT TTGGAGATTT AGGTGGTAGT
ATTCTTGCGT TGCTACATCT TTATGATCCA CTATACTTCT TAGACACGAT TATATTAATT
GTGTTAGTTG CAACAAAATT TGCAAATCCA AAACCAATTC GTGTTGCGAA GCATAAAGTA
TCTCTAGTAT TTGTAGCAGG TATTTTATTA TTCAGTGTTA ATTTAGGTCT TGCAGAGTCT
GACCGTCCGG AATTATTAAC AAGAACGTTT GACCGTAATT ATATTGTGAA ATATTTAGGG
GCATACAACT ACACAATTTA TGATGGAATT CAAAGTGCGA AAGCATCGAC AGAACGAGCA
TTAGCTGATG GCGATAATAT GACAGAAGTA CGAAATTATT TAACATCAAC TTATGCAAGT
CCAAATCCTG AGTATTTCGG TAAAGGAAAG GGAATGAACG TAATTTATAT TCATTTAGAG
TCATTCCAAA ACTTCTTAAT TGATTACAAA TTAAATGGTC AAGAAGTTAC ACCGTTCTTA
AACTCATTTA CAAAAGATGC GAATACGCTA TACTTTGATA ACTTCTTCCA TCAAACAGGA
CAAGGGAAAA CATCTGATGC GGAGTTTATG TTAGAGAATT CAATGTTTGG TTTACCGCAA
GGTTCTGTCT TTACAACGAA ATCTCATAAT ACGTATCAAT CAGCACCAGC TATTTTAGGA
CAACAAGGAT ACACATCAGC AGTATTCCAT GGTAACTACA AAACATTCTG GAACCGTGAC
GATATTTATA AATCATTTGG TTTTAATAAA TTCTTTGATG CGTCATACTA CGATATGAAT
GAAAAAGACG TAGTAAACTA CGGATTAAAA GATAAACCGT TCTTTAATGA ATCTATTCCG
TTATTACAAA CGTTGAAACA ACCGTTCTAT ACGAAGTTTA TTACGTTATC GAACCATTTC
CCTTATCCAA TTGATAAGGC AGAAGCAACG ATTGAACCAG CAACAACAGG TGATTCATCA
GTAGATACGT ACTTCCAAAC AGCACGCTAT TTAGATGAAT CTGTAAAAGG CTTCATCGAT
TACTTGAAAC AATCTGGTTT ATATGATAAC TCAATTATCG TTATGTACGG AGACCATTAC
GGTATTTCAG ATAATCATAA CGCAGCAATG TCAAAAGTAA TGGGTAAAGA AATGAACTCA
TTTGAAAATG CACAGTTACA ACGTGTGCCT TTAATCGTTC GTGTACCAGG AGTGAAAGGT
GGCGTACAAC ATCAATATGG CGGTGAAATT GACGTTCTTC CTACGTTATT ACACTTACTA
GGTACAGATA CGAAAAACTA TGTTCAATTC GGTTCAGATT TATTATCACC AGAGCATAAA
CAAGTCGTTG CGTTCCGTAA CGGTAACTAC GTAAGCCCAA CTGTTACTGC ACTAAACGGC
AAATACTATG ATACAACAAC TGGAAAACCT GTAGAATTTA CAGATGAAAT AAAGAAAAAT
GAACAAATGG TTCAAAACTC ACTAAAATAC TCTGACCAAG TCGTAAATGG TGACTTATTA
CGATTCTACA CACCGGAAGG ATTCACTCCA GTAGATCGTT CGAAGTATAA CTATAACAAT
CGTGATAAAA ACAAAACGAA GGTAAAAACG ACTCCGGAAG GGGAAGCTAA ATAA
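
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and only the first line of the sequence is used as input, so the printed GC value is for that line alone and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)

# First line of the gene sequence as displayed above.
first_line = "ATGAAACAGT TCTTATTAAA GAGCAAAAGT GTGTTAAGCA ATCATTTTGG ATTCTTTCTG"

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.3 (first line only)
```

Running the same two functions on the full sequence text should give a length matching the Gene Length field and a GC fraction comparable to the reported GC content.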
 
Protein sequence
MKQFLLKSKS VLSNHFGFFL FAVILLWLKT YAAYVTEFNL GISNTIQKFL LFFNPLSSAV 
LFLGLALFAK GKRSYIWLIV INLLLSILLY ANVVYYRFFS DFITFPTLTQ TNNFGDLGGS
ILALLHLYDP LYFLDTIILI VLVATKFANP KPIRVAKHKV SLVFVAGILL FSVNLGLAES
DRPELLTRTF DRNYIVKYLG AYNYTIYDGI QSAKASTERA LADGDNMTEV RNYLTSTYAS
PNPEYFGKGK GMNVIYIHLE SFQNFLIDYK LNGQEVTPFL NSFTKDANTL YFDNFFHQTG
QGKTSDAEFM LENSMFGLPQ GSVFTTKSHN TYQSAPAILG QQGYTSAVFH GNYKTFWNRD
DIYKSFGFNK FFDASYYDMN EKDVVNYGLK DKPFFNESIP LLQTLKQPFY TKFITLSNHF
PYPIDKAEAT IEPATTGDSS VDTYFQTARY LDESVKGFID YLKQSGLYDN SIIVMYGDHY
GISDNHNAAM SKVMGKEMNS FENAQLQRVP LIVRVPGVKG GVQHQYGGEI DVLPTLLHLL
GTDTKNYVQF GSDLLSPEHK QVVAFRNGNY VSPTVTALNG KYYDTTTGKP VEFTDEIKKN
EQMVQNSLKY SDQVVNGDLL RFYTPEGFTP VDRSKYNYNN RDKNKTKVKT TPEGEAK
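
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python illustration, not the pipeline that generated this record. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 differs in which start codons are allowed, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 and last 24 bases of the gene sequence shown above.
head = "ATGAAACAGT TCTTATTAAA GAGCAAAAGT"
tail = "ACTCCGGAAG GGGAAGCTAA ATAA"
print(translate(head))  # MKQFLLKSKS
print(translate(tail))  # TPEGEAK
```

Both fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` should give the complete 657-residue protein.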