Gene BCAH820_3776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3776 
Symbol 
ID7191530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3603069 
End bp3604988 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content32% 
IMG OID643557187 
Productsulfatase 
Protein accessionYP_002452726 
Protein GI218904892 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value0.0000208589 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA 
GCAGTATGGT TAAAAACATA TCTTATTACA AGATTCAGTT TTGATTTAAA AATTGAATCT
TCTACACAAG AGCTTATTTT GTTTATTAGC CCGCTCGCTG CATCATTAGC TTTTGTCGGA
TTAGCATTAT TTGCAACCGG TGAAAAAAGA AATTATATCG CGCTATGTAT TAATTTTTTA
TTAACAATCA TACTTGTTGG AAATGTAATG TTCTATGATT TTTATAGTGA TTTCGTTACG
TTACCAGTAC TTGGACAAAC ATCAAACTTT GGCCAACTAG GCGGTAGTAT TATAGAAATA
TTGAACTACA AAATTATACT CGCATTCGTA GACATTATTT TTTTCTTTAT TTTATTAAAG
AAGAAAGCAC TAGTCTTTCA AACAGGACGT GTAACGCATC CGGCACGCTT CGTATATTTC
CTCTTAACGG TTGGTGTATT TTTTGCAAAT CTACACCTTG CAGAAAAAGA ACGACCTGAA
CTATTAACGA GATCGTTCGA CCGGGTCATG CTTGTTAAAA ATTTAGGACT ATACACGCAT
CAAGTTTATG ATTTAACACT GCAAGTAAAA GCAGGATCAC AAAAGGCACT TGCTGATAGT
AGCAAATTAC AAGAAACAGA AAACTACGTA AAGGCAAACC AAAGTGAACC GAATCCTAAT
ATGTTTGGTG CAGCGAAAGG AAAAAACGTA ATTGTCGTTA CTCTTGAATC CTTGCAAACC
TTTTTAATAG GCGCAAAAGT TAACGGAGAA GAAGTTACAC CATTTTTGAA TGAATTCATA
AATGAAAGCT ATTACTTTGA TAACTTTTTC CACCAAACTG GGCAAGGAAA AACATCCGAT
TCTGAATTTC TAATCGATAC GTCGTTATAT CCATTAAATC GAGGAGCTGT ATTTTTCACA
CACGGTAACA ATGATTATAC TGCGACGCCA GAAATTTTAC GTCAGCAAGG TTATTTCACT
TCTGTATTCC ATGCGAATAA TGCAACATTT TGGAATCGTA ACATTATGTA CTCTGCTCTC
GGTTATGATC GTTACTATAA TGAACTTGAT TACAAAATTA CGCCTGAAAC AAATTTAAAT
TGGGGCTTAA AAGATATCGA GTACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT
CAACCATTTT ACGCTCGTTT CCTTACTTTA ACAAACCATT ATCCATTTAC GTATGATGAA
GATACTAAAT TCATTGAACC ATATGACTCT GGTAATGGCG TGTTTGATCG TTATATCGTA
ACAGCACGTT ACTTAGACGA ATCCATTAAA AAATTTATTG AGCGTTTAAA AGCTGAGGGA
ATGTATGATG ATTCTATTAT TGTTTTATAT GGTGATCATT ATGGCATTTC TGAAAAACAT
AATCGTGCAA TGGCACAGTT TTTAGAAAAA GATCAAATAA CAGAATTTGA TACTCTGAAT
TTACAACGCA CACCTTTATA TATTCATATG CCTGGCCAAA CAGAAGGTGA AACGATTTCA
AAGCCTACAG GACAAATTGA TATAAAACCT ACTATTCTAA ATTTACTTGG GATTGATACT
ACGAACGATA TTCGTTTTGG TCATGATATG TTTTCAAATG AATATACTGG TTTTGTTGTT
TTACGTGATG GTAGCTTTAT TACAGACAAG TATGCATACA AAAACAATAT TTTCTATGAC
CGCATAACAG GTGAAATTGT AGATTTACCA AAAAATGAAG CACAAGCACT CATTAAGCGT
GCGCAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GCGATTTATT ACGTTTCTCA
GAAAGTAATA AGATTAAAAC TGGTGAAGTA CAAACTAAAA TTAAAGAAAC TGAAAAATAA
 
Protein sequence
MKNKINLQMQ NISFVLIIAL AVWLKTYLIT RFSFDLKIES STQELILFIS PLAASLAFVG 
LALFATGEKR NYIALCINFL LTIILVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI
LNYKIILAFV DIIFFFILLK KKALVFQTGR VTHPARFVYF LLTVGVFFAN LHLAEKERPE
LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANQSEPNPN
MFGAAKGKNV IVVTLESLQT FLIGAKVNGE EVTPFLNEFI NESYYFDNFF HQTGQGKTSD
SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL
GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE
DTKFIEPYDS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH
NRAMAQFLEK DQITEFDTLN LQRTPLYIHM PGQTEGETIS KPTGQIDIKP TILNLLGIDT
TNDIRFGHDM FSNEYTGFVV LRDGSFITDK YAYKNNIFYD RITGEIVDLP KNEAQALIKR
AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK