Gene BCG9842_B3476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B3476 
Symbol 
ID7184240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp1727314 
End bp1728621 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content38% 
IMG OID643549576 
Productchlorohydrolase 
Protein accessionYP_002445246 
Protein GI218896835 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00153187 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAAAACAA CTTATGTAAA CGCTACAATT GTAACGATGA ATGAACAAAA CGAAGTGATA 
GAAAATGGAT ATATCATTGT AGAAAATGAT CAAATTATAG ATGTAAAGAG CGGAGAATTT
GCAAATGATT TTGAAGTAGA TGAAGTAATT GACATGAAAG GAAAGTGGGT TTTACCAGGG
CTTGTAAATA CACATACACA CGTTGTAATG AGTCTCCTAA GAGGTATTGG AGATGATATG
TTATTACAGC CATGGCTTGA GACGAGAATT TGGCCACTGG AAAGTCAGTT TACTCCAGAG
CTTGCGGTTG CTAGTACGGA ATTAGGATTA CTTGAAATGG TGAAAAGTGG TACAACATCA
TTCTCTGACA TGTTTAATCC AATTGGAGTA GATCAAGATG CAATTATGGA AACGGTATCA
AGGAGCGGGA TGCGAGCTGC TGTTTCAAGG ACTTTATTTA GCTTCGGAAC GAAAGACGAT
GAAAAGAAAG CGATTGAAGA AGCTGAGAAA TATGTGAAGC GTTATTATAA AGAAAGTGGC
ATGTTAACTA CGATGGTTGC ACCACATAGT CCATATACAT GTAGTACAGA ACTGTTAGAA
GAGTGCGCTC GTATTGCAGT AGAAAATCAA ACAATGGTTC ATATCCACCT TTCTGAAACA
GAGCGTGAAG TACGTGATAT TGAAGCACAG TACGGAAAAC GTCCAGTAGA ATATGCAGCG
AGCTGCGGGT TGTTTAAACG CCCAACAGTT ATTGCACACG GTGTAGTATT AAATGAAAAT
GAACGTGCAT TTTTAGCAGA ACATGACGTT CGAGTAGCAC ATAATCCGAA TAGTAATTTA
AAACTAGGAT CTGGTATAGC GAATGTAAAA GCGATGCTAG AAGCAGGAAT GAAAGTAGGA
ATTGCAACAG ATAGTGTGGC ATCAAACAAC AATTTAGATA TGTTTGAAGA AATGCGTATA
GCGACTTTAC TACAAAAAGG TATTCACCAA GATGCAACAG CTTTACCGGT TGAAACTGCT
CTTACACTTG CGACTAAAGG AGCTGCTGAA GTAATCGGGA TGAAACAAAC AGGATCACTT
GAGGTTGGAA AGTGTGCTGA TTTTATTACG ATTGACCCAT CTAATAAGCC GCATTTACAA
CCAGCAGATG AAGTGTTATC GCACCTGGTA TATGCAGCTA GTGGAAAAGA TATAAGCGAT
GTAATTATTA ACGGAAAGCG TGTCGTTTGG AATGGCGAAT GTAAAACGTT AGATGAAGAG
CGTATTATAT TTGAAGCGAG TCGTTATAAA CGAGGTTTAC AAAGATAG
 
Protein sequence
MKTTYVNATI VTMNEQNEVI ENGYIIVEND QIIDVKSGEF ANDFEVDEVI DMKGKWVLPG 
LVNTHTHVVM SLLRGIGDDM LLQPWLETRI WPLESQFTPE LAVASTELGL LEMVKSGTTS
FSDMFNPIGV DQDAIMETVS RSGMRAAVSR TLFSFGTKDD EKKAIEEAEK YVKRYYKESG
MLTTMVAPHS PYTCSTELLE ECARIAVENQ TMVHIHLSET EREVRDIEAQ YGKRPVEYAA
SCGLFKRPTV IAHGVVLNEN ERAFLAEHDV RVAHNPNSNL KLGSGIANVK AMLEAGMKVG
IATDSVASNN NLDMFEEMRI ATLLQKGIHQ DATALPVETA LTLATKGAAE VIGMKQTGSL
EVGKCADFIT IDPSNKPHLQ PADEVLSHLV YAASGKDISD VIINGKRVVW NGECKTLDEE
RIIFEASRYK RGLQR