Gene BCAH820_0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_0417 
Symbol 
ID7187524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp404780 
End bp407755 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content35% 
IMG OID643553828 
Productphage infection protein 
Protein accessionYP_002449409 
Protein GI218901575 
COG category[S] Function unknown 
COG ID[COG1511] Predicted membrane protein 
TIGRFAM ID[TIGR03061] YhgE/Pip N-terminal domain
[TIGR03062] YhgE/Pip C-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones100 
Fosmid unclonability p-value0.00466576 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAAA TTTGGCGTAT TTATAAGACA GATTTACGGA ATGTAGCCAA ACATTGGGCT 
GCAATTGTTA TTGTTGTAGG GTTAATGATT TTACCATCGT TATATGCATG GTTTAACATT
AAAGCTTCTT GGGATCCGTA TGGAAATACA AAAGAGGTTC CCATTGCAGT TTCTAACCAA
GATGCAGGCT CTAATTTAAG AGGGAAAGAT ATTAATATCG GGGACGAGAT TGTAGATTCC
CTTAAGAAAA ACAAAAATCT TGGCTGGAAA TTTGTAGATG AAAAACAAGC AATTTACGGA
GTTGAACGTG GGGATTACTA TGCAAGCATT ACAATCCCAA AAGACTTCTC TGAAAAGATT
GCAACTGTTC TGGATGAAAA TCCACAGAAA CCAGAACTTG ATTACTATGT AAATGAAAAG
GTAAACGCGA TTGCACCGAA AATTACAGCA AAAGGTGCAT CAGGTATTAC AGAAGAAATT
AGTAAAAACT TTGTTAAAAC AGCGAACGGT GAGATTTTTA AAATCTTCAA TGACCTTGGA
ATCGATTTAG AAACAAACTT ACCAAGTATT GAGAAAGTAA AAGATCTTGT ATTTAAGTTA
GAAGCGCAGT TCCCAGAAAT GAATACACTG ATGGATAAGG CGTTAGATGA TGCGACTCGA
GCAGAAGATG TTGTGAAAGT AGCGCAGAAA GAGTTGCCGG TTGTGGAAAG TGTTATTAAT
GATGGGCAGG ATACACTTAA GAGTTTAGAT GCGTTCTTTG CGAGAAATGA TGAAACATTG
AATCGCGCTC CGGGAACTAT TAAGAATAAT TTAATTGTAA CGAAGCAGGG CTTGGATGCT
GCAGCGAAAA TTACGGATTA CTTAAAAAAT CCTTCATTTG ATTTTAATCT GACACTTCCA
GATCCAGCTA AATTCCCAGT GTTACCTAAT ATAACTCTTC CACAAATACC GCAAATACCA
GAAATTCCAG CGTTACCGCA AGTGAATGGT GATGGGTATA AGAATATTGC TAAAAATATT
GATCAAACTG TAAATAATGT TTTCAGCTCA ATACGTGTTG GAACAACTTA TGCACAAGGT
GTTATAAACG GGTTACAAAA TGGTAATTTT GATCCAGAAA AAGCGAAACA AGATCTTAAT
AATGTTTCTG AGACGCTGCA AGGACGTGCG GATAGTGTTT CATATGTAAT TGATGTCTTT
ACTAAATTTA AAGAGTCGGC ATCAACTGAT TTTGGAAAAG AATTCTTCCA AAAGAGAATA
GAAAGATTAA CAAACCTTAA ATCAGCTATC GAAAATGCAA ATGGTGGCGT TAAAGATATT
GCTAATATTA TTGGTACAGG GCAAGAAGTG AAACAAGATG TAAGAGATGC TACTAATAAA
AAGTTAGATG CAATAAATAA CTTGGTAAAT CAAGCAGAAG CAGATTATAA TGCAACATTC
GTAACTGATT TTGAAAAAGC AGTAAGTACA GCCGAGCAAT TGAAAGATAA GGCTGAAGGT
GTAAAAGAAG ATGCGCAACA GCTTAGAGGT AATTTAAATC AAGATATAAA AAAAGCGAAT
GAGTTAGTAA ATCAAACAAA TGAGGCATTA GACAGTGGTA GAGAAAAGTA CGATAAGGCT
GTAAGTGATT ACAGCAGAGT AAAAACAGAA CTTGGAAAAG CAAGAGAAGA CTTAAGCAAC
GCAGGTGTTA ATGGATTAGA CTCTACAAAA GTAGCCTTAA ATGATTTAAA TGGACAGTTC
AAAGCAGCAT GGAATTTAGT TAATGATATG ATTCCAGTTC TGGAAAATAC AAATAAAGTT
TTAGCAGATG TAAATAGTGA TAAGAACACG AATAACACTA TTTCTAAATT AAATAAAGTA
AAAGATGGTC TTCAAAAAGG AATCAATTTA ACTGATAAAG GAATTGACTC TATTAATAAA
GGGCAAAAAC CAGCAGCAGA TGTTATTGAG AGTATTAATG AAGTATCTAA GAGTGTTTCG
GGTCAAATTG GTGACATTTT AGCTAAATAT GATTCAGAAA TCACTCCAAA TTTCAATGCG
GCTATTGCTC GTACGAAAGA GATGTCTAAA AACACATCTC AAATTTTAAA AGAAGCAGAC
AAAAAGCTTC CGGATGTTAA AAAAATACTA GAAGATTCAT CAAAAGGCTT AGTAGATGGT
AAAAAGAAAT TAGCAGATAT TAAAGCTGAA ATGCCAGCTA CTGAGAAAAA GATTAAAGAA
TTAGCAGATA AAATTCGTGA TTTTGAATCT GAAGAGGATT TAAAAGACAT CATCCGTTTG
TTGAAAAACG ATGTTGAGAA GCAAAGTGAT TACTTTGCGA ATCCAGTAAA TTTAAAAGAG
AATAAATTAT TTGCAATGCC GAACTACGGT TCAGCAATGT CACCATTCTA TACAGTGCTT
GCATTATGGG TAGGCGCATT ATTAATGGTT TCATTATTAA CAGTAGAAGT ACATGAAGAG
GGCGCGAATT ATAAGAGCCA TGAAATTTAC TTCGGACGTT TATTAACATT CTTAACGATA
GGTCTTTCAC AGGCGTTTAT CGTATCGATG GGAGATATAT TCTTACTCGG TACGTACGTA
GTCGATAAGT TCTGGTTTGT ATTATTTAGT TTATTTATAG GTGGAGTATT TGTTTGTATC
GTGTACTCAC TCGTTTCTAT TTTCGGAAAC GTAGGGAAAT CGATGGCGAT CATTTTACTT
GTACTGCAAG TAGCAGGATC GGGCGGAACA TTCCCAATTC AAATGACACC GGCGTTCTTC
CAAGCGATTT ATCCGTTCTT ACCGTTCACA TACGCAATTA GTGCAATTCG TGAAACAGTA
GGCGGAATGC TATGGGATAT TGTAACGCGA GATTTACTTG TGTTAAGTGC TTTCGTAGTA
GTTATGATTG TTGCTGCGCT ATTACTGAAA ACACCAATTA ACAAATCAAG TGAAAAATTC
GTTGAGAATG CAAAAGGAAG TAAAATCATT CACTAA
 
Protein sequence
MKQIWRIYKT DLRNVAKHWA AIVIVVGLMI LPSLYAWFNI KASWDPYGNT KEVPIAVSNQ 
DAGSNLRGKD INIGDEIVDS LKKNKNLGWK FVDEKQAIYG VERGDYYASI TIPKDFSEKI
ATVLDENPQK PELDYYVNEK VNAIAPKITA KGASGITEEI SKNFVKTANG EIFKIFNDLG
IDLETNLPSI EKVKDLVFKL EAQFPEMNTL MDKALDDATR AEDVVKVAQK ELPVVESVIN
DGQDTLKSLD AFFARNDETL NRAPGTIKNN LIVTKQGLDA AAKITDYLKN PSFDFNLTLP
DPAKFPVLPN ITLPQIPQIP EIPALPQVNG DGYKNIAKNI DQTVNNVFSS IRVGTTYAQG
VINGLQNGNF DPEKAKQDLN NVSETLQGRA DSVSYVIDVF TKFKESASTD FGKEFFQKRI
ERLTNLKSAI ENANGGVKDI ANIIGTGQEV KQDVRDATNK KLDAINNLVN QAEADYNATF
VTDFEKAVST AEQLKDKAEG VKEDAQQLRG NLNQDIKKAN ELVNQTNEAL DSGREKYDKA
VSDYSRVKTE LGKAREDLSN AGVNGLDSTK VALNDLNGQF KAAWNLVNDM IPVLENTNKV
LADVNSDKNT NNTISKLNKV KDGLQKGINL TDKGIDSINK GQKPAADVIE SINEVSKSVS
GQIGDILAKY DSEITPNFNA AIARTKEMSK NTSQILKEAD KKLPDVKKIL EDSSKGLVDG
KKKLADIKAE MPATEKKIKE LADKIRDFES EEDLKDIIRL LKNDVEKQSD YFANPVNLKE
NKLFAMPNYG SAMSPFYTVL ALWVGALLMV SLLTVEVHEE GANYKSHEIY FGRLLTFLTI
GLSQAFIVSM GDIFLLGTYV VDKFWFVLFS LFIGGVFVCI VYSLVSIFGN VGKSMAIILL
VLQVAGSGGT FPIQMTPAFF QAIYPFLPFT YAISAIRETV GGMLWDIVTR DLLVLSAFVV
VMIVAALLLK TPINKSSEKF VENAKGSKII H