Gene BCAH187_A0682 details


Gene Information

Locus tag: BCAH187_A0682
Symbol:
ID: 7074929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH187
Kingdom: Bacteria
Replicon accession: NC_011658
Strand:
Start bp: 624887
End bp: 627868
Gene Length: 2982 bp
Protein Length: 993 aa
Translation table: 11
GC content: 31%
IMG OID: 643449175
Product: internalin protein
Protein accession: YP_002336685
Protein GI: 217958141
COG category: [M] Cell wall/membrane/envelope biogenesis; [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein; [COG5386] Cell surface protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
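
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 624887
end_bp = 627868

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2982
print(protein_length)  # 993
```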


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000101433
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAGAA ATAAAAGAAA ACATATAAAT GCAATGATTA TAGCGGCGAC GTTATCACTT 
CCGTTTGCTG TTTATTCAAC ACCTGCTTTA GCGGCAGTAG CAATTGAGGC GAATAAAACG
GGACAAGGTT TAGAAGATGG TACATATGAC GCTGTTATTA AAGCGTATAA AGATAAAACA
AATGAAGAGT CTATGGCAGC TGTTTATATA AAAAATCCGA AATTAACAGT TGAGAACGGA
AAGAAAATTG TAACAGCAAC GTTAAGTGAT AGTGATTTCT TTCAATACTT GAAAACAGAG
GATATACATA CTCCAGGTGT GTTTCATGAT GTGAAAGTAA TATCAGAAGA TAAAAAGAAA
AATGGAACGA AAGTGATTCA GTTTGAAGTA GGGGAATTAG GGAAGAGATA TAATATGCAG
ATGCATATTT ATATTCCGAC AATGGCTTAT GACAATAAGT ATCAAGTACA GTTTGAAGTA
AACACATTAA ATTTAGAAAA TAATGTTCCA GAAGAACAAA AGGAAAATGA AGAGGATAAA
TTGGATCAAC AAGATAAAAA CGGAAATGTA GTATTAGATA AGCAATTACA AAAGCATATT
AATAAATATA ACTTGAATAG AGAAAATTTA GATACCCCAA TAACTAAGGA AGATTTATTA
AAAGTTAAAT CTTTAATAGT CGTTGAAGCC AAAAGTAAAG GAATAAAAGA CGTAAGCGGT
CTAGAATATA TGAAGAACTT AGAAAACTTA ACGTTGGAAG AAGTTAAGTT AGAAAATATA
AAATTTATCT CGAATTTGAG GCAATTGAAA TCATTAAGTA TAACCTATGG CGAACTTGAA
GATATTGGAC CTTTGGCTGA GTTAGAACAT ATTGAGAGTT TAAGCTTGAG AAATAATAAA
ATTTCAGATT TAAGCCCACT AAGTCAAATG AAGAAGATTA AATTGCTAGA TTTAAATAGT
AATTATATAA AAGATATTAA ACCATTATTT ACAGCGAAAT CTTTAAGGAC TTTGACTGTA
GCAAATAATC AAATTAGTAA TGCTAATCTT GCTGGGATTG AGCAACTGAA GAATGTGAAG
AGTTTATCTT TAAGTAACAA TGGACTTACT AATATTGAAC ATATTACACC AATGAAAAAA
TTAGTAGAGT TAGACCTTTC TAAAAATGAA TTAGAAAACA TCGAACCTTT ATCAAGAATG
TCTACTGTAC AATCACTTAA TTTAGAAGAA AACTATATTT CAGATATAAC ACCACTTAGT
CAATTAACAG GTTTATATGA TTTAAAGCTA GCGTCAAATG AAATTCGTGA TGTTAGACCG
GTTCAAGAGT TAGGAAAAAG AATGTACATT GACGTTCAAA GACAAAAAAT CTTTTTAGAT
GATGTAGAAA AAGATAAGGA AGTTAAAATA CCTATCTATA ATTTACAAGG AGAGCCACTC
GATACTATTC AATTAAAGAA TGGAGATGGA ATAGTTAATA ATGGTTCTGT TAAATGGAGT
ACTACCGGTG AAAAAACATA CGAATTTATA TTAGATATAA AGCCAGAAGA AAATCGTATT
AAGTTTAATG GAATAGTAAT TCAAAATGTT GTTGAAAGGT TAGATGAAAT AAAAGAGGAT
AATGAACAAA AGGAAAATGT AATTCTCGAT AAAACTTTAC AACAACATAT TAATAAAGAG
AATTTAGGTA GAGAGAATTT AAACACTCCC ATCACCAAAG AAGATTTATT ACAGATTAAA
AAATTAGAGA TCCTTAAAGA AAAAGGAAAT GAGATAAAAG ATATAACAGG TTTAGAGTAC
ATGACGAACT TAGAAAATCT CACTTTAGAA GGAGTAGGTC TGAAAAATAT TGAGTTCATC
TCAAACTTGA AACAATTGAA TAATGTGAAT GTATCTCATA ATCAAATTGA AGATATAACA
CCACTATCTT CATTGGAAAA TTTACAGTGG TTAAATCTTG AAGACAATCA TATTAAAGAT
GTAACGGTTA TTGGTTCCAT GCTAAACCTA TTTAGCTTAA ATCTAGCTGG GAATGAGATT
CGTGATGTAA GGCCGTTAAT ACAATTAGGC CAGTGGGGAA CAATTGATGT TAGAAGGCAA
AAGGTCATTT TGGATGATGC AGAAATAAAT AAAGAAGTGA AGATACCTGT ATATGATTTA
GAAGGGGAAC GAATTGAAAA GATTACGTTA AAGAGTGCAG GTGGAATGCT TACTGATGAG
GGAATCATTT GGAGTACTCT AGGAGAAAAA ATATATGAAT TTGACTTGGA TGCAGATCAT
TATGAGACGG GCATATTATA TAGTGGCATC GTCATGCAGA ATATAGTAGA AAAATTAATA
CCGAAAGAAG AAGTAAAAGA GCCGGAAAAA GAAGTTGAAG AAACAAAAGA AGAAGTGAAA
GAAACGATAA AAGAAGTTGA AGAAGAGCAA GAAGAAGTAA AAGAGCCGGA AAAAGAAGTT
GAAGAAACAA AAGAAGAAGT GAAAGAAACG ATAAAAGAAG TTGAAGAAGA GCAAGAAGAA
GTAAAAGAGC CGGAAAAAGA AGTTGAAGAA ACAAAAGAAG AAGTGAAAGA AACGATAAAA
GAAGTTGAAG AAGAGCAAGA GGAAGTAAAA GAGTCAATAA AAGAAGTTGA AGAAGAGCAA
GAGGAAGTAA AAGAGCCAAT AAAAGAAGTT GAAGAGGTAA AAGAAGAAGT GGACGAGCCA
ACAACAGGAG TTGAAGAGGC GAAAGCTGAG ATAAAAGGAA CAGGAAAAGA AATTGAAGGT
TCAAAAGACG CAGTAAATCA ATCCACAGTA GTCCAAGAAC AAAACGTGAA TAATCAAGTT
GTGAAAGAAA ATAAACCAGT TGTTAATAAG CAAGAAGAAA GTAAGAAATC ATTAGGAGCA
ACAGGTGGAC AAGAGAATAC ATCAACATTA CTTTCAGGCA TAGCGTTAGT TCTTTCAGCG
ATGAGTATGT TTGTATTTAG AAAGAGATTA TTTAAGAAAT AA
 
Protein sequence
MKRNKRKHIN AMIIAATLSL PFAVYSTPAL AAVAIEANKT GQGLEDGTYD AVIKAYKDKT 
NEESMAAVYI KNPKLTVENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVISEDKKK
NGTKVIQFEV GELGKRYNMQ MHIYIPTMAY DNKYQVQFEV NTLNLENNVP EEQKENEEDK
LDQQDKNGNV VLDKQLQKHI NKYNLNRENL DTPITKEDLL KVKSLIVVEA KSKGIKDVSG
LEYMKNLENL TLEEVKLENI KFISNLRQLK SLSITYGELE DIGPLAELEH IESLSLRNNK
ISDLSPLSQM KKIKLLDLNS NYIKDIKPLF TAKSLRTLTV ANNQISNANL AGIEQLKNVK
SLSLSNNGLT NIEHITPMKK LVELDLSKNE LENIEPLSRM STVQSLNLEE NYISDITPLS
QLTGLYDLKL ASNEIRDVRP VQELGKRMYI DVQRQKIFLD DVEKDKEVKI PIYNLQGEPL
DTIQLKNGDG IVNNGSVKWS TTGEKTYEFI LDIKPEENRI KFNGIVIQNV VERLDEIKED
NEQKENVILD KTLQQHINKE NLGRENLNTP ITKEDLLQIK KLEILKEKGN EIKDITGLEY
MTNLENLTLE GVGLKNIEFI SNLKQLNNVN VSHNQIEDIT PLSSLENLQW LNLEDNHIKD
VTVIGSMLNL FSLNLAGNEI RDVRPLIQLG QWGTIDVRRQ KVILDDAEIN KEVKIPVYDL
EGERIEKITL KSAGGMLTDE GIIWSTLGEK IYEFDLDADH YETGILYSGI VMQNIVEKLI
PKEEVKEPEK EVEETKEEVK ETIKEVEEEQ EEVKEPEKEV EETKEEVKET IKEVEEEQEE
VKEPEKEVEE TKEEVKETIK EVEEEQEEVK ESIKEVEEEQ EEVKEPIKEV EEVKEEVDEP
TTGVEEAKAE IKGTGKEIEG SKDAVNQSTV VQEQNVNNQV VKENKPVVNK QEESKKSLGA
TGGQENTSTL LSGIALVLSA MSMFVFRKRL FKK
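
The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (the bacterial code listed in this record), TTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. The function name `translate_cds` and the start-codon set are illustrative assumptions; the set covers only the common initiators ATG, GTG and TTG.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence; spaces and line breaks are ignored."""
    dna = "".join(dna.split()).upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":     # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
first_30 = "TTGAAAAGAA ATAAAAGAAA ACATATAAAT"
print(translate_cds(first_30))  # MKRNKRKHIN
```

Passing the full gene sequence to the same function should reproduce the protein sequence above, provided the sequence is pasted without alteration.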