Gene BCAH820_0608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_0608 
Symbol 
ID7191640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp566143 
End bp569181 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content31% 
IMG OID643554019 
Productinternalin protein 
Protein accessionYP_002449581 
Protein GI218901747 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein
[COG5386] Cell surface protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones202 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAACAAA ATAAAAGAAA ACGTATAAAT GCAATGGTTA TAGCGGCGGC GTTATCACTG 
CCGTTTGCTG TTTATTCAAC ACCTGCTTTA GCGGCAGTGG CAATTGAGGC GAATAAAACT
GGACATGTTT TAGAAGATGG TACATATGAC GCTGTTATTA AGGCGTATAA AGATAAAACG
AATGAAGAAT CTATGGCAGC TGTTTATATA AAAAATCCGA AATTAACAAT TGAGAATGGA
AAGAAAATTG TAACGGCAAC GTTAAGTGAT AGTGATTTCT TCCAATATCT AAAAACAGAA
GATATTCATA CTCCTGGTGT ATTTCATGAT GTGAAAGTAA TATCAGAAGA TAAAAAGAAA
AATGGAACGA AAGTGATTCA GTTTGAAGTA GGAGAATTAG GAAAAAGGTA TAATATGCGA
ATGCATATTT ATATTCCAAC AATGGCCTAT GATAATAAGT ACCAAGTACA ATTTGAAGTA
AATACATTGA ATTTAGATAA AGATGTTCCA GAAGAACAAA AGGAAAATAA GGAGGATAAA
TTGGATCAAC AAGATGCGAA TGTAATAATA GATAAGCAAT TACAAAGGCA TATTAATAAA
TATAACTTGA ATAGAGAGAA TTTAAATGCG CCAATAACTA AGGAAGATTT ATTAAAAGTT
AAATCTTTAA TAGTCGTTGA AGCTAAAAGT AAAGGAATAA AAGACGTAAG CGGTCTAGAA
TATATGACGA ACTTAGAAAA CTTAACGTTG GAAGAAGTTA AGTTAGAAAA TATAAAATTT
ATCTCGAATT TGAGGCAATT AAAATCAGTA AGTATAACCT ATGCCGAACT TGAAGATATT
GGACCTTTGG CTGAGTTAGA ACATATTGAG AGTTTAAGCT TGAGAAATAA TAAAATTTCA
GATTTAAGCC CACTAAGTCA AATGAAGAAG ATTAAATTGC TAGATTTAAA TAGTAATTAT
ATAAAAGATA TAAAGCCATT ATTTACAGTG AAATCTTTAA GGACTTTAAC TGTAGCAAAT
AACCAAATTA GTAATGCAGG TCTTGAAGGA GTTCACCAAT TAAAGAATTT AAAGACATTT
GAAATAAGCA ATAATGGATT GAGTAATGTC GAACATATTA ATGGAATGAA TAAATTAATT
GAATTAGGGC TTTCCAAAAA TGAATTAGTA GATCTTACAC CATTATCAAA ATTATCAGGG
TTACAAAAAC TAAATTTAGA AGAAAACTTT ATTTCAGATA TAACGCCACT TAGTCAATTA
ACAAGTTTAT ATGATTTAAA ACTAGGTTCA AATGAAATTC GTGATGTTAG ACCGGTTCAA
GAGCTAGGAA AAAGAATGTA TATTGATATT CAAAGACAAA AAATCTTTTT AGATGATGTA
GAAAAAGATA AGGAAGTTAA AATACCTATC TATAATTTAC AAGGAGAGCC AATTGATACT
ATTCAATTGA ATAGTGAAGA TGGAATAGTT AATAATGGTT CTGTTAAATG GGGTACTACC
GGTGAAAAAA CATACGAATT TATGTTAGAT ATAAAGCCAG AAGAGAATCG TATTAAGTTT
AATGGAACAG TAATTCAAAA TGTTGTTGAA AGGTTAGATG AAATAAAAGA GGATAATGAA
CAAAAGGAAA GTGTAATTCT CGATAAAACT TTACAACAAC ATATTAATAA AGAGAATTTA
GGTAGAGAGA ATTTAAACGC TCCTATCACA AAAGAAGATT TATTACAGAT TAAAAAATTA
GAGATACTTA AAGAAAAAGG AAAAGAGATA AAAGATATAA CAGGTTTAGA GTACATGACG
AACTTAGAAA AACTCACTTT AGAAGGAGTA GGTTTAAAGA ATCTCGAATT TATCTCGAAC
TTAGAAAAGT TGAACGATGT GAATGTATCT CATAATCAAA TTGAGGATAT AACACCACTA
TCTGCATTAA AAAATCTACA ATGGTTAAAT CTTGCGGACA ATCATATTAA AGATGTATCG
GTTCTCGGTT CCATGCTAGA TTTACTTAGC TTAAAATTAT CTGGAAATGA GATTCGTGAT
GTAAGGCCGT TAATACAATT AGGTCAGTGG TTTTCAATTG ATGTGGGAAG ACAAAAAATC
GTTTTAAGTG AAGCGAAAGT AAATGAGGAA ATTCAAGTTC CTGTATATGA TTTAGAAGGA
GAAAGTATTG AGAATATTAA ATTGATAAGC GAAGGAGGGA CGTTTAATAA CGGAGTAATA
AAATGGAATA CCCCAGGTGA AAAGGTATAT AAATTTGATT TAGATTCTGA TGGAATTAGC
ATAAGGTTTA ACGGAACAGT TATACAGAGT ATAGTGGAAA AAGAAGAAGT GAAAGAACCG
GTAAAAGAAG TTGAAGAAGC AAAAGAAGAA GTGAAAGAAC CGGTAAAAGA AGTTGAAGAA
GCAAAAGAAG AAGTGAAAGA ACCGGTAAAA GAAGTTGAAG AAACAAAAGA AGAAGTAAAA
GAGCCGGTAA AAGAAGTTGA AGAAGCAAAA GAAAAAGTGA AAGAACCGGT AAAAGAAGTT
GAAGAAGCAA AAGAAGAAGT GAAAGAACCG GTAAAAGAAG TTGAAGAAAC AAAAGAAGAA
GTAAAAGAGC CGGTAAAAGA AGTTGAAGAA GCAAAAGAAG AAGTGAAAGA ACCGATAAAA
GAAGTTGAAG AAACAAAAGA AGAAGTGAAA GAACCGGTAA AAGAAGTTGA AGAAACAAAA
GAAGAAATAA AAGAGCCGGT AGAAGAAGTT GAAGGTACAA AAGAAGAAGT AAAAGAGCCA
ATAAAAGAAG TTGAAGAAGC GAAAGAACCA AAGAAAGAAG TAAAAGAATC AGCAACAGGA
TTGGATCAAG AGCCAAAAGG GAAAAATCAA GTTGTTGAAA ACGAGGGAAG AAAAGCAAAC
ACTTTAAATA AACAATATAC TAATAAGCCA GAGGAAGGCA AGAAATCTTT ACCATCAACA
GGCGGTGAAG CTAGCACATC GACTTTACTT TCTGGCATAA CACTTGTTCT TTCCGCACTA
AGTATGTTCG TATTTAGAAA GAGGTTATTT AAGAAATAA
 
Protein sequence
MKQNKRKRIN AMVIAAALSL PFAVYSTPAL AAVAIEANKT GHVLEDGTYD AVIKAYKDKT 
NEESMAAVYI KNPKLTIENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVISEDKKK
NGTKVIQFEV GELGKRYNMR MHIYIPTMAY DNKYQVQFEV NTLNLDKDVP EEQKENKEDK
LDQQDANVII DKQLQRHINK YNLNRENLNA PITKEDLLKV KSLIVVEAKS KGIKDVSGLE
YMTNLENLTL EEVKLENIKF ISNLRQLKSV SITYAELEDI GPLAELEHIE SLSLRNNKIS
DLSPLSQMKK IKLLDLNSNY IKDIKPLFTV KSLRTLTVAN NQISNAGLEG VHQLKNLKTF
EISNNGLSNV EHINGMNKLI ELGLSKNELV DLTPLSKLSG LQKLNLEENF ISDITPLSQL
TSLYDLKLGS NEIRDVRPVQ ELGKRMYIDI QRQKIFLDDV EKDKEVKIPI YNLQGEPIDT
IQLNSEDGIV NNGSVKWGTT GEKTYEFMLD IKPEENRIKF NGTVIQNVVE RLDEIKEDNE
QKESVILDKT LQQHINKENL GRENLNAPIT KEDLLQIKKL EILKEKGKEI KDITGLEYMT
NLEKLTLEGV GLKNLEFISN LEKLNDVNVS HNQIEDITPL SALKNLQWLN LADNHIKDVS
VLGSMLDLLS LKLSGNEIRD VRPLIQLGQW FSIDVGRQKI VLSEAKVNEE IQVPVYDLEG
ESIENIKLIS EGGTFNNGVI KWNTPGEKVY KFDLDSDGIS IRFNGTVIQS IVEKEEVKEP
VKEVEEAKEE VKEPVKEVEE AKEEVKEPVK EVEETKEEVK EPVKEVEEAK EKVKEPVKEV
EEAKEEVKEP VKEVEETKEE VKEPVKEVEE AKEEVKEPIK EVEETKEEVK EPVKEVEETK
EEIKEPVEEV EGTKEEVKEP IKEVEEAKEP KKEVKESATG LDQEPKGKNQ VVENEGRKAN
TLNKQYTNKP EEGKKSLPST GGEASTSTLL SGITLVLSAL SMFVFRKRLF KK