Gene BCG9842_B4750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4750 
Symbol 
ID7183450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp527943 
End bp530930 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content31% 
IMG OID643548324 
Productinternalin protein 
Protein accessionYP_002444017 
Protein GI218895606 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein
[COG5386] Cell surface protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.000909405 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAAACAAA ATAAAAGAAA ACGTATAAAT GCAATGATTA TAGCGGCGGC GTTATCACTT 
CCGTTTGCTG TTTATTCAAC ACCTGCTTTA GCGGCAGTGG CAATTGAGGC GAATAAAACG
GGACAAGGTT TAGAAGATGG TACATATGAT GCTGTTATTA AAGCGTATAA AGATAAAACG
AATGAAGAGT CTATGGCAGC TGTTTATATA AAGGATCCGA AATTAACAAT TGAGAATGGA
AAGAAAATTG TAACAGCAAC GTTAAGTGAT AGTGATTTCT TCCAATACTT GAAAACAGAG
GATATTCATA CGCCAGGTGT GTTTCATGAT GTAAAGGTCC TATCAGAAGA CAAAAAGAAA
AATGGGACGA AGGTTATTCA ATTTGAAGTT GGAGAATTAG GAAAAACATA CAATATGCAA
ATGCATATTT ATATTCCAAC GATGGCCTAT GATAATAAAT ATCAAGTGCA GTTTGAAGTG
AATGCTATAA ATTTAGAAAA CAATGTTTCA GAAAAACAAA AGGAAAATAA AGAGGAGCAA
CAAGATGAAA ACGGAAATGT AATATTAGAT AAGCAATTAC AAAAATATAT TAATAAATAT
AACTTAGATA GAGATAATGT AGATGCGCCA ATCACAAAGA AAGATTTATT ACAAATTAAA
ACATTATCCA TTTATTCAGG TAAAGGGATA AATGAAATAG CTGGTTTAGA GTATATGACA
AATTTAGAGA AGTTGACGTT ACGAGAGTCT AATGTAACAG ATATATCAGC TATCTCGAAA
TTGAGAAGTT TGAAGTACGT TGATTTAACT TCTAATTCAA TTGAAAGTAT TCATCCAATT
GGGCAATTAG AGAATATTAA TATGCTTTTT TTAAGAGATA ATAAAATTTC TGATCTTACA
CCATTAAGTA AAATGAAAAA AATCAAAACA TTAGATTTAA TCGGTAATAA CATTAAAGAT
ATCCAGCCAT TATTTACATT ATCAACTATG AAACAATTAT ACTTAGCAAA TAATCAAATC
AGTGATCTTA ATGGAATTGA TCGATTAAAT AATGTGGAAC TATTATGGAT AGGGAACAAT
AAAATTAATA ATGTTGAATC TATTAGTAAA ATGAGTAATC TTATTGAACT AGAAATTGCT
GATAGTGAAA TAAAAGATAT ATCACCATTA TCTCAATTAG GAATTTTACA AGTGCTGAAT
TTAGAAGAGA ATTATATCTC TGATATATCG CCGTTGAGCA CTTTAACAAA TTTACATGAG
ATAAATCTTG GAGCAAATGA AATTTCTGAC GTAAGGCCTG TTGAGGAATT AGGTAAGCGA
ATTTCAATTG ACATTCAAAG ACAAAAAATC TTTTTAAATG AAGCAAGCGT AGATGAGGAA
TTAAAAATCC CAGTATACAA CCTTAAGGGA GAACCACTTC AAAATATTAA TGTAAAAAGT
GAGGGGGCTA CTCTGAATAA CGGATTTATA AAATGGAATA GTCCTGGAGA AAAAATATAT
GAATTTAAAC TAGATACTAA TTCTACTGAA AGTAAAATAA GATTTAATGG TACGGTTATA
CAGAATATAG TTGAAAAACA AAAAGAACGT GCAAATGTAA TTCTCGATAA AACTTTACAA
CAACATATTA ATAAAGAGAA TTTAGGTAGA GAGAACTTAA ACGCTCCTAT CACAAAAGAA
GATTTATTAC AGGTTAAAAA ATTAGAGATA CTTAAAGAAA AAGGAAATGA GATAAAAGAT
ATAACAGGTT TAGAGTACAT GACGAACTTA GAAAACCTTA CTTTAGAAGG AGTAGGCCTG
AAAAATATTG ATTTCATCTC AAACTTGAAA CGATTGAATA ATGTGAATGT ATCTCATAAT
CAAATTGAAG ATATAACACC GCTATCTTCA TTGAAAAATT TACAGTGGTT AAATCTTACT
GAGAATCGTA TTACAGATGT AACGGTTCTT GGCTCAATGT TAGACTTACT TAGTTTAAAA
TTAGCTGAAA ATGAGATTCG TGATGTAAGG CCATTAATAC AATTAGGTCA GTGGGTAACA
ATTGATGTTA GAAGGCAAAA GGTCATTTTG GATGATGCAG AAATAAATAA AGAAGTGAAA
ATACCTGTAT ATGATTTAGA GGGAGAGCCA ATTGAAAAGA TTACACTAAA GAGTGAAGGT
GGAACTCTTA CTGATGAGGG AATCATTTGG CGTACTTTAG GAGAAAAAAT ATATGAATTT
GATTTAGATG CAGATCATTA TGAGACTGGC ATATTATATA GTGGCATTGT AATGCAGAAT
ATAGTAGAAA AATTAATACC AAAAGAAGAA GTGAAAGAAC CAACAAAGGA AGTTGAAGAG
TCAAAAGAAG AAGTGAAAGA ACCAACAAAA GAAGTGGAAG AAACAAAAGA AGAAGTGAAA
GAACCAACAA AGGAAGTGGA AGAGTCAAAA GAAGAAGTGA AAGAACCAAC AAAGGAAGTG
GAAGAGTCAA AAGAAGAAGT GAAAGAACCA ACAAAAGAAG TTGAAGAGTC AAAAGAAGAA
GTAAAAGAAC CAACAAAAGA AGTTGAAGAG TCAAAAGAAG AAGTGAAAGA ACCAACAAAG
GAAGTGGAAG AGTCAAAAGA AGAAGTGAAA GAACCAACAA AGGAAGTGGA AGAGTCAAAA
GAAGAAGTGA AAGAACCAAC AAAAGAAGTT GAAGAGTCAA AAGAAGAAGT AAAAGAACCA
ACGAAAGAAG TGGAAGAGTC AAAAGAAGAA GTGAAAGAAC CAACAAAAGA AGTTGAAGAA
GCGAAAGAGG AAGTAAAAGA GCCAAAAGGA AATAATCAGG TTGTTGAAAA CGAAGGCAGA
ACAGCAGATA CTTTAAATAC ACAACATGTT AATAAGACGG AGGAAGGAAA GAAATCTTTA
CCATCAACAG GCGGTGAAGC TAGCACATCG ACTTTACTTT CTGGAATAAC ACTTGTTCTT
TCCGCACTAA GTATGTTCGT ATTTAGAAAG AGGTTATTTA AGAAATAA
 
Protein sequence
MKQNKRKRIN AMIIAAALSL PFAVYSTPAL AAVAIEANKT GQGLEDGTYD AVIKAYKDKT 
NEESMAAVYI KDPKLTIENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVLSEDKKK
NGTKVIQFEV GELGKTYNMQ MHIYIPTMAY DNKYQVQFEV NAINLENNVS EKQKENKEEQ
QDENGNVILD KQLQKYINKY NLDRDNVDAP ITKKDLLQIK TLSIYSGKGI NEIAGLEYMT
NLEKLTLRES NVTDISAISK LRSLKYVDLT SNSIESIHPI GQLENINMLF LRDNKISDLT
PLSKMKKIKT LDLIGNNIKD IQPLFTLSTM KQLYLANNQI SDLNGIDRLN NVELLWIGNN
KINNVESISK MSNLIELEIA DSEIKDISPL SQLGILQVLN LEENYISDIS PLSTLTNLHE
INLGANEISD VRPVEELGKR ISIDIQRQKI FLNEASVDEE LKIPVYNLKG EPLQNINVKS
EGATLNNGFI KWNSPGEKIY EFKLDTNSTE SKIRFNGTVI QNIVEKQKER ANVILDKTLQ
QHINKENLGR ENLNAPITKE DLLQVKKLEI LKEKGNEIKD ITGLEYMTNL ENLTLEGVGL
KNIDFISNLK RLNNVNVSHN QIEDITPLSS LKNLQWLNLT ENRITDVTVL GSMLDLLSLK
LAENEIRDVR PLIQLGQWVT IDVRRQKVIL DDAEINKEVK IPVYDLEGEP IEKITLKSEG
GTLTDEGIIW RTLGEKIYEF DLDADHYETG ILYSGIVMQN IVEKLIPKEE VKEPTKEVEE
SKEEVKEPTK EVEETKEEVK EPTKEVEESK EEVKEPTKEV EESKEEVKEP TKEVEESKEE
VKEPTKEVEE SKEEVKEPTK EVEESKEEVK EPTKEVEESK EEVKEPTKEV EESKEEVKEP
TKEVEESKEE VKEPTKEVEE AKEEVKEPKG NNQVVENEGR TADTLNTQHV NKTEEGKKSL
PSTGGEASTS TLLSGITLVL SALSMFVFRK RLFKK