Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS0520 |
Symbol | |
ID | 2849896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | + |
Start bp | 552274 |
End bp | 555486 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637503762 |
Product | internalin |
Protein accession | YP_026798 |
Protein GI | 49183546 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.753967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGTGG CTGCGACATT ATCGTTGCCG TTTGCGGTTT ATGCTACACC TATCTTAGCT GCTACTGCTG CTACAGAGAA TATGGCTGTA CAAAGTCCAA AAAAGCATGT TTTTGATGCG GTAATAAAGG CTTATAAAGA TAACTCAGAT GAAGAGTCAT ATGCAACTGT ATATATAAAA GATCCAAAGC TGACGATTGA AAATGGGAAA AGAATAATTA CAGCAACATT AAAAGATAGT GATTTCTTTG ACTATCTGAA AGTCGAAGAT AGTAAAGAGC CAGGTGTCTT TCATGATGTA AAGGTGCTTT CAGAAGATAA AAGAAAACAT GGAACGAAAG TTATACAATT TGAAGTAGGT GAGTTAGGAA AAAGATATAA TATGCAAATG CATATTTTAA TTCCCACTTT AGGGTACGAT AAGGAATTCA AAATTCAGTT TGAAGTGAAT ATGCGTACAT TTGTAGAAAG CGATATAGAA GAGGATGAAG AAGAACAAAT TGAAGATACA CAAAATATCA TACGTGATAA ACGATTACAA CAAGCAATTA ATAAAAATGT ATTAAATAGA AAAGATGTAA ATGAACCTAT ATTTGAAGAA GATTTAAAAG AAATTAAAGA GTTAAATATA TATGCAGGTC AAGGAATTGA GAGTCTAAAA GGTTTAGAGT ATATGGAAAA TCTAGAAAGA ATAACAATAC AAGGATCTGA TGTACGAAAT ATAGCTCCTA TTTCACAACT AAAACGTTTA AAAGTAGTTG ATCTATCTTT TAATAAAATA GAAAATGTTG AGCCGCTTGT AAACTTAGAA AAACTGGATA TACTAGAGCT ACAAAATAAT AGAATTGCTG ACGTAACGCC ACTAAGTCAA CTTAAAAAGG TTAGGACAAT TAATTTATCA GGTAATAAAA TTAGTGATAT AAAGCCTTTA TATAATGTTT CTTCTTTAAG AAAGCTATAT GTAAGCAATA ATAAAATTAC TGATTTTACA GGCATTGAGC AATTGAATAA ATTAGGGACA TTAGGGGTAG GAAGTAACGG GCTTGTAAAT ATTGAACCGA TTAGTCAGAT GAGTGGCATT GTTGAACTTA ATCTTGAAAA AAATGATATT AAAGATATTA CATCATTATC TAAACTAACT GGCTTACAAT CACTTAACTT GGAAGAAAAC TATGTTTCGG ATGTATCATC ACTTAGTAAT TTGATTAATT TATATGAATT AAAACTTGCG ACAAATGAGA TTCGTGATAT AAGACCTATT CAAGAATTAG GAAAACGAAT TAAGATTGAT GCTCAAAGGC AAAAGGTCTT TTTAGATGAA GCCTATATGA ATGAAGAAGT GAAAATTCCT GTATATGATG TAAATGGGAC AGCACTTCAA AATATTGAGT GGAAGAGTGA AGGCGGAAGT ATTACGAACG GAGTAATAAA GTGGAATAGC CTTGGGGAAA AAATGTATGA ATTTAAGATG GATGCTGGCG AAAGTAAGAT AAGGTTCCAA GGGAGGGTAA TACAAAATAT TGTTGAAAAA CGAGAAGAAA GTTCGAACGT AATTCAAGAT ATGAAACTAA GACAATACAT GAATAAACAT AATTTTGAAC GGAAAAATGT AAATACCCCT ATAACGAAAG AAGATTTATT AACAGTTAAG GCTTTGAAAA TTACGGATGG GAAAAAAGAG GGGATAACAG ATTTTTCTGG ATTAGAATTC ATGACAAATG TGGAAGAATT GACATTACAA AATGTTAATA TGAAAAATGC GGAATTTATC TCAAGTTTGA GAAATTTGAA GTCAGTAGAT TTATCCTATA ATCAAATTGA AGATATTAAA CCGCTTCATT CATTAGAGGA TCTTGAAAAA TTAAATGTTA GCGATAACGG TATAAAAAAT GTTCCAGAAC TATTTAAGAT GCAGAAATTA AAAACTCTAG ACCTATCAAA TAATAAACTT GATAATGCTG CTTTGGATGG GATTCATCAA TTGGAAAATC TAGATGCATT GTTAGTAAAT AATAACGAAA TCAATAATTT AGATGAGATT AGCAAAGTTA GCAAATTGAA TAAGCTTGAA ATGATGAGCA ATAAAGTACG AGATATTTCT CCATTAGCTA GCTTAAAAAA CTTACAGTGG TTAAATTTAT CTGATAATAA GATTCAAGAT ATCTCTACTT TATCTTCTAT GCTTGATTTA CTTAGCTTGA AATTAGCTGG AAATGAGATT CGTGATGTAC GACCGGTCAT TCAATTGGCT CAATGGATAA CAGTCGATAT TAAAAACCAA AAAATTGTTT TAGAAGATGG ACAAATGAAT CAAGAAATCC AAATTCCTAT CTACGATTTA GAGGGAGAAA TCTTTGAAGA TATTGAACTG AAGAGTACAG ATGGTATAGT TACTGATAGA GGAACAGTCG TATGGAAAAC TCCAGGAGCA AAAATTTATT CATTCTCCTT AAATGGGAAT TATCACGGTC TGTCTCTATT ATTCAGTGGT ACAGTTACTC AAAATATAGT AGCGAAAGAA GAACCAAAAG AACCAGTGGA AGAAGTTGAA GGTTCGAAAG AAGAACCAAT AAAAGAAGCT GAAGGATCAA AAGAAGAGCC AAAAGAGCCA GCAAAAGAAG TTGAAGGATC AAAAGAAGAG CCAAAAGAGC CAGCAAAAGA AGTTGAAGGA TCAAAAGAAG AAGTAAAAGA ACCAGCAAAA GAAGTTGAAG GCCCGAAAGA AGAAGTGAAA GAACCAACAA AAGAAGTTGA AGGTCCGAAA GAAGAAGTGA AAGAACCAAC AAAAGAAGTT GAAGGTCCGA AAGAAGAAGT GAAAGAACCA ATGAAAGAAG TTGAAGGATC GAAAGAAGAA GTGAAAGGAC CAACGAAAGA AGCTGAAGGA TCGAAAGAAG AAGTGAAAGA GCCAACAACA GAAGTTGAAG GATCGAAAGA AGTAAAAGAA CCAGGAAAAG AAGTTGAAGG TTCAAAAGAT GCAATAAATC AATCAGCAGT AGCTCAAGAA ACAAACGTGA ACAATCAAGT TGGGAAAGAA AAAGTAGTAG AGAATCAAAA CATGAAAGAA AATAAACCAG CTGTTACTAA GCAAGAAGAA AGTAAGAAAT CACTAGGAGC AACAGGTGGA CAAGAGAATA CATCAACATT ACTTTCAGGC TTAGCACTAG TTCTTTCAGC ATTGAGTATG TTTGTATTTA GAAAGAGATT ATTTAAGAAA TAA
|
Protein sequence | MLVAATLSLP FAVYATPILA ATAATENMAV QSPKKHVFDA VIKAYKDNSD EESYATVYIK DPKLTIENGK RIITATLKDS DFFDYLKVED SKEPGVFHDV KVLSEDKRKH GTKVIQFEVG ELGKRYNMQM HILIPTLGYD KEFKIQFEVN MRTFVESDIE EDEEEQIEDT QNIIRDKRLQ QAINKNVLNR KDVNEPIFEE DLKEIKELNI YAGQGIESLK GLEYMENLER ITIQGSDVRN IAPISQLKRL KVVDLSFNKI ENVEPLVNLE KLDILELQNN RIADVTPLSQ LKKVRTINLS GNKISDIKPL YNVSSLRKLY VSNNKITDFT GIEQLNKLGT LGVGSNGLVN IEPISQMSGI VELNLEKNDI KDITSLSKLT GLQSLNLEEN YVSDVSSLSN LINLYELKLA TNEIRDIRPI QELGKRIKID AQRQKVFLDE AYMNEEVKIP VYDVNGTALQ NIEWKSEGGS ITNGVIKWNS LGEKMYEFKM DAGESKIRFQ GRVIQNIVEK REESSNVIQD MKLRQYMNKH NFERKNVNTP ITKEDLLTVK ALKITDGKKE GITDFSGLEF MTNVEELTLQ NVNMKNAEFI SSLRNLKSVD LSYNQIEDIK PLHSLEDLEK LNVSDNGIKN VPELFKMQKL KTLDLSNNKL DNAALDGIHQ LENLDALLVN NNEINNLDEI SKVSKLNKLE MMSNKVRDIS PLASLKNLQW LNLSDNKIQD ISTLSSMLDL LSLKLAGNEI RDVRPVIQLA QWITVDIKNQ KIVLEDGQMN QEIQIPIYDL EGEIFEDIEL KSTDGIVTDR GTVVWKTPGA KIYSFSLNGN YHGLSLLFSG TVTQNIVAKE EPKEPVEEVE GSKEEPIKEA EGSKEEPKEP AKEVEGSKEE PKEPAKEVEG SKEEVKEPAK EVEGPKEEVK EPTKEVEGPK EEVKEPTKEV EGPKEEVKEP MKEVEGSKEE VKGPTKEAEG SKEEVKEPTT EVEGSKEVKE PGKEVEGSKD AINQSAVAQE TNVNNQVGKE KVVENQNMKE NKPAVTKQEE SKKSLGATGG QENTSTLLSG LALVLSALSM FVFRKRLFKK
|
| |