Gene BAS0520 details


Gene Information

Locus tag: BAS0520
Symbol:
ID: 2849896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 552274
End bp: 555486
Gene Length: 3213 bp
Protein Length: 1070 aa
Translation table: 11
GC content: 32%
IMG OID: 637503762
Product: internalin
Protein accession: YP_026798
Protein GI: 49183546
COG category: [M] Cell wall/membrane/envelope biogenesis; [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein; [COG5386] Cell surface protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
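
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(552274, 555486)
print(length)                     # 3213
print(protein_length_aa(length))  # 1070
```

Both results agree with the Gene Length and Protein Length fields reported in the record.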


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.753967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAGTGG CTGCGACATT ATCGTTGCCG TTTGCGGTTT ATGCTACACC TATCTTAGCT 
GCTACTGCTG CTACAGAGAA TATGGCTGTA CAAAGTCCAA AAAAGCATGT TTTTGATGCG
GTAATAAAGG CTTATAAAGA TAACTCAGAT GAAGAGTCAT ATGCAACTGT ATATATAAAA
GATCCAAAGC TGACGATTGA AAATGGGAAA AGAATAATTA CAGCAACATT AAAAGATAGT
GATTTCTTTG ACTATCTGAA AGTCGAAGAT AGTAAAGAGC CAGGTGTCTT TCATGATGTA
AAGGTGCTTT CAGAAGATAA AAGAAAACAT GGAACGAAAG TTATACAATT TGAAGTAGGT
GAGTTAGGAA AAAGATATAA TATGCAAATG CATATTTTAA TTCCCACTTT AGGGTACGAT
AAGGAATTCA AAATTCAGTT TGAAGTGAAT ATGCGTACAT TTGTAGAAAG CGATATAGAA
GAGGATGAAG AAGAACAAAT TGAAGATACA CAAAATATCA TACGTGATAA ACGATTACAA
CAAGCAATTA ATAAAAATGT ATTAAATAGA AAAGATGTAA ATGAACCTAT ATTTGAAGAA
GATTTAAAAG AAATTAAAGA GTTAAATATA TATGCAGGTC AAGGAATTGA GAGTCTAAAA
GGTTTAGAGT ATATGGAAAA TCTAGAAAGA ATAACAATAC AAGGATCTGA TGTACGAAAT
ATAGCTCCTA TTTCACAACT AAAACGTTTA AAAGTAGTTG ATCTATCTTT TAATAAAATA
GAAAATGTTG AGCCGCTTGT AAACTTAGAA AAACTGGATA TACTAGAGCT ACAAAATAAT
AGAATTGCTG ACGTAACGCC ACTAAGTCAA CTTAAAAAGG TTAGGACAAT TAATTTATCA
GGTAATAAAA TTAGTGATAT AAAGCCTTTA TATAATGTTT CTTCTTTAAG AAAGCTATAT
GTAAGCAATA ATAAAATTAC TGATTTTACA GGCATTGAGC AATTGAATAA ATTAGGGACA
TTAGGGGTAG GAAGTAACGG GCTTGTAAAT ATTGAACCGA TTAGTCAGAT GAGTGGCATT
GTTGAACTTA ATCTTGAAAA AAATGATATT AAAGATATTA CATCATTATC TAAACTAACT
GGCTTACAAT CACTTAACTT GGAAGAAAAC TATGTTTCGG ATGTATCATC ACTTAGTAAT
TTGATTAATT TATATGAATT AAAACTTGCG ACAAATGAGA TTCGTGATAT AAGACCTATT
CAAGAATTAG GAAAACGAAT TAAGATTGAT GCTCAAAGGC AAAAGGTCTT TTTAGATGAA
GCCTATATGA ATGAAGAAGT GAAAATTCCT GTATATGATG TAAATGGGAC AGCACTTCAA
AATATTGAGT GGAAGAGTGA AGGCGGAAGT ATTACGAACG GAGTAATAAA GTGGAATAGC
CTTGGGGAAA AAATGTATGA ATTTAAGATG GATGCTGGCG AAAGTAAGAT AAGGTTCCAA
GGGAGGGTAA TACAAAATAT TGTTGAAAAA CGAGAAGAAA GTTCGAACGT AATTCAAGAT
ATGAAACTAA GACAATACAT GAATAAACAT AATTTTGAAC GGAAAAATGT AAATACCCCT
ATAACGAAAG AAGATTTATT AACAGTTAAG GCTTTGAAAA TTACGGATGG GAAAAAAGAG
GGGATAACAG ATTTTTCTGG ATTAGAATTC ATGACAAATG TGGAAGAATT GACATTACAA
AATGTTAATA TGAAAAATGC GGAATTTATC TCAAGTTTGA GAAATTTGAA GTCAGTAGAT
TTATCCTATA ATCAAATTGA AGATATTAAA CCGCTTCATT CATTAGAGGA TCTTGAAAAA
TTAAATGTTA GCGATAACGG TATAAAAAAT GTTCCAGAAC TATTTAAGAT GCAGAAATTA
AAAACTCTAG ACCTATCAAA TAATAAACTT GATAATGCTG CTTTGGATGG GATTCATCAA
TTGGAAAATC TAGATGCATT GTTAGTAAAT AATAACGAAA TCAATAATTT AGATGAGATT
AGCAAAGTTA GCAAATTGAA TAAGCTTGAA ATGATGAGCA ATAAAGTACG AGATATTTCT
CCATTAGCTA GCTTAAAAAA CTTACAGTGG TTAAATTTAT CTGATAATAA GATTCAAGAT
ATCTCTACTT TATCTTCTAT GCTTGATTTA CTTAGCTTGA AATTAGCTGG AAATGAGATT
CGTGATGTAC GACCGGTCAT TCAATTGGCT CAATGGATAA CAGTCGATAT TAAAAACCAA
AAAATTGTTT TAGAAGATGG ACAAATGAAT CAAGAAATCC AAATTCCTAT CTACGATTTA
GAGGGAGAAA TCTTTGAAGA TATTGAACTG AAGAGTACAG ATGGTATAGT TACTGATAGA
GGAACAGTCG TATGGAAAAC TCCAGGAGCA AAAATTTATT CATTCTCCTT AAATGGGAAT
TATCACGGTC TGTCTCTATT ATTCAGTGGT ACAGTTACTC AAAATATAGT AGCGAAAGAA
GAACCAAAAG AACCAGTGGA AGAAGTTGAA GGTTCGAAAG AAGAACCAAT AAAAGAAGCT
GAAGGATCAA AAGAAGAGCC AAAAGAGCCA GCAAAAGAAG TTGAAGGATC AAAAGAAGAG
CCAAAAGAGC CAGCAAAAGA AGTTGAAGGA TCAAAAGAAG AAGTAAAAGA ACCAGCAAAA
GAAGTTGAAG GCCCGAAAGA AGAAGTGAAA GAACCAACAA AAGAAGTTGA AGGTCCGAAA
GAAGAAGTGA AAGAACCAAC AAAAGAAGTT GAAGGTCCGA AAGAAGAAGT GAAAGAACCA
ATGAAAGAAG TTGAAGGATC GAAAGAAGAA GTGAAAGGAC CAACGAAAGA AGCTGAAGGA
TCGAAAGAAG AAGTGAAAGA GCCAACAACA GAAGTTGAAG GATCGAAAGA AGTAAAAGAA
CCAGGAAAAG AAGTTGAAGG TTCAAAAGAT GCAATAAATC AATCAGCAGT AGCTCAAGAA
ACAAACGTGA ACAATCAAGT TGGGAAAGAA AAAGTAGTAG AGAATCAAAA CATGAAAGAA
AATAAACCAG CTGTTACTAA GCAAGAAGAA AGTAAGAAAT CACTAGGAGC AACAGGTGGA
CAAGAGAATA CATCAACATT ACTTTCAGGC TTAGCACTAG TTCTTTCAGC ATTGAGTATG
TTTGTATTTA GAAAGAGATT ATTTAAGAAA TAA
 
Protein sequence
MLVAATLSLP FAVYATPILA ATAATENMAV QSPKKHVFDA VIKAYKDNSD EESYATVYIK 
DPKLTIENGK RIITATLKDS DFFDYLKVED SKEPGVFHDV KVLSEDKRKH GTKVIQFEVG
ELGKRYNMQM HILIPTLGYD KEFKIQFEVN MRTFVESDIE EDEEEQIEDT QNIIRDKRLQ
QAINKNVLNR KDVNEPIFEE DLKEIKELNI YAGQGIESLK GLEYMENLER ITIQGSDVRN
IAPISQLKRL KVVDLSFNKI ENVEPLVNLE KLDILELQNN RIADVTPLSQ LKKVRTINLS
GNKISDIKPL YNVSSLRKLY VSNNKITDFT GIEQLNKLGT LGVGSNGLVN IEPISQMSGI
VELNLEKNDI KDITSLSKLT GLQSLNLEEN YVSDVSSLSN LINLYELKLA TNEIRDIRPI
QELGKRIKID AQRQKVFLDE AYMNEEVKIP VYDVNGTALQ NIEWKSEGGS ITNGVIKWNS
LGEKMYEFKM DAGESKIRFQ GRVIQNIVEK REESSNVIQD MKLRQYMNKH NFERKNVNTP
ITKEDLLTVK ALKITDGKKE GITDFSGLEF MTNVEELTLQ NVNMKNAEFI SSLRNLKSVD
LSYNQIEDIK PLHSLEDLEK LNVSDNGIKN VPELFKMQKL KTLDLSNNKL DNAALDGIHQ
LENLDALLVN NNEINNLDEI SKVSKLNKLE MMSNKVRDIS PLASLKNLQW LNLSDNKIQD
ISTLSSMLDL LSLKLAGNEI RDVRPVIQLA QWITVDIKNQ KIVLEDGQMN QEIQIPIYDL
EGEIFEDIEL KSTDGIVTDR GTVVWKTPGA KIYSFSLNGN YHGLSLLFSG TVTQNIVAKE
EPKEPVEEVE GSKEEPIKEA EGSKEEPKEP AKEVEGSKEE PKEPAKEVEG SKEEVKEPAK
EVEGPKEEVK EPTKEVEGPK EEVKEPTKEV EGPKEEVKEP MKEVEGSKEE VKGPTKEAEG
SKEEVKEPTT EVEGSKEVKE PGKEVEGSKD AINQSAVAQE TNVNNQVGKE KVVENQNMKE
NKPAVTKQEE SKKSLGATGG QENTSTLLSG LALVLSALSM FVFRKRLFKK
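
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is one way to do that, assuming plain-text input copied from this page; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the GC value is computed as a simple fraction of G and C bases.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by dropping all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used only as sample input.
first_line = (
    "ATGTTAGTGG CTGCGACATT ATCGTTGCCG "
    "TTTGCGGTTT ATGCTACACC TATCTTAGCT"
)
seq = clean_sequence(first_line)
print(len(seq))           # 60
print(seq.startswith("ATG"))  # True
print(gc_fraction(seq))
```

Applied to the full gene sequence, the same helpers could be used to compare the joined length and GC fraction against the Gene Length and GC content fields in the record.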