Gene Information |
Locus tag | BT9727_0463 |
Symbol | |
ID | 2858727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | + |
Start bp | 541177 |
End bp | 544041 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637511884 |
Product | internalin protein |
Protein accession | YP_034811 |
Protein GI | 49481764 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
| |
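The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record lists the gene as unspliced). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 541177
END_BP = 544041
GENE_LENGTH_BP = 2865
PROTEIN_LENGTH_AA = 954


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))   # 2865
print(protein_length(GENE_LENGTH_BP))  # 954
```

Both results match the Gene Length and Protein Length rows, so the record is consistent under these assumptions.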
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0349325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACAT TTCGAATTAT ATATTTCGGA GGAGATGTAA GGTTGAAACA AAATAAAAGA AAATGTATAA ATGCAATGGT TATAGCGGCG GCGTTATCAC TGCCGTTTGC TGTTTATTCA ACACCTGCTT TAGCGGCAGT GGCAATTGAG GCGAATAAAA CTGGACATGT TTTAGAAGAT GGTACATATG ACGCTGTTAT TAAGGCGTAT AAAGATAAAA CGAATGAAGA ATCTATGGCA GCTGTTTATA TAAAAAATCC GAAATTAACA ATTGAGAATG GAAAGAAAAT TGTAACGGCA ACGTTAAGTG ATAGTGATTT CTTCCAATAT CTAAAAACAG AAGATATTCA TACTCCTGGT GTATTTCATG ATGTGAAAGT AATATCAGAA GATAAAAAGA AAAATGGAAC GAAAGTGATT CAGTTTGAAG TAGGAGAATT AGGAAAAAGG TATAATATGC GAATGCATAT TTATATTCCA ACAATGGCCT ATGATAATAA GTACCAAGTA CAATTTGAAG TAAATACATT GAATTTAGAT AAAGATGTTC CAGAAGAACA AAAGGAAAAT AAGGAGGATA AATTGGATCA ACAAGATGCG AATGTAATAA TAGATAAGCA ATTACAAAGG CATATTAATA AATATAACTT GAATAGAGAG AATTTAAATG CGCCAATAAC TAAGGAAGAT TTATTAAAAG TTAAATCTTT AATAGTCGTT GAAGCTAAAA GTAAAGGAAT AAAAGACGTA AGCGGTCTAG AATATATGAC GAACTTAGAA AACTTAACGT TGGAAGAAGT TAAGTTAGAA AATATAAAAT TTATCTCGAA TTTGAGGCAA TTAAAATCAG TAAGTATAAC CTATGCCGAA CTTGAAGATA TTGGACCTTT GGCTGAGTTA GAACATATTG AGAGTTTAAG CTTGAGAAAT AATAAAATTT CAAATTTAAG CCCACTAAGT CAAATGAAGA AGATTAAATT GCTAGATTTA AATAGTAATT ATATAAAAGA TATAAAGCCA TTATTTACAG TGAAATCTTT AAGGACTTTA ACTGTAGCAA ATAATCAAAT TAGTAATGCA GGTCTTGAAG GAGTTCACCA ATTAAAGAAT TTAAAGACAT TTGAAATAAG CAATAATGGA TTGAGTAATG TCGAACATAT TAATGGAATG AATAAATTAA TTGAATTAGG GCTTTCCAAA AATGAATTGG TAGATCTTAC ACCATTATCA AAATTATCAG GGTTACAAAA ACTAAATTTA GAAGAAAACT TTATTTCAGA TATAACGCCA CTTAGTCAAT TAACAAGTTT ATATGATTTA AAACTAGGTT CAAATGAAAT TCGTGATGTT AGACCGGTTC AAGAGCTAGG AAAAAGAATG TATATTGATA TTCAAAGACA AAAAATCTTT TTAGAAGATG TAGAAAAAGA TAAGGAAGTT AAAATACCTA TCTATAATTT ACAAGGAGAG CCAATTGATA CTATTCAATT GAATAGTGAA GATGGAATAG TTAATAATGG TTCTGTTAAA TGGGGTACTA CCGGTGAAAA AACATACGAA TTTATGTTAG ATATAAAGCC AGAAGAGAAT CGTATTAAGT TTAATGGAAC AGTAATTCAA AATGTTGTTG AAAGGTTAGA TGAAATAAAA GAGGATAATG AACAAAAGGA AAGTGTAATT CTCGATAAAA CTTTACAACA ACATATTAAT AAAGAGAATT TAGGTAGAGA GAATTTAAAC GCTCCTATCA CAAAAGAAGA TTTATTACAG ATTAAAAAAT TAGAGATACT TAAAGAAAAA 
GAAAAAGAAA AAGGAAAAGA GATAAAAGAT ATAACAGGTT TAGAGTACAT GACAAACTTA GAAAAACTCA CTTTAGAAGG AGTAGGTTTA AAGAATCTCG AATTTATCTC GAACTTAGAA AAGTTGAACG ATGTGAATGT ATCTCATAAT CAAATTGAGG ATATAACACC ACTATCTTCA TTAAAAAGTT TACAGTGGTT AAATCTTGCT GATAATCAAA TTAAAGATGT TTCAGTTCTT AGCCCAATGT TAGACCTACT CAGTTTAAAA TTAGCTGAAA ATGAGATTCG TGATGTAAGG CCGTTAATAC AATTAGGCCA ATGGTTTTCA ATTGATGCCG GAAGACAGAA AATCTTTTTA GATGAAGCAA AAGTAAATGA AGAAATTCAA GTTCCTGTAT ATGATTTAGA AGGAGAAATT ATTGAGAATA TTAAACTGAC AAGTGAGGAT GGAACATTTA ATAACGGAGT AATAAAATGG AGTACTCCAG GTGAAAAGGT CTATAAATTT GATTTAGATT CGGATGAAAT TAGTATAAGT TTTAACGGAA CAGTAATCCA GAATATAGTT GAAAAAGAAG AAGAAAAAGA ACCAACAAAA GAAGTTGAAG AATCAAAAGA AGAAGAAAAA GAACCAACAA AAGAAGTTGA AGAATCAAAA GAGGAAGAAA AAGAACCAAC AAAAGAAGTT GAAGAATCAA AAGAGGAAGT AAAAGAACCA ACAAAAGAAG TTGAAGAATC AAAAGAGGAA GTAAAAGAAC CAACAAAAGA AGTTGAAGAA TCAAAAGAAG AAGTAGCACA AGAAATCGAA AAATCGAAAG AAGAAATCAA TCAATCGGCA CCAGTTCAAG AACAAAACGT GAATAATCAA GTTGTGAAAG AAAAAGTAGT AGAGAATCAA AGCATGAAAG AAAATAAACC AGTTGTTAAT AAAGAAGAAG AAAGTAAGAA ATCGCTAGGA GCAACAGGTG GACAAGCGAA TACATCAACG TTACTTTCGG GTGTAGCATT AGTTCTTTCA GCACTGAGTA TGTTTGTATT TAGAAAGAGA TTATTTAAGA AATAA
|
Protein sequence | MITFRIIYFG GDVRLKQNKR KCINAMVIAA ALSLPFAVYS TPALAAVAIE ANKTGHVLED GTYDAVIKAY KDKTNEESMA AVYIKNPKLT IENGKKIVTA TLSDSDFFQY LKTEDIHTPG VFHDVKVISE DKKKNGTKVI QFEVGELGKR YNMRMHIYIP TMAYDNKYQV QFEVNTLNLD KDVPEEQKEN KEDKLDQQDA NVIIDKQLQR HINKYNLNRE NLNAPITKED LLKVKSLIVV EAKSKGIKDV SGLEYMTNLE NLTLEEVKLE NIKFISNLRQ LKSVSITYAE LEDIGPLAEL EHIESLSLRN NKISNLSPLS QMKKIKLLDL NSNYIKDIKP LFTVKSLRTL TVANNQISNA GLEGVHQLKN LKTFEISNNG LSNVEHINGM NKLIELGLSK NELVDLTPLS KLSGLQKLNL EENFISDITP LSQLTSLYDL KLGSNEIRDV RPVQELGKRM YIDIQRQKIF LEDVEKDKEV KIPIYNLQGE PIDTIQLNSE DGIVNNGSVK WGTTGEKTYE FMLDIKPEEN RIKFNGTVIQ NVVERLDEIK EDNEQKESVI LDKTLQQHIN KENLGRENLN APITKEDLLQ IKKLEILKEK EKEKGKEIKD ITGLEYMTNL EKLTLEGVGL KNLEFISNLE KLNDVNVSHN QIEDITPLSS LKSLQWLNLA DNQIKDVSVL SPMLDLLSLK LAENEIRDVR PLIQLGQWFS IDAGRQKIFL DEAKVNEEIQ VPVYDLEGEI IENIKLTSED GTFNNGVIKW STPGEKVYKF DLDSDEISIS FNGTVIQNIV EKEEEKEPTK EVEESKEEEK EPTKEVEESK EEEKEPTKEV EESKEEVKEP TKEVEESKEE VKEPTKEVEE SKEEVAQEIE KSKEEINQSA PVQEQNVNNQ VVKEKVVENQ SMKENKPVVN KEEESKKSLG ATGGQANTST LLSGVALVLS ALSMFVFRKR LFKK
|
| |
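The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch of how a GC percentage like the one in the GC content row could be computed from such text. The helper names are hypothetical, and the demo input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence above, as a short demo input.
demo = "ATGATTACAT TTCGAATTAT ATATTTCGGA"
print(len(clean_sequence(demo)))   # 30
print(round(gc_percent(demo), 1))  # 23.3
```

Passing the full gene sequence text to `gc_percent` in the same way would give a value to compare against the reported GC content.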