Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BT9727_4281 |
Symbol | |
ID | 2853689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | - |
Start bp | 4343127 |
End bp | 4345988 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637515696 |
Product | cell surface protein |
Protein accession | YP_038596 |
Protein GI | 49481625 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR03063] sortase B cell surface sorting signal [TIGR03656] heme uptake protein IsdC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.533345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACAGGT ATCTTAAGAT TGTTGTTGCT ATGTTTTTAA TGATATTTAC ATTCGTATCA ACACTACAAC CACTTGCAGT TCAAGCAGCT ACTCAATTAG CTGACGGTGA ATACTCAATC GGTTTTAAAG TTCTTAAAGA CGCATCGGAT GAAGTATCCA TGATGAATGA ATACTCTGTA AGTCCAGGAA CTTTAAAAGT GAAGGATGGG AAAAAGAAAG TGTCCTTTAC ATTAAAAAAT AGTTCATGGA TTACGAAATT TGAAACAGAC AAAGCAGGTC AACTTGTTGA GACAAATGTA ATTAGTGAAG ATAAAGAAAA AGATACAAGA GTAGTAGAAT TCGATGTGGA AGATGTAGAG AAGATATTAA AAGCGAAAGT AAAAGTAGAT ATTGATTTTC TGAACTATCA TCATGAATAT GATGTTCGTA TTGCATTTGA TCAAAATAGC ATTACACCAA TTCATGTAGA AAAACCAGAT GAAAAAGAGG ACCCAGCTAA TAAGCCAGAT CCAAATGAAA CTACGGATCC AGGTCAGAAG CCCGACCAAA AGCCTGACCC AGATCAACAA CCAAATTCTA ACACAATTGA AGATGGTGCG TATAGCATTC CTTTCAAAGT GTTAAAAGAT AAAACAGATG AAGAATCTAA AATGAATAGT TACATGGAAA ATCCAGGAGT ATTGAAAGTA GAAAATGGTA AGAAAAAAGC GGTTGTAACG TTAAAAAGTA GCTCATTAAT TAAAAATTTC CAAACGGAAA AAGATGGTGC ATTTGTTGAT GCAAAAGTAG TGAGTGAAGA TAAAGAAAAA GATACAAGAG TAGTAGAGTT TGAAATAGCT GATTTATCGA AAAAACTTAA TACAAAAGTA TTTATTGAGA TGGTATCAAG AAATTATAAA CAAACGCATG ACGTACAACT TGTATTTGAA CAAGAAAAAT TGGAACCTAT TAAAAGTGAA GACAAACAAC CAGACGGAGA TAAACAACCA GACGGAGGCA AACAACCAGA TGGAGACAAA CAACCAGACG GAGACAAACA ACCAGACGGA GACAAGCAAC CAGACGGAGA CAAACAACCA GACGGAGACA AACAACCAGA TGGAGACAAA CAACCAGACG GAGACAAGCA ACCAGACGTA GATACCATTA AAGATGGTGA ATACAGTATT GGTTTTAAAG TATTGAAAGA TAAAACAGAA GAAATTTCAA TGATGAATAC GTACACGAAG AGTCCAGGTG TACTAAAAGT GAAAGATGGA AAGAAATATG TATCCTTCAC ATTAACGAAT AGCTCATGGA TTACAAAGTT CGGATTTGAA AAGAATAATT CATTTGTTGA TGCAAGTGTA TTAAGTGAAG ATAAGAAAGC TGATACACGT GTAGTAGAAG TGGAAGTAGC TAATTTATCT AAGAAACTAA ATGCAAAAGT GAAAGTAGAT ATTGATTCAA TGAATTATCA CCATTTCTAT GATATTCAAT TTGCATTTGA TAATGATAGT ATTCAACCGT TAGACAACCA AGGCGAAAAT GACAACCAAG GTGGAAACGA CAACCAAGGT GGAAACAACA ATCAAGGCGA AAATGACAAC CAAGGTGGAA ACGACAACCA AGGTGGAAAT AACGACCAAG ATGGAAACAA TAACCAAGGC GGAAACGACA GCCAAGACGA TAACACAGCG ATTGATCCAA ACGCTCTTAA AGACGGTGAA TACAGTATCG GTTTTAAAGT GTTAAAAGAT AAAACAGAAG AAATTTCAAT GATGAACACA TATACGAAGA ATCCAGGTGT ATTAAAAGTG AAAGATGGAA AGAAATATGT ATCCTTCACA TTAACAAATA GCTCATGGAT TACGAAGTTT GAGTTTGAAA AGAATGGTGC GTTCGTCGAT GCGCAAGTAT TAGGTACAAA TAAAGAGAAA GATACAAGAG TAGTAGAAGT GGAAGTAGAG GATTTATCGA AAAAGTTAAA TGCAAAAGTG AAGGTAGATA TCGATGCAAT GAATTATCAT CATTTCTATG ATATTCAATT TGCATTCGAT AAAGGAAGTA TTAAAGCTTT AGGTAACCAA GGTGGAGATA CTAACCAAGA TGGTAATGGT AATCAGGTCG GAAGCGACAA CCAAGGCGGA AGCGACAACC AAGGCGGAAA CGACAACCAA GGTGGAAGTA ACAACCAAGA TGGAACGAAT AATCTAAATG AAAACCCAAC AGTTGATCCG AAAAATTTAA AAGATGGTCA GTATGATATT GCCTTTAAAG TGTTAAAAGA TAAGACAGAA GAAATTTCAA TGATGAATCA ATATGTTGTA AGTCCAGCAA GATTAACAGT GAAAGATGGC AAGAAGTATG TTGCAATGAC ACTGAAAAAT AGTGCGTGGA TTACGAAATT CCAAACAGAA AATAATAGCC TTTTTGCTGA TGCGAAAGTA GTAAGCGAAG ATAAAAAGGC AAATACGAGA GTAGTGCAAT TTGAAGTAAG TGATTTATTT GCAAAATTAA ATGCAAAAGT AAAAGTTGAT ATTGATGAAA TGAACTACCA TCATTTCTAC GATGTCCAAA TTCAATTTGA TACGACGAAG ATTGGCGCTG TAGGAACGGT AAAAGAAGAG CCAAAGAATG ATCCAAAGAA TGAACCGAAA AACCCAGTAA CTACACCAAA AGTAGATAAT GTAAAAACAG TAGGAACTCC TGATTTTAAC CGAAATGCAG ATGGTAAAAA GAAAAACGAA GCTACAAATA ATGATTCGAA GAAAGAGAAA AACTCAAAAA CTGCAGATAC AGCACAACTT GGTTTATACA TGGTGTTACT GCTAGGTTCA CTTGCTTTAC TAGTTCGTAA ATATAGAGCA GGTAGATTGT AA
|
Protein sequence | MNRYLKIVVA MFLMIFTFVS TLQPLAVQAA TQLADGEYSI GFKVLKDASD EVSMMNEYSV SPGTLKVKDG KKKVSFTLKN SSWITKFETD KAGQLVETNV ISEDKEKDTR VVEFDVEDVE KILKAKVKVD IDFLNYHHEY DVRIAFDQNS ITPIHVEKPD EKEDPANKPD PNETTDPGQK PDQKPDPDQQ PNSNTIEDGA YSIPFKVLKD KTDEESKMNS YMENPGVLKV ENGKKKAVVT LKSSSLIKNF QTEKDGAFVD AKVVSEDKEK DTRVVEFEIA DLSKKLNTKV FIEMVSRNYK QTHDVQLVFE QEKLEPIKSE DKQPDGDKQP DGGKQPDGDK QPDGDKQPDG DKQPDGDKQP DGDKQPDGDK QPDGDKQPDV DTIKDGEYSI GFKVLKDKTE EISMMNTYTK SPGVLKVKDG KKYVSFTLTN SSWITKFGFE KNNSFVDASV LSEDKKADTR VVEVEVANLS KKLNAKVKVD IDSMNYHHFY DIQFAFDNDS IQPLDNQGEN DNQGGNDNQG GNNNQGENDN QGGNDNQGGN NDQDGNNNQG GNDSQDDNTA IDPNALKDGE YSIGFKVLKD KTEEISMMNT YTKNPGVLKV KDGKKYVSFT LTNSSWITKF EFEKNGAFVD AQVLGTNKEK DTRVVEVEVE DLSKKLNAKV KVDIDAMNYH HFYDIQFAFD KGSIKALGNQ GGDTNQDGNG NQVGSDNQGG SDNQGGNDNQ GGSNNQDGTN NLNENPTVDP KNLKDGQYDI AFKVLKDKTE EISMMNQYVV SPARLTVKDG KKYVAMTLKN SAWITKFQTE NNSLFADAKV VSEDKKANTR VVQFEVSDLF AKLNAKVKVD IDEMNYHHFY DVQIQFDTTK IGAVGTVKEE PKNDPKNEPK NPVTTPKVDN VKTVGTPDFN RNADGKKKNE ATNNDSKKEK NSKTADTAQL GLYMVLLLGS LALLVRKYRA GRL
|
| |