Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BT9727_0466 |
Symbol | colA |
ID | 2856845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | + |
Start bp | 546992 |
End bp | 549889 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637511887 |
Product | microbial collagenase |
Protein accession | YP_034814 |
Protein GI | 49476834 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AATCAAAGAT TAATAAAGTT ATGCTTAGCA TTAGTACAAT GGCTCTATCA TTAGGGGCGC TTCAAGCTCC TGTATCAGCG GAAGAAAAAG TACCGTATAA TGTGTTGAAA ACGAAACCAG TTGGAATTGA AAAACCAGTA GATGAGATTG GACACATTTC TAAAGCGGAG GAAACATTAT CGTTTCAAGA ACGTTTAAAA GTAGGAGACT TTTCACAGCG ACCAGCATCT ATTACGAATA AAGCGACAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC AAAATGAATG ATCAAGAATT AGTTGAAACG TTAGGTAGTA TTAAGTGGCA CCAAATTACA GACTTATTCC AGTTTAATGA AGATGCAAAA GCTTTTTATA AAGATAAAAG GAAAATGCAA GTCGTTATAG ATGAATTAGC TCATCGGGGT AGTACATTTA CGAAAGATGA TTCAAAAGGA ATTCAAACGT TTACGGAAGT GTTACGTTCA GCTTTTTATC TGGCATTTTA TAATAACGAG TTAAGTGAAT TAAATGAAAG AAGCTTCCAG GACAAATGTT TACCTGCTTT AAAAGCAATC GCAAAAAATC CAAACTTTAA GCTTGGTACA GCTGAACAAG ATACAGTCGT ATCTGCATAC GGTAAATTAA TTAGTAATGC GTCAAGCGAT GTTGAAACAG TTCAATACGC ATCGAATATT TTAAAGCAAT ACAATGATAA TTTTACTACG TATGTAAATG ATCGAATGAA GGGACAAGCA ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CTTACTTAAT TGAAGCTCGT AAAGAAGCAA ATGAAACGAT GTGGTACGGG AAAGTAGATG GGTTTATTAA TGAAATAAAT CGTATTGCTC TTTTAAATGA AGTAACACAA GAAAATAAGT GGCTCGTTAA TAATGGAATT TACTTTGCAA GTCGTTTAGG GAAGTTTCAC AGTAATCCAA ATAAAGGTTT AGAAGTTGTT ACACAAGCAA TGCATATGTA TCCGCGCTTA AGTGAACCGT ACTTTGTCGC AGTAGAACAA ATTACAACAA ATTATAACGG AAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTTGATGA TGGATCTATC GTATTTAAAA CAGGAGATAA AGTATCGGAA GAAAAAATTA AGAGACTATA CTGGGCTGCA AAGGAAGTAA AAGCACAGTA TCATCGTGTA ATTGGGAATG ACAAAGCGTT AGAGCCGGGC AATGCAGATG ATATATTAAC GATAGTAATT TATAACAGCC CAGAAGAGTA CCAGTTAAAT AGACAACTGT ATGGATACGA AACAAATAAC GGTGGAATTT ATATTGAAGA AACAGGAACA TTCTTTACAT ATGAACGCAC GCCAGAACAA AGTATTTATA GTTTAGAAGA ATTATTCCGC CATGAGTTTA CCCATTATCT TCAAGGGAGA TATGAAGTGC CAGGTTTATT CGGAAGAGGA GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GAAATGCAGA GTTTTTCGCA GGGTCTACTC GAACAAATAA TGTAGTACCG AGAAAGAGCA TCATTAGTGG ATTATCATCT GATCCTGCAA GCCGTTATAC AGCAGAGCGC ACATTATTTG CTAAATATGG CTCTTGGGAT TTCTATAATT ACTCGTTCGC ATTGCAATCT TACTTATATA CACATCAATT TGAAACATTT GATAAAATTC AAGATTTAAT TCGTGCAAAT GACGTGAAAA ATTATGATGC CTATCGTGAG AATTTAAGTA AAGACCTTAA ACTAAATGAA GAGTATCAAG AGTATATGCA GCAGCTAATC GATAATCAAG ATAAATATAA TGTACCGGAA GTAGCAGATG ATTATTTAGC TGAACATGCA CCAAAATCAT TAACAGCAGT AGAGAAAGAA ATTACTGAAA CGTTGCCGAT GAAAGATGCA AAAATGACAA AACATAGCTC CCAATTCTTT AATACATTTA CATTAGAAGG CACGTATACA GGTAGTGTAA CAAAAGGTGA GTCAGAAGAT TGGAACGCAA TGAGTAAGAA AGTAAACGAA GCTTTAGAAC AACTGGCGCA AAAAGAATGG AGTGGCTACA AAACTGTTAC AGCATATTTC GTCAATTACC GTATAAATAG TTCAAATCAA TTTGAATATG ATGTAGTCTT CCACGGTATC GCAAAAGATG ACGGAGAAAA TAAAGCTCCA ACAGTTAATA TAAATGGCCC TTATAATGGA CTTGTAAAAG AAGGTATTCA ATTTAAAAGT GATGGCTCAA AAGATGAAGA TGGAAAAATC GTTTCTTATT TATGGGACTT TGGAGATGGA AGCACAAGTG CAGAAGTAAA TCCGGTACAT GTATATGAAA GAGAAGGTTC ATATAAAGTA GCGTTAATAG TAAAAGATGA TAAAGGAAAA GAGAGCAAAA GCGAAACAAC GGTTACGGTT AAAGATGGAA GTTTAACAGA ATTAGAACCA AATAATCGCC CAGAGGAAGC AAATCGTATT GGACTAAACA ATACGATAAA AGGTAGCCTT ATCGGCGGAG ATCACACTGA TGTTTATACA TTTAATGTAG CAACAACGAA AAATATTGAT ATTTCCGTTT TAAATGAATA TGGAATCGGG ATGACATGGG TACTTCACCA TGAATCAGAT ATGCAAAATT ATGCTGCTTA CGGTCAAGCA AATGGAAATC ATATAGAGGC AAACTTTAAT GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGTGA TGGAACATAC GAATTATCAG TAAAATAA
|
Protein sequence | MNKKSKINKV MLSISTMALS LGALQAPVSA EEKVPYNVLK TKPVGIEKPV DEIGHISKAE ETLSFQERLK VGDFSQRPAS ITNKATVKQV KESYSMADLN KMNDQELVET LGSIKWHQIT DLFQFNEDAK AFYKDKRKMQ VVIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE LSELNERSFQ DKCLPALKAI AKNPNFKLGT AEQDTVVSAY GKLISNASSD VETVQYASNI LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLIEAR KEANETMWYG KVDGFINEIN RIALLNEVTQ ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPRL SEPYFVAVEQ ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA KEVKAQYHRV IGNDKALEPG NADDILTIVI YNSPEEYQLN RQLYGYETNN GGIYIEETGT FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA GSTRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF DKIQDLIRAN DVKNYDAYRE NLSKDLKLNE EYQEYMQQLI DNQDKYNVPE VADDYLAEHA PKSLTAVEKE ITETLPMKDA KMTKHSSQFF NTFTLEGTYT GSVTKGESED WNAMSKKVNE ALEQLAQKEW SGYKTVTAYF VNYRINSSNQ FEYDVVFHGI AKDDGENKAP TVNINGPYNG LVKEGIQFKS DGSKDEDGKI VSYLWDFGDG STSAEVNPVH VYEREGSYKV ALIVKDDKGK ESKSETTVTV KDGSLTELEP NNRPEEANRI GLNNTIKGSL IGGDHTDVYT FNVATTKNID ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEANFN AKPGKYYLYV YKYDNGDGTY ELSVK
|
| |