Gene Information |
Locus tag | BT9727_3286 |
Symbol | colA |
ID | 2854181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | - |
Start bp | 3355888 |
End bp | 3358803 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637514706 |
Product | collagenase |
Protein accession | YP_037608 |
Protein GI | 49478266 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
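The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon of the CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3355888
end_bp = 3358803
gene_length_bp = 2916
protein_length_aa = 971

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS consists of whole codons; the last codon is assumed to be a stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # prints: 2916 972 0 971
```

Under these assumptions the span matches the listed gene length, and 972 codons minus one stop codon gives the listed protein length of 971 aa.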
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.238892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG GGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGCTTC ATATCGGAAT GTGCTCAAAA TGGAACCAGT TGGTGTACAA TTACCAGTAC AAGAATTAGC TCATTCATCA AAAGTACTTG AAAATAAGTC TTTTGAGAAG AGGTTACAAT TTGCTGATTT GTCGCAAAGA CCGCCTGAAT TGAAAAAGGA GAGTAAGCAA TTAGCTACAG CAAAAACTTA TACAATTGCT GAGTTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTGTAACAAT TGATTGGGAG CAAATTACTG GGCTATTTCA GTTTAATAAG GATAGTCTTG CATTCTATCA AAATGATAGT AGAATGCAGG CAATTATTGA TAAATTGAAC CAGCAAGGAC AAGCGTATAC GAAAGATGAT TCAAAAGGGA TTGAGACTTT AGTAGAGGTA TTACGATCTG GTTTTTATTT AGGATTTTAT CATACAGAAT TAAGTAAACT AAATGAGCGA AGCTATCATG ATAAATGCTT ACCTGCATTA AAAACGATTG CGAATAACCC GAATTTCAAA CTAGGTACGT TAGAACAAAA TAGAGTTGTA TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT TGGTAGATAA TCTTTCAGCT GGAAATGCGA TTTACGATAT TATGCAAGGC GTTGACTACG ATATTCAATC GTATTTGTAC GATACGAGAA AAGCACCGAA AGATACAGTA TGGTATCAAA AAATTGATAG CTATATTAAT GAATTAAGTC GTTTTGCTTT AATTGGAACG GTGACAGAGA AGAATGGTTG GCTTATTAAT AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGGAC GAAAGGGTTG CAAGTTGTAA CAGATGCCAT GAAAATGTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCG GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCAA ATGGAAACGT TGTGAATTTA GATCAAATAC GAGAAGATGG TAAGAAGAAA TATTTACCGA AAACATATAC ATTTGACGAT GGGACAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTCTATAT TGGGCGGCAA AAGAAGTGAA GGCTCAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA GAAAAAGGGA ATGCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAT CAATTTAACC GTCAATTGTA CGGGTATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAGGAA TTGTTCCGGC ACGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG GGACAAGGTA AGATTTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG CAATGCAGAG TTTTTTGCAG GTGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA ATATCTTCAA ATCCGGCAGA ACGTTATACG GCAGAGAGAA CGTTAAATGC AAAGTACGGA ACATGGGATT TTTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT 
GATATGTTTG ACAAAGTTCA TGATCTTATT AGAAAAAATG ATGTAACAGC ATATGATGCA TATCGCTCTG CATTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA ATGTTAGTCG ACAATCGTGA TAAATATAAT GTTCCATTAG TATCAGATGA TTATTTAGCA ACTCACGCAC CGAAACCAGT CTCAGATATT GTGGCAGAAA TTACGGCAGA AGCGAAATTA AGTAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTC ATACATTTAC ACTGCAAGGA ACATATACAG GTACGACTGC AAAAGGAGAA TATGAAGACT GGAAATCAAT TACACAAAAC GTAAATGATA CGTTAAAACG TTTAAGTGCA AAAGAATGGA CAGGCTATAA AACAGTAACA GCTTATTTCG TAAATTACCG TGTGAATGCA TCAGGACAAT TTGAATATGA CGTTGTATTC CATGGTATTA ATACAGAAGA AGGCGCTGTG AATAAAGCAC CAGTTGCGGT TATAAATGGT CCCTATAGTG GGAATGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA GATGGAAAAA TTGTTGCTTA TAAATGGGAG TTTGGTGATG GTACTGTAAG CAATGAACAA AATCCAACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAGATTAAC AGTAACAGAT GATAAAGGGT TAACGAATAC TGTTACAACG AATGTAACAG TTCAAAAGAA AGAAGATAAC AGTGTAGAAA AAGAACCAAA CAATTCATTC CAGACAGCAA ATACACTGCA ATTCAATCAA GTTTTACGCG CAAGTTTAGG AAATGGTGAT ACGAGTGATT TCTTTGAAAT AAATGTGGAA ACGGCGAAAA ATCTGCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT CTTTATTCGG AAGCAGATTT AAATAACTAT ATTACGTATG CCCAGCAAGA GGGGAATAAG TTAGTAGGAA GTTACTACAC GTATCCAGGT AAGTATTATT TACATGTGTA TCAGTATGGT GGTGGATTTG GGAATTATAC GGTAGAAGTG AAGTAG
|
Protein sequence | MKGYSKKVLV GVSFASLMLG GFQGGALAEG TKGEQASYRN VLKMEPVGVQ LPVQELAHSS KVLENKSFEK RLQFADLSQR PPELKKESKQ LATAKTYTIA ELNQLSNQQL VDLLVTIDWE QITGLFQFNK DSLAFYQNDS RMQAIIDKLN QQGQAYTKDD SKGIETLVEV LRSGFYLGFY HTELSKLNER SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN ELSRFALIGT VTEKNGWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA AEQIATNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GTIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNADDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNRDKYN VPLVSDDYLA THAPKPVSDI VAEITAEAKL SNVSVKKNKS QFFHTFTLQG TYTGTTAKGE YEDWKSITQN VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAV NKAPVAVING PYSGNVNEAI SFKSDGSKDE DGKIVAYKWE FGDGTVSNEQ NPTHVYTKEG TYTARLTVTD DKGLTNTVTT NVTVQKKEDN SVEKEPNNSF QTANTLQFNQ VLRASLGNGD TSDFFEINVE TAKNLQINVT KENNIGVNWV LYSEADLNNY ITYAQQEGNK LVGSYYTYPG KYYLHVYQYG GGFGNYTVEV K
|
| |
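The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below builds the codon table from the compact NCBI-style ordering (bases in the order T, C, A, G). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted; start-codon handling is not modelled here. The function name `translate` and the two excerpts are illustrative choices: the excerpts are the first 30 and last 12 bases of the gene sequence above.

```python
from itertools import product

# Codon table in NCBI ordering (first, second, third base each cycle through T, C, A, G).
# Amino acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
first_30 = "ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA"
last_12 = "GAAGTG AAGTAG"

print(translate(first_30))  # prints: MKGYSKKVLV
print(translate(last_12))   # prints: EVK
```

The first excerpt reproduces the opening ten residues of the listed protein sequence, and the second reproduces its final three residues followed by the TAG stop codon.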