Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BcerKBAB4_3229 |
Symbol | |
ID | 5843441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus weihenstephanensis KBAB4 |
Kingdom | Bacteria |
Replicon accession | NC_010184 |
Strand | - |
Start bp | 3276329 |
End bp | 3279244 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641378355 |
Product | collagenase |
Protein accession | YP_001646033 |
Protein GI | 163941149 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AGTTTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG AGTTTTCAAG GAAGCATATT GGCGGAAGAT ACTAAGGGAG AGCAAGTTTC ATATCGAAAT GTGCTAAAAA TGGAGCCGGT TGGTGTACAA CTGCCCGTGG AAGAATTAGC TCATTCATCG AAAGTATTAG AAAGCAAGTC TTTTGAGAAA AGGCTACAAT TTGCTGATTT ATCGCAAAGA CCGCCTGAAG TAAAAAAGGA AAGTAAGCAA TTAGCTGTAG CGAAAACTTA TACAATTGCT GAATTAAATC AATTAAGCAA TCAACAATTA GTAGATTTAC TTGTAACAAT CGATTGGGAG CAAATTACGG GGCTATTTCA GTTTAATAAG GACAGTCTTG CATTCTATCA AAATGATAGT AGGATGCAGG CAATTATTGA TAAATTGAAG CAGCAAGGAC AAGCTTATAC GAAGGATGAT TCAAAAGGGG TTGAGACATT AGTAGAGGTA TTGCGCTCTG GATTTTATTT AGGTTTTTAT AATGCAGAAT TAAGTAAACT TAATGAGCGG AGCTATCATG ACAAGTGCTT GCCGGCGTTA AAAACGATTG CGAACAATCC AAATTTCAAG CTCGGTACAT TGGAACAAAA TAGAGTTGTA TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGGT AACATCGGCT GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT TGGTAGATAA TCTTTCAGCT GGAAATGCGA TTTACGATAT TATGCAAGGT GTCGATTATG ATATTCAATC GTATTTGTAC GATACGAGAA AAGTACCGAA AGATACAGTA TGGTACCAAA AAATAGATAG TTATATTAAT GAGTTAAGTA GATTTGCGTT AATGGGAACG ATTACAGAGA AAAACGGATG GCTTATTAAT AATGGCATTT ATTATACAGG GAGACTCGGC ATATTCCATA GTACAGGGAC GAAAGGATTA CAAGTTGTAA CAGATGCGAT GAAAATCTAT CCTTATTTAG GTGAGCAATA TTTCGTAGCA GCTGAGCAAA TTACGACGAA TTATGGCGGG AAAGATGCAA ACGGTAACGT TGTTAATTTA GATCAAATAC GAGAAGATGG GAAGAAAAAG TATTTGCCGA AAACGTATAC GTTTGATGAT GGTGCAATTG TTTTAAAAGC CGGAGATAAA GTGACTGAGG AGAAAGTTAA ACGTTTATAT TGGGCAGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA CCAACCGTTA GAAAAAGGAA ATCCAGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAC CAATTTAACC GTCAATTGTA CGGATATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAACGTACG CCACAAGAAA GTATTTATAG TTTAGAAGAA TTGTTCCGAC ATGAGTTCAC GCATTATTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG GGGCAAGGTA AGATGTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG GAATGCTGAG TTTTTTGCAG GTGCAACGAG AACGGACAAC GTCGTTCCAA GAAAGAGCAT TATAGGAGGA CTATCTTCGA ATCCAGCAGA ACGTTATACG GCAGAGCGAA CGTTAAATGC AAAGTATGGA ACGTGGGATT TCTATAATTA TTCCTTTGCT TTACAATCGT ATATGTACAA TAAGAGATAC GATATGTTTG ATAAAGTTCA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA TATCGCTCTG CTTTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA ATGTTAGTAG ACAACCGTGA GAAATATAAT GTTCCGTTAG TGTCAGATGA TTATTTAGCA ACTCATGCAC CGAAACCAGT TTCAGATATT GCCGCAGAAA TTACAGCGGA AGCAAAATTA AGTAATGTAT CAGTAAAGAA AAATAAATCA CAATTCTTTA ATACATTTAC ACTGCAAGGA ACATATACAG GAACTTCTGC AAAAGGAGAA TATGAAGACT GGAAAACAAT TACGCAAAAC GTTAATGATA CGTTAAAACG TTTAAGTGCA AAAGAATGGA CAGGCTATAA AACAGTAACA GCTTACTTCG TGAATTACCG TGTAAATGCA TCAGGGCAAT TTGAATATGA TGTTGTGTTC CACGGTATTA ATACAGAAGA AGGTGCTGTG AATAAAGCGC CGGTCGCGGT TATAAATGGT CCGTATAGCG GACATACGAA TGAAGCAATC TCGTTTAAAA GCGATGGATC AAAAGATGAA GATGGGAAAA TTGCTTCTTA TAACTGGGAG TTCGGTGATG GTGCGGTAAG TAATGAGCAA AATCCAACTC ACGTATATAC AAAAGAAGGA ACGTATACGG CGAAATTAAC AGTAACAGAC GATAAAGGAT TAACTAATAC TGCTACAACA AGTGTAACGG TTCAAAAGAA AGAAGATAAC AGTGTAGAAA AAGAGCCAAA TAATTCATTC CAAACGGCAA ATAAACTGCA GTTAAATCAA GTTTTACGTG CTAGTTTAGG AAACGGTGAT ACGAGTGATT ACTTTGAAAT AAATATAGAA ACTGCGAAAA ATCTTCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT CTTTATTCAG AAGCAGATTT AAATAATTAT GTTACGTATG CACAGCAACA AGGAAATAAA TTAGTAGGTA GTTACTATAC GTATCCAGGG AAGTACTATT TACATGTATA CCAGTATGGT GGTGGAACAG GGAATTATAC GGTAGAAGTG AAGTGA
|
Protein sequence | MKGYSKKVLV GVSFASLMLG SFQGSILAED TKGEQVSYRN VLKMEPVGVQ LPVEELAHSS KVLESKSFEK RLQFADLSQR PPEVKKESKQ LAVAKTYTIA ELNQLSNQQL VDLLVTIDWE QITGLFQFNK DSLAFYQNDS RMQAIIDKLK QQGQAYTKDD SKGVETLVEV LRSGFYLGFY NAELSKLNER SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETVTSA AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKVPKDTV WYQKIDSYIN ELSRFALMGT ITEKNGWLIN NGIYYTGRLG IFHSTGTKGL QVVTDAMKIY PYLGEQYFVA AEQITTNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GAIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PQESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKMYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG LSSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNREKYN VPLVSDDYLA THAPKPVSDI AAEITAEAKL SNVSVKKNKS QFFNTFTLQG TYTGTSAKGE YEDWKTITQN VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAV NKAPVAVING PYSGHTNEAI SFKSDGSKDE DGKIASYNWE FGDGAVSNEQ NPTHVYTKEG TYTAKLTVTD DKGLTNTATT SVTVQKKEDN SVEKEPNNSF QTANKLQLNQ VLRASLGNGD TSDYFEINIE TAKNLQINVT KENNIGVNWV LYSEADLNNY VTYAQQQGNK LVGSYYTYPG KYYLHVYQYG GGTGNYTVEV K
|
| |