Gene BcerKBAB4_3229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcerKBAB4_3229 
Symbol 
ID5843441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus weihenstephanensis KBAB4 
KingdomBacteria 
Replicon accessionNC_010184 
Strand
Start bp3276329 
End bp3279244 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content36% 
IMG OID641378355 
Productcollagenase 
Protein accessionYP_001646033 
Protein GI163941149 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGCT ATTCAAAAAA AGTTTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG 
AGTTTTCAAG GAAGCATATT GGCGGAAGAT ACTAAGGGAG AGCAAGTTTC ATATCGAAAT
GTGCTAAAAA TGGAGCCGGT TGGTGTACAA CTGCCCGTGG AAGAATTAGC TCATTCATCG
AAAGTATTAG AAAGCAAGTC TTTTGAGAAA AGGCTACAAT TTGCTGATTT ATCGCAAAGA
CCGCCTGAAG TAAAAAAGGA AAGTAAGCAA TTAGCTGTAG CGAAAACTTA TACAATTGCT
GAATTAAATC AATTAAGCAA TCAACAATTA GTAGATTTAC TTGTAACAAT CGATTGGGAG
CAAATTACGG GGCTATTTCA GTTTAATAAG GACAGTCTTG CATTCTATCA AAATGATAGT
AGGATGCAGG CAATTATTGA TAAATTGAAG CAGCAAGGAC AAGCTTATAC GAAGGATGAT
TCAAAAGGGG TTGAGACATT AGTAGAGGTA TTGCGCTCTG GATTTTATTT AGGTTTTTAT
AATGCAGAAT TAAGTAAACT TAATGAGCGG AGCTATCATG ACAAGTGCTT GCCGGCGTTA
AAAACGATTG CGAACAATCC AAATTTCAAG CTCGGTACAT TGGAACAAAA TAGAGTTGTA
TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGGT AACATCGGCT
GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT TGGTAGATAA TCTTTCAGCT
GGAAATGCGA TTTACGATAT TATGCAAGGT GTCGATTATG ATATTCAATC GTATTTGTAC
GATACGAGAA AAGTACCGAA AGATACAGTA TGGTACCAAA AAATAGATAG TTATATTAAT
GAGTTAAGTA GATTTGCGTT AATGGGAACG ATTACAGAGA AAAACGGATG GCTTATTAAT
AATGGCATTT ATTATACAGG GAGACTCGGC ATATTCCATA GTACAGGGAC GAAAGGATTA
CAAGTTGTAA CAGATGCGAT GAAAATCTAT CCTTATTTAG GTGAGCAATA TTTCGTAGCA
GCTGAGCAAA TTACGACGAA TTATGGCGGG AAAGATGCAA ACGGTAACGT TGTTAATTTA
GATCAAATAC GAGAAGATGG GAAGAAAAAG TATTTGCCGA AAACGTATAC GTTTGATGAT
GGTGCAATTG TTTTAAAAGC CGGAGATAAA GTGACTGAGG AGAAAGTTAA ACGTTTATAT
TGGGCAGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA CCAACCGTTA
GAAAAAGGAA ATCCAGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAC
CAATTTAACC GTCAATTGTA CGGATATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA
ACAGGTACGT TCTTTACTTA TGAACGTACG CCACAAGAAA GTATTTATAG TTTAGAAGAA
TTGTTCCGAC ATGAGTTCAC GCATTATTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG
GGGCAAGGTA AGATGTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG GAATGCTGAG
TTTTTTGCAG GTGCAACGAG AACGGACAAC GTCGTTCCAA GAAAGAGCAT TATAGGAGGA
CTATCTTCGA ATCCAGCAGA ACGTTATACG GCAGAGCGAA CGTTAAATGC AAAGTATGGA
ACGTGGGATT TCTATAATTA TTCCTTTGCT TTACAATCGT ATATGTACAA TAAGAGATAC
GATATGTTTG ATAAAGTTCA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA
TATCGCTCTG CTTTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA
ATGTTAGTAG ACAACCGTGA GAAATATAAT GTTCCGTTAG TGTCAGATGA TTATTTAGCA
ACTCATGCAC CGAAACCAGT TTCAGATATT GCCGCAGAAA TTACAGCGGA AGCAAAATTA
AGTAATGTAT CAGTAAAGAA AAATAAATCA CAATTCTTTA ATACATTTAC ACTGCAAGGA
ACATATACAG GAACTTCTGC AAAAGGAGAA TATGAAGACT GGAAAACAAT TACGCAAAAC
GTTAATGATA CGTTAAAACG TTTAAGTGCA AAAGAATGGA CAGGCTATAA AACAGTAACA
GCTTACTTCG TGAATTACCG TGTAAATGCA TCAGGGCAAT TTGAATATGA TGTTGTGTTC
CACGGTATTA ATACAGAAGA AGGTGCTGTG AATAAAGCGC CGGTCGCGGT TATAAATGGT
CCGTATAGCG GACATACGAA TGAAGCAATC TCGTTTAAAA GCGATGGATC AAAAGATGAA
GATGGGAAAA TTGCTTCTTA TAACTGGGAG TTCGGTGATG GTGCGGTAAG TAATGAGCAA
AATCCAACTC ACGTATATAC AAAAGAAGGA ACGTATACGG CGAAATTAAC AGTAACAGAC
GATAAAGGAT TAACTAATAC TGCTACAACA AGTGTAACGG TTCAAAAGAA AGAAGATAAC
AGTGTAGAAA AAGAGCCAAA TAATTCATTC CAAACGGCAA ATAAACTGCA GTTAAATCAA
GTTTTACGTG CTAGTTTAGG AAACGGTGAT ACGAGTGATT ACTTTGAAAT AAATATAGAA
ACTGCGAAAA ATCTTCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT
CTTTATTCAG AAGCAGATTT AAATAATTAT GTTACGTATG CACAGCAACA AGGAAATAAA
TTAGTAGGTA GTTACTATAC GTATCCAGGG AAGTACTATT TACATGTATA CCAGTATGGT
GGTGGAACAG GGAATTATAC GGTAGAAGTG AAGTGA
 
Protein sequence
MKGYSKKVLV GVSFASLMLG SFQGSILAED TKGEQVSYRN VLKMEPVGVQ LPVEELAHSS 
KVLESKSFEK RLQFADLSQR PPEVKKESKQ LAVAKTYTIA ELNQLSNQQL VDLLVTIDWE
QITGLFQFNK DSLAFYQNDS RMQAIIDKLK QQGQAYTKDD SKGVETLVEV LRSGFYLGFY
NAELSKLNER SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETVTSA
AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKVPKDTV WYQKIDSYIN
ELSRFALMGT ITEKNGWLIN NGIYYTGRLG IFHSTGTKGL QVVTDAMKIY PYLGEQYFVA
AEQITTNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GAIVLKAGDK VTEEKVKRLY
WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG
TGTFFTYERT PQESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKMYENER LSWFEEGNAE
FFAGATRTDN VVPRKSIIGG LSSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY
DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNREKYN VPLVSDDYLA
THAPKPVSDI AAEITAEAKL SNVSVKKNKS QFFNTFTLQG TYTGTSAKGE YEDWKTITQN
VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAV NKAPVAVING
PYSGHTNEAI SFKSDGSKDE DGKIASYNWE FGDGAVSNEQ NPTHVYTKEG TYTAKLTVTD
DKGLTNTATT SVTVQKKEDN SVEKEPNNSF QTANKLQLNQ VLRASLGNGD TSDYFEINIE
TAKNLQINVT KENNIGVNWV LYSEADLNNY VTYAQQQGNK LVGSYYTYPG KYYLHVYQYG
GGTGNYTVEV K