Gene BCAH820_3538 details


Gene Information

Locus tag: BCAH820_3538
Symbol:
ID: 7190198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH820
Kingdom: Bacteria
Replicon accession: NC_011773
Strand:
Start bp: 3367900
End bp: 3370815
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 35%
IMG OID: 643556949
Product: putative microbial collagenase
Protein accession: YP_002452488
Protein GI: 218904654
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
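
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based, inclusive coordinates and that the CDS ends in a stop codon (neither is stated in the record itself); the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3367900
END_BP = 3370815


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "2916 971"
```

Under those assumptions the result agrees with the reported Gene Length (2916 bp) and Protein Length (971 aa).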


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 5.60161e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG 
AGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGCTTC ATATCGGAAT
GTGCTCAAAA TGGAACCAGT TGGTGTACAA TTACCAGTAC AAGAATTAGC TCATTCATCA
AAAGTACTTG AAAATAAGTC TTTTGAGAAG AGGTTACAAT TTGCTGATTT GTCGCAAAGA
CCGCCTGAAT TGAAAAAGGA GAGTAAGCAA TTAGCTACAG CAAAAACTTA TACAATTGCT
GAGTTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTGTAACAAT TGATTGGGAG
CAAATTACTG GGCTATTTCA GTTTAACAAG GATAGTCTTG CCTTCTATCA AAATGATAGT
AGAATGCAGG CAATTATTGA TAAATTGAAC CAGCAAGGAC AAGCGTATAC GAAAGATGAT
TCAAAAGGGA TTGAGACTTT AGTAGAGGTA TTACGATCTG GTTTTTATTT AGGATTTTAT
CATACAGAAT TAAGTAAACT AAATGAGCGA AGCTATCATG ATAAATGCTT ACCTGCACTA
AAAACGATTG CGAATAACCC GAATTTCAAA CTAGGTACGT TAGAACAAAA TAGAGTTGTA
TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT
GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT GGGTAGATAA TCTTTCAGCT
GGAAATGCGA TTTACGATAT TATGCAAGGC GTTGACTACG ATATTCAATC GTATTTGTAC
GATACGAGAA AAGCACCGAA AGATACAGTA TGGTATCAAA AAATTGATAG CTATATTAAT
GAATTAAGTC GTTTTGCTTT AATTGGAACG GTGACAGAGA AGAATGCTTG GCTTATTAAT
AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGGAC GAAAGGGTTG
CAAGTTGTAA CAGATGCCAT GAAAATGTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCG
GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCAA ATGGAAACGT TGTGAATTTA
GATCAAATAC GAGAAGATGG TAAGAAGAAA TATTTACCGA AAACATATAC ATTTGACGAT
GGGACAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTCTATAT
TGGGCGGCAA AAGAAGTGAA GGCTCAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA
GAAAAAGGGA ATGCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAT
CAATTTAACC GTCAATTGTA CGGGTATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA
ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAGGAA
TTGTTCCGGC ACGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG
GGACAAGGTA AGATTTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG CAATGCAGAG
TTTTTTGCAG GTGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA
ATATCTTCAA ATCCGGCAGA ACGTTATACG GCAGAGAGAA CGTTAAATGC AAAGTACGGA
ACATGGGATT TTTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT
GATATGTTTG ACAAAGTTCA TGATCTTATT AGAAAAAATG ATGTAACAGC ATATGATGCA
TATCGCTCTG CATTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA
ATGTTAGTCG ACAATCGTGA TAAATATAAT GTTCCATTAG TATCAGATGA TTATTTAGCA
ACTCACGCAC CGAAACCAGT CTCAGATATT GTGGCAGAAA TTACGGCAGA AGCGAAATTA
AGTAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTA ATACATTTAC ACTGCAAGGA
ACATATACAG GTACGACTGC AAAAGGAGAA TATGAAGACT GGAAATCAAT TACACAAAAC
GTAAATGATA CGTTAAAACG TTTAAGTGTA AAAGAATGGA CAGGCTATAA AACAGTAACA
GCTTATTTCG TAAATTACCG TGTGAATGCA TCAGGACAAT TTGAATATGA CGTTGTATTC
CATGGTATTA ATACAGAAGA AGGCGCTGTG AATAAAGCAC CAGTTGCGGT TATAAATGGT
CCCTATAGTG GGAATGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA
GATGGAAAAA TTGTTGCTTA TAAATGGGAG TTTGGTGATG GTACTGTAAG CAATGAACAA
AATCCAACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAGATTAAC AGTAACAGAT
GATAAAGGGT TAACGAATAC TGTTACAACG AATGTAACAG TTCAAAAGAA AGAAGATAAC
AGTGTAGAAA AAGAACCAAA CAATTCATTC CAGACAGCAA ATACACTGCA ATTCAATCAA
GTTTTACGCG CAAGTTTAGG AAATGGTGAT ACGAGTGATT TCTTTGAAAT AAATGTGGAA
ACGGCGAAAA ATCTGCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT
CTTTATTCGG AAGCAGATTT AAATAACTAT ATTACGTATG CCCAGCAAGA GGGGAATAAG
TTAGTAGGAA GTTACTACAC GTATCCAGGT AAGTATTATT TACATGTGTA TCAGTATGGT
GGTGGATTTG GGAATTATAC GGTAGAAGTG AAGTAG
 
Protein sequence
MKGYSKKVLV GVSFASLMLG SFQGGALAEG TKGEQASYRN VLKMEPVGVQ LPVQELAHSS 
KVLENKSFEK RLQFADLSQR PPELKKESKQ LATAKTYTIA ELNQLSNQQL VDLLVTIDWE
QITGLFQFNK DSLAFYQNDS RMQAIIDKLN QQGQAYTKDD SKGIETLVEV LRSGFYLGFY
HTELSKLNER SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA
AKIFKQYNDN FSTWVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN
ELSRFALIGT VTEKNAWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA
AEQIATNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GTIVLKAGDK VTEEKVKRLY
WAAKEVKAQF HRTVESDQPL EKGNADDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG
TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE
FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY
DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNRDKYN VPLVSDDYLA
THAPKPVSDI VAEITAEAKL SNVSVKKNKS QFFNTFTLQG TYTGTTAKGE YEDWKSITQN
VNDTLKRLSV KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAV NKAPVAVING
PYSGNVNEAI SFKSDGSKDE DGKIVAYKWE FGDGTVSNEQ NPTHVYTKEG TYTARLTVTD
DKGLTNTVTT NVTVQKKEDN SVEKEPNNSF QTANTLQFNQ VLRASLGNGD TSDFFEINVE
TAKNLQINVT KENNIGVNWV LYSEADLNNY ITYAQQEGNK LVGSYYTYPG KYYLHVYQYG
GGFGNYTVEV K
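
The gene and protein sequences above are printed in space-separated blocks of ten, and the record lists translation table 11. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It is not the database's own pipeline: the helper names are hypothetical, the codon table is built from the NCBI amino-acid ordering for table 11 (whose sense codons match the standard code), and alternative start codons are not handled (this gene starts with ATG).

```python
# NCBI-style codon ordering: first, second, third base each iterate over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG"
print(translate(first_line))  # prints "MKGYSKKVLVGVSFASLMLG"
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.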