Gene BCAH820_4461 details

Gene Information

Locus tag: BCAH820_4461
Symbol:
ID: 7190611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH820
Kingdom: Bacteria
Replicon accession: NC_011773
Strand:
Start bp: 4217165
End bp: 4218445
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 38%
IMG OID: 643557872
Product: peptidase, U32 family
Protein accession: YP_002453410
Protein GI: 218905576
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
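
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (neither assumption is stated in the record itself):

```python
# Values copied from the Gene Information fields above.
start_bp = 4217165
end_bp = 4218445

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values match the listed gene length (1281 bp) and protein length (426 aa).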


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 109
Fosmid unclonability p-value: 0.00643994
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGACTGTAC AAGAAATTTC ACGAGTAATC GATGGCAAAC GTGTTATTGT GAAGAAACCT 
GAACTGTTAA TCCCTGCGGG TAACTTAGAA AAATTAAAAG TAGCTATCCA TTACGGGGCA
GATGCTGTAT ATTTAGGTGG ACAAGAATTT GGTCTTCGTT CGAATGCAGG TAACTTTACA
CTGGAAGAAA TGGCAGAAGG TGTGGAATTC GCAAAGAAAT ATGGAGCGAA AATATATGTA
ACAACAAATA TCTTTGCGCA TAATGAAAAT ATGGACGGGC TAGAGGAATA TTTAAAAGGG
ATTGAAAAAG CTGGCGTAAC GGGAATTATC GTTGCTGATC CGCTTATTAT TGAGACTTGT
AAACGTGTAG CACCTTCTGT TGAGGTGCAT TTAAGTACAC AACAATCACT ATCCAACTGG
AAAGCAGCAC AGTATTGGAA AGAAGAAGGT TTACATCGTC TTGTATTAGC TCGTGAAGCA
AGCTATGAAG AAATGAAAGA AATTAAAGAA AAAGTGGATA TTGAAATTGA AGCATTCGTC
CATGGTGCAA TGTGTATCGC ATACTCAGGA AGATGTACGT TAAGTAACCA TATGACAGCG
CGTGATTCTA ACCGTGGTGG TTGTTGTCAA TCTTGCCGCT GGGACTATGA TTTAGTTCAA
ACAGTATCAC AACATAAAGA TGCAAAAGAG CTTCCTCTAT TCCAAGAAGA AGATGCTCAC
TTCGCGATGA GTCCAAAAGA CTTAAATTTA ATTTTATCAA TTCCGAAAAT GATTGAAATT
GGAATTGATA GCTTAAAAGT TGAAGGACGT ATGAAATCAA TCCATTACGT AGCGACTGTA
GCGACAGTAT ACCGTAAAGT AATTGATGCA TATTGTGCGG ATCCTGATAA CTTTGAGTTT
AAGCAAGAAT GGTTAGATGA GCTTGATAAA TGTGCAAATC GTGATACAGC TCCTGCATTC
TTTGAAGGGG TTCCAGGACA TCAAGAGCAA ATGTTTGGAA ATCATAGTAA GAAAACAACG
TATGATTTCG CTGGTTTAGT GTTAGATTAT AATGAAGAAA CGGGCATCGT AACGATTGAG
CAACGTAATC ATTTCAAACC AGGACATGAA GTGGAGTTCT TTGGACCAGA AATAGAAAAC
TTTACGCAGA CGGTGGAGAA AATTTGGGAT GAGGATGGAA ACGAATTAGA TGCAGCGAGA
CATCCGTTGC AGATCGTGAA ATTCAAAGTG GATCAACCAG TGTATGTGAA TAATATGATG
CGCAAAAGCA TACTTCAATA A
 
Protein sequence
MTVQEISRVI DGKRVIVKKP ELLIPAGNLE KLKVAIHYGA DAVYLGGQEF GLRSNAGNFT 
LEEMAEGVEF AKKYGAKIYV TTNIFAHNEN MDGLEEYLKG IEKAGVTGII VADPLIIETC
KRVAPSVEVH LSTQQSLSNW KAAQYWKEEG LHRLVLAREA SYEEMKEIKE KVDIEIEAFV
HGAMCIAYSG RCTLSNHMTA RDSNRGGCCQ SCRWDYDLVQ TVSQHKDAKE LPLFQEEDAH
FAMSPKDLNL ILSIPKMIEI GIDSLKVEGR MKSIHYVATV ATVYRKVIDA YCADPDNFEF
KQEWLDELDK CANRDTAPAF FEGVPGHQEQ MFGNHSKKTT YDFAGLVLDY NEETGIVTIE
QRNHFKPGHE VEFFGPEIEN FTQTVEKIWD EDGNELDAAR HPLQIVKFKV DQPVYVNNMM
RKSILQ
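
As a rough consistency check between the gene and protein sequences above, the sketch below strips the 10-character block spacing and translates DNA with the codon assignments of translation table 11, which uses the same codon-to-amino-acid mapping as the standard code. The helper names `clean`, `translate`, and `gc_percent` are illustrative only and not part of any database tooling, and only the first and last rows of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the listing.
first_row = "ATGACTGTAC AAGAAATTTC ACGAGTAATC GATGGCAAAC GTGTTATTGT GAAGAAACCT"
last_row = "CGCAAAAGCA TACTTCAATA A"

print(translate(first_row))  # MTVQEISRVIDGKRVIVKKP
print(translate(last_row))   # RKSILQ
```

Each full row of the gene listing holds 60 bases, so every row starts in frame and can be translated on its own. The two translated rows match the start and end of the protein sequence. Passing all rows of the gene sequence to `gc_percent` should give a figure comparable to the GC content reported in the record.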