Gene Bcer98_3094 details


Gene Information

Locus tag: Bcer98_3094
Symbol:
ID: 5343800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 3135572
End bp: 3136852
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 38%
IMG OID: 640840588
Product: peptidase U32
Protein accession: YP_001376313
Protein GI: 152976796
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
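
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3135572, 3136852)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1281 bp and 426 aa, which agrees with the listed Gene Length and Protein Length.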


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.585219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATGC AAGAAATTTC ACGAGTAATT GATGGCAAGC GCGTTATTGT GAAGAAACCT 
GAATTGTTAA TCCCAGCAGG AAATTTAGAG AAATTAAAAA TAGCTATCCA TTACGGTGCA
GATGCTGTAT ATTTAGGTGG GCAAGAATTT GGTCTTCGTT CCAATGCTGG AAACTTCACA
TTAGAAGAGA TGGCAGAAGG CGTTGAATTT GCTAAGAAAT ATGGTGCAAA GATATATGTA
ACAACAAATA TTTTTGCACA TAATGAGAAC ATGGAAGGGC TAGAAGAATA TTTAAGAGGC
ATTGAAAAAG CTGGTGTAAC AGGTATTATC GTTGCAGACC CTCTTATTAT TGAAACATGT
AAGCGGGTCG CACCTTCTGT TGAAGTACAT TTAAGTACGC AACAATCACT ATCTAACTGG
AAAGCAGCGC AATATTGGAA AGAGGAAGGA TTACATCGCC TTGTATTAGC GCGTGAAGTT
GGATATGAAG AAATGAAAGA AATAAAAGAA CACGTTGATA TTGAAATTGA GGCATTTGTC
CATGGGGCAA TGTGTATTGC GTATTCTGGA AGATGTACAT TAAGTAACCA TATGACAGCA
CGCGACTCTA ACCGCGGTGG TTGTTGCCAA TCTTGTCGCT GGGATTACGA CTTAATTCAA
ACAGTCTCCC AACATAAAGA TGCACAAGAA TTATCGTTGT TCCAAGAAGG AGATGCTCCT
TTCGCGATGA GTCCGAAAGA TTTAAATTTA ATTCTTTCGA TTCCAAAAAT GATTGAGATC
GGAATTGACA GCTTAAAAGT TGAAGGACGT ATGAAATCTA TCCATTACAT TGCAACTGTA
GCAACGGTAT ATCGTAAAGT AATCGATACG TACTGCGCAG ATCCTGATAA TTTTGAATTT
AAACAAGAAT GGTTAGACGA ATTGGATAAA TGTGCAAATC GTGACACAGC TCCAGCTTTC
TTTGAAGGTG TACCAGGATA TCAAGAACAA ATGTATGGAA ATCATAGTAA GAAAACAACG
TATGATTTTG CTGGTTTAGT GTTAGATTAT AATGAAGAAA CAGGCATTGC AACAATCGAA
CAACGTAATT ATTTTAAACC AGGCCATGAA GTGGAGTTCT TTGGACCAGA AATAGAAAAC
TTTACACAAA CGGTGGAGAA AATTTGGGAT GAGGATGGAA ATGAATTAGA TGCAGCAAGA
CACCCGCTGC AAATCGTGAA AATCAAAGTG GATCGACCAG TGTATGTGAA CAATATGATG
CGCAAAAGCA TATATCAATA A
 
Protein sequence
MTMQEISRVI DGKRVIVKKP ELLIPAGNLE KLKIAIHYGA DAVYLGGQEF GLRSNAGNFT 
LEEMAEGVEF AKKYGAKIYV TTNIFAHNEN MEGLEEYLRG IEKAGVTGII VADPLIIETC
KRVAPSVEVH LSTQQSLSNW KAAQYWKEEG LHRLVLAREV GYEEMKEIKE HVDIEIEAFV
HGAMCIAYSG RCTLSNHMTA RDSNRGGCCQ SCRWDYDLIQ TVSQHKDAQE LSLFQEGDAP
FAMSPKDLNL ILSIPKMIEI GIDSLKVEGR MKSIHYIATV ATVYRKVIDT YCADPDNFEF
KQEWLDELDK CANRDTAPAF FEGVPGYQEQ MYGNHSKKTT YDFAGLVLDY NEETGIATIE
QRNYFKPGHE VEFFGPEIEN FTQTVEKIWD EDGNELDAAR HPLQIVKIKV DRPVYVNNMM
RKSIYQ
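
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the sequence is given 5' to 3' on the coding strand and is already in frame. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
print(translate("ATGACAATGC AAGAAATTTC ACGAGTAATT"))
print(translate("CGCAAAAGCA TATATCAATA A"))
```

The two calls translate the opening 30 bases and the closing 21 bases of the gene sequence; the results correspond to the first ten and last six residues of the listed protein sequence.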