Gene Bcer98_1621 details

Gene Information

Locus tag: Bcer98_1621
Symbol:
ID: 5346866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 1724098
End bp: 1725606
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 38%
IMG OID: 640839199
Product: peptidase
Protein accession: YP_001374925
Protein GI: 152975408
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
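
The listed gene and protein lengths follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (consistent with the 1509 bp length given above) and that the final codon is a stop codon. The function names are illustrative and not part of the source record.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


start_bp, end_bp = 1724098, 1725606
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1509
print(protein_length_aa(length))  # 502
```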


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.379555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGA AAGTGATAGC GCTCGCGGCG GTTATACCTC TTGTGTTAGG GACGGTATCT 
ACAGCTTCGG CAGTGGAGAA AGAACAAGTA AGCCTAGAAA AGTATTCCCC TAAAGAAAAG
GCAATAGAAT ATTTGAAAGA AAATGCAGCG CATTATGCGT TGAAGGAAGA TCTATCAGAT
TTACGATATA TTTCAACAAC TGAAACGCCA GTAGCCTCAT ATGTGAGATT TCAACAAGTC
GTAAATGATG CTCCTGTATT TTCACGACAA ATAACGGTGA CAATAAATAG GGCAGGACAA
AGTGTATTAG TAGTTTCTGA TTACCAGCCT GTTCAAAGGG TGAAAGAAAT AAAGAAAAAG
ATGAGTGAGC AAGAAGCTGA ACAAAAGTCA AAATCATATG TATCTGGTGC TGAAAATGAA
AGTAATTTAT GGGCACCAAC GACGAAAGAA TTTGGATATA TCATTGAAGA GGGAGTTGCT
ATACCGGTAT ATAAAGTTGT TGTCCATTCT AATAAACCAT TTGGTGCTTG GGAAACATTG
ATTGATGCTG GAAGTGGAAA GCTATTAAAA AAGGTGGATA TAAACCGTAA AGTAGAGGGA
ACGGGTAAAG TATTTTTGCC AAATCCAGTC GTATCAAATG GTAGCTTAAC AGGCTTGAAA
GATAACAATG ATAAAGATTC AGTAGAATTA AATAATCAAT TGAAAACGGT TATTTTAAAA
GGTTTAGATG GAACGGGTTT TTTAATTGGT GATTATGTAA CAATTTCTTC TAAGGCAAAA
ACAAAATCTA CAAATTTTCA ATTCAATTAC ACACGTTCTC ATGATAGTTT TGAAGATGTC
ATGGCATATT ATCATATTGA TACTTTGCAA CGTTATATTC AAGGGTTGGG CTTTCAAAAT
ATTAACAATC GCTCCATTAA AGTGAATGTA AATGGAACAA CGGCTGATAA CTCTTTTTAT
TCTCCCTCAA CGAAAGCTTT AACATTTGGA ACAGGTGGAG TAGATGATGC AGAGGATGCC
GGAATTATTG CACATGAATA TGGACACTCT ATCCAAGATA ATCAAGTTCC AGGGTTCGGA
AGTTCCTTAG AAGGCGGAGC AATGGGGGAA GGGTTTGGTG ATTTCTTAGG TGCGACGTAT
GAAGATGCTG TATCGACGAC AGAATATGGG AAAGCTTGTG TTGGAGAATG GGATGCAACA
GCTTATTCGA GCTCTGATCC AACATGTCTT CGTCGGTTAG ATAATAATAA AGTATATCCA
AAAGATATAC AAAATGAAGT ACATGCAGAC GGAGAAATTT GGGCGCAAGG AGAGTATGAA
ATGGCGCAAG CCTTTGGGCG TGATGTAGCG ACAAAAATCA TTTTACAATC CCATTGGTCT
TTGACACCAA ATGCGACATT TCATGATGGA GCACGAGCAA TTAAACAAGC GGATGCGCTT
CTTTATGGGG GACAACATGC TGCAGAAATT GATCGAATTT GGATAGCAAG AGGAATTCGT
ACAAATTAA
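
The GC content reported for this gene can be recomputed from the sequence as printed. The following is a minimal sketch, assuming the sequence is pasted with its 10-base spacing and line breaks intact. The helper name `gc_content` is hypothetical, and only the first sequence line is used here for brevity.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First line of the gene sequence, copied with its original spacing.
first_line = "ATGAATAAGA AAGTGATAGC GCTCGCGGCG GTTATACCTC TTGTGTTAGG GACGGTATCT"
print(f"{gc_content(first_line):.1%}")
```

Passing the full 1509 bp sequence in place of `first_line` gives the gene-wide value to compare against the figure in the record.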
 
Protein sequence
MNKKVIALAA VIPLVLGTVS TASAVEKEQV SLEKYSPKEK AIEYLKENAA HYALKEDLSD 
LRYISTTETP VASYVRFQQV VNDAPVFSRQ ITVTINRAGQ SVLVVSDYQP VQRVKEIKKK
MSEQEAEQKS KSYVSGAENE SNLWAPTTKE FGYIIEEGVA IPVYKVVVHS NKPFGAWETL
IDAGSGKLLK KVDINRKVEG TGKVFLPNPV VSNGSLTGLK DNNDKDSVEL NNQLKTVILK
GLDGTGFLIG DYVTISSKAK TKSTNFQFNY TRSHDSFEDV MAYYHIDTLQ RYIQGLGFQN
INNRSIKVNV NGTTADNSFY SPSTKALTFG TGGVDDAEDA GIIAHEYGHS IQDNQVPGFG
SSLEGGAMGE GFGDFLGATY EDAVSTTEYG KACVGEWDAT AYSSSDPTCL RRLDNNKVYP
KDIQNEVHAD GEIWAQGEYE MAQAFGRDVA TKIILQSHWS LTPNATFHDG ARAIKQADAL
LYGGQHAAEI DRIWIARGIR TN
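
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, assuming the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code for elongation (table 11 differs in which codons may act as starts; this gene starts with ATG, so that distinction does not arise here). The `translate` helper is illustrative, not taken from the source page.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        residue = CODON_TABLE[cleaned[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First six codons and last three codons of the gene sequence above.
print(translate("ATGAATAAGA AAGTGATA"))  # MNKKVI
print(translate("ACAAATTAA"))            # TN
```

The two fragments reproduce the first six and last two residues of the listed protein sequence.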