Gene Bcer98_2323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_2323 
Symbol 
ID5345741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp2416389 
End bp2418029 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content36% 
IMG OID640839838 
Productpeptidase M20 
Protein accessionYP_001375564 
Protein GI152976047 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4187] Arginine degradation protein (predicted deacylase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000106298 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAT GGCAGACAAA AGAAGAGTTA GTACAATTAT TGAGTAATCT TGTTGAAATT 
CCTAGTATTA CTGGATCAAA AGCTGAAACA GTATTACCAG ACTTTGTTGT TGAACAACTA
TCTAGTCTAT CGTACTTCAA GGAGAATCCT AGTCATTTAG AAAAACATCC AACAGGGGAT
GGACGCTATT TTATTACAGC TTTAGTAAAG AAAGAAAAAC ATTTGAAAAA TACAATTGTT
TTAGTGAGTC ACTTTGATGT TGTAGATGTA GAGGACTATG GCATATGGAA AGAATATGCA
TTTCAACCTC AAAAACTCAC ATCTATGTTT TATTCACATA AAGATGAAGT GCCATATCCT
GTTCAGCAAG ATATAGAACA AGGTGATTGG CTATTTGGTA GAGGGATAAT GGATATGAAG
TGCGGTTTAA CTTTGCATAT GGCAATGGTT GAACAAGCTT GTGAAGGGAG TTTTGATGGG
AATATTCTTT TATTAACTGT CCCAGATGAA GAGGTGAACT CTGTAGGAAT GAGAACCGCT
GTTCCAAAAT TATTAGAGCT TGCAAAAGAA CACGATCTTC AGTATAAAGC AGTCCTCAAT
TCTGAACCAA TGTTTACATG TTATCCAGGA GATCAAAATA AATATATTTA TACAGGATCT
ATTGGAAAAG TATTACCTGG TTTTCTTTGT TATGGAAAGG AAACGCATGT AGGAGAACCG
TTTGCGGGAT TAAATGCGAA TTATATGGCT TCATTATTAA CTGCGGAATT GGAATTAAAT
ACGGAACTTT GTGATATTGT AGAAGGCGAA GCAAGTCCAC CTCCGACAAA TTTATTGCAA
AGGGATTTAA AAGAAGAGTA TTCGGTACAA ATTCCTCATC GTGCAGTGAC ATTGTTTAAT
TTATTTTTAT TAGAAAAAAC GATGCCAGAT GTTGTTTCGT TGCTATATAA AAAGGCAATG
GGAGTAGCGG AGAAAATCGA GGAAACGTAT GCGAAACAGG CATATCATTT TTCTAAATAT
AATCCGTTTA TACCGCCCAG TCTAAAAGTG AATGTACTCA CTTATGAGGA ATTGGTTGCT
TATGCCATTG AACAACATGG AAAAGAAGCA ATTGATAAAA TGCAAGCGAT TGTTTTGGAA
AATCGCGGGG GAAAAGATGA CCGTGCAATT ACGATTGAAC TTGTGGATAG GTTAGCCATT
TTATGTAAAG AAAAAGGGCC AATGATCATA CTGTTCTTTG CGCCACCTTA CTATCCGGCT
GTAAGTTCTC GTAATAATCC ATTCATTCAA AGCGCAGTAG CAGAACTAGA AAGTTATGGA
CATGACCAAC ATGGAGTTAC ATTTAAAACT CAAAATTATT TTGGCGGAAT TTCTGACTTA
AGTTATGTAG GATTACAATA TCCAGTTGAA GCAATGACTT CACTTGTAGA AAATATGCCT
TTATGGAATA AAGGGTATTC GATTCCACTA CAAGAATTAG CAGAGTTTAA TGTTCCTGTA
TTAAATGTTG GACCTGTAGG AAAAGATGCG CACCAGTGGA CTGAGCGTTT AGATGTAGAT
TATGCGTTTG AAACATTATT GGATATGTTA CCAGTATGTA TTGATAGATT ACTCGCTAGG
AATCAAATGT CACAAGTATA G
 
Protein sequence
MAKWQTKEEL VQLLSNLVEI PSITGSKAET VLPDFVVEQL SSLSYFKENP SHLEKHPTGD 
GRYFITALVK KEKHLKNTIV LVSHFDVVDV EDYGIWKEYA FQPQKLTSMF YSHKDEVPYP
VQQDIEQGDW LFGRGIMDMK CGLTLHMAMV EQACEGSFDG NILLLTVPDE EVNSVGMRTA
VPKLLELAKE HDLQYKAVLN SEPMFTCYPG DQNKYIYTGS IGKVLPGFLC YGKETHVGEP
FAGLNANYMA SLLTAELELN TELCDIVEGE ASPPPTNLLQ RDLKEEYSVQ IPHRAVTLFN
LFLLEKTMPD VVSLLYKKAM GVAEKIEETY AKQAYHFSKY NPFIPPSLKV NVLTYEELVA
YAIEQHGKEA IDKMQAIVLE NRGGKDDRAI TIELVDRLAI LCKEKGPMII LFFAPPYYPA
VSSRNNPFIQ SAVAELESYG HDQHGVTFKT QNYFGGISDL SYVGLQYPVE AMTSLVENMP
LWNKGYSIPL QELAEFNVPV LNVGPVGKDA HQWTERLDVD YAFETLLDML PVCIDRLLAR
NQMSQV