Gene Bcer98_1166 details


Gene Information

Locus tag: Bcer98_1166
Symbol:
ID: 5344730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 1277669
End bp: 1278769
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 36%
IMG OID: 640838759
Product: peptidase M50
Protein accession: YP_001374486
Protein GI: 152974969
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
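
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon (both assumptions are consistent with the values in this record; the function names are illustrative only):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
START_BP = 1277669
END_BP = 1278769


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```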


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.59725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAGA AAAATACAAA GGGATTATGG GGAATTTTGG CTGCAGTCGG AATCTTTTTA 
TTTTCTAAGC TGAAATGGGT ATTAGCAATT TTGAAATTTG CTAAATTTTC AACAGTATTT
AGTATGTTGT TATCACTCGG AGCATATGCA GCTATATATG GTTGGAAATT TGGAGTCGCA
CTCATTTATT TACTTTTCGT ACATGAAATG GGACATTTAT GGGCGGCGAA GCGAAAAGGT
ATACCTACGT CACCGGCAAT TTTCATTCCA TTTATGGGTG CGTTAATTGG AATGAAGGAG
ATGCCGAAAA ATGCAAAGGA TGAGGCGTAT CTTGCTTATA TGGGACCTTT ATTTGGTTTG
TTATCATTTT TACCAGCTAT TCCGCTTTAT ATGCTAACGA ATGAGCCATT CTGGGCACTC
GTGATTTTAC TGGGAAGCAT GTTGAACTTT TTTAATTTAA TTCCGGTATC GCCTTTAGAT
GGCGGAAGAA TTATTTCGGT TGTTAGCACG AAAATTTGGA TTGGCGGTCT TGTTTTATTG
CTTGGCTATT CCATTTTCTT TACAAGTATT ATCGGATTTT TCATTTTTGT TATAGGATGC
ATGGAACTGT ATCGAGTAAT GAAGCGTGAT AAGCCGATTG AAGAATTAGG TTACAAAGTA
GAAATATTAA AAACGTATCT TTCTAAATTG CAAGAAGAGT TTCTTGAAAC AGGAGCAGTG
CATCGAACGC TTTATGTAGC TCACCATGAA ATGGGACAAT TGAGACAAAA GGCGAAAGAA
AAGAAGCTTG AAACAGGAGA AAGCCAAAAA ATTGAAGTAT TAGAGTATAT TGTGCCTAAA
TTTGAAGCGC TTGATTATGT GCCATATGAA GAGGAAAAAG AAGAACATAC CAATCGTATG
AAAGAAGCGA TCGCGTTATT AGAAACAAAA GGAAATCAAT GGGAGAAGGA AAAGAAACAA
CAAGAGGAAT ATTATAAAGT CGATGCGAAA ACAAGGTGGA TTGTATTTGG TTGTTACATT
GGTCTACTTG TCATATTAGG CTATGCTGCT TATGAAGGAC ATGTGATTCT ACAGCAGTAT
TTACCAGCGC GAAATGTGTA G
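
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. A minimal sketch of computing a GC percentage from such text is shown here; the helper names are hypothetical, and the exact method the database used to obtain the listed GC content is not stated in this record:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCAAAAGA AAAATACAAA GGGATTATGG"
print(len(clean_sequence(first_blocks)))  # 30
print(gc_percent(first_blocks))           # 30.0
```

Passing the full gene sequence text to `gc_percent` gives the value to compare against the GC content field.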
 
Protein sequence
MQKKNTKGLW GILAAVGIFL FSKLKWVLAI LKFAKFSTVF SMLLSLGAYA AIYGWKFGVA 
LIYLLFVHEM GHLWAAKRKG IPTSPAIFIP FMGALIGMKE MPKNAKDEAY LAYMGPLFGL
LSFLPAIPLY MLTNEPFWAL VILLGSMLNF FNLIPVSPLD GGRIISVVST KIWIGGLVLL
LGYSIFFTSI IGFFIFVIGC MELYRVMKRD KPIEELGYKV EILKTYLSKL QEEFLETGAV
HRTLYVAHHE MGQLRQKAKE KKLETGESQK IEVLEYIVPK FEALDYVPYE EEKEEHTNRM
KEAIALLETK GNQWEKEKKQ QEEYYKVDAK TRWIVFGCYI GLLVILGYAA YEGHVILQQY
LPARNV
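
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, no special start-codon handling is included (an assumption that would not hold for genes starting with GTG or TTG).

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence, as printed in the record.
print(translate("ATGCAAAAGA AAAATACAAA GGGATTATGG"))  # MQKKNTKGLW
print(translate("TTACCAGCGC GAAATGTGTA G"))           # LPARNV
```

The two outputs match the first ten and last six residues of the protein sequence listed above.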