Gene BCG9842_B4700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4700 
Symbol 
ID7182259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp591229 
End bp592929 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content38% 
IMG OID643548374 
Productneutral protease Npr599 
Protein accessionYP_002444067 
Protein GI218895656 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.000121132 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAGA AAAGTTTAGC GTTAGTGTTA GCGACAGGAA TGGCAGTTAC AACGTTTGGA 
GGGACAGGCT CTGCATTTGC GGATTCTAAA AATGTACTCT CTACGAAGAA GTACAATGAG
ACAGTACAGT CACCTGAGTT TATTTCTGGT GATTTAACTG GAGCGACTGG TAAGAAAGCA
GAATCTGTTG TGTTTGATTA CTTAAACGCA GCAAAAGGTG ATTACAAGCT AGGGGAAAAG
AGTGCGCAAG ATTCTTTCAA AGTGAAACAA GTGAAGAAAG ATGCTGTAAC TGATTCAACA
GTAGTACGTA TGCAACAAGT TTACGAAGGA GTACCTGTAT GGGGTTCTAC TCAAGTAGCT
CACGTAAGTA AAGACGGTTC TTTAAAAGTA TTGTCTGGAA CAGTTGCACC TGATTTAGAC
AAGAAGGAAA AGTTGAAAAA TAAAAATAAG ATTGAAGGCG CAAAAGCGAT TGAAATCGCG
CAGCAAGACT TAGGGGTAAC ACCGAAATAT GAGGTAGAAC CAAAAGCGGA CTTATATGTA
TATCAAAATG GTGAAGAAAC AACATATGCA TACGTTGTAA ATCTAAACTT CTTAGATCCT
AGTCCAGGAA ACTACTACTA TTTCATTGAA GCAGACAGCG GTAAAGTATT AAATAAATAT
AATAAATTGG ATCATGTAGC AAATGAAGAT AAGTCACCAG TTAAGCAAGA GGCTCCTAAA
CAGGATGCGA AAGCAGTAGT AAAACCTGTA ACAGGAACAA ATAAAGTAGG AACTGGTAAA
GGTGTATTAG GAGATACGAA ATCACTTAAT ACAACATTAT CTGGTTCATC TTACTACTTA
CAAGATAATA CGCGCGGAGC AACGATTTTC ACATACGATG CGAAAAACCG CTCAACATTA
CCAGGAACAT TATGGGCAGA TGCAGATAAT GTTTTCAATG CAGCGTATGA TGCAGCGGCA
GTAGATGCTC ATTACTATGC AGGTAGAACA TATGATTATT ATAAAGCTAC ATTTAACAGA
AACTCTATTA ATGATGCAGG AGCACCATTA AAATCAACAG TTCATTACGG AAGTAAGTAT
AATAATGCAT TCTGGAACGG TTCACAAATG GTATACGGGG ATGGTGATGG TGTAACATTC
ACTTCATTAT CTGGTGGAAT TGACGTAATT GGTCACGAAT TAACGCATGC TGTTACGGAA
AATAGCTCGG ACTTAATTTA TCAAAATGAA TCAGGGGCGT TAAATGAAGC GATTTCTGAT
ATCTTTGGTA CTTTAGTAGA ATTCTATGAT AACCGTAACC CAGATTGGGA GATTGGTGAA
GATATTTACA CGCCTGGTAA AGCAGGAGAC GCGCTTCGCT CTATGAGTGA TCCAGCGAAA
TATGGTGACC CAGACCACTA TTCTAAGCGT TACACAGGTT CAAGTGATAA CGGTGGCGTT
CATACAAACA GTGGTATTAT TAACAAACAA GCTTATTTAT TAGCAAATGG TGGTACGCAT
TCTGGTGTAA CTGTAACTGG TATTGGTAAA GATAAATTAG GTGCGATTTA CTACCGTGCA
AATACACAGT ATTTCACGCA ATCTACTACA TTTAGTCAAG CTCGTGCTGG TGCAGTACAA
GCTGCTGCGG ATTTATATGG TGCTAGCTCT GCAGAAGTAA ATGCAGTGAA GCAATCATTT
AGTGCTGTTG GTGTAAATTA A
 
Protein sequence
MKKKSLALVL ATGMAVTTFG GTGSAFADSK NVLSTKKYNE TVQSPEFISG DLTGATGKKA 
ESVVFDYLNA AKGDYKLGEK SAQDSFKVKQ VKKDAVTDST VVRMQQVYEG VPVWGSTQVA
HVSKDGSLKV LSGTVAPDLD KKEKLKNKNK IEGAKAIEIA QQDLGVTPKY EVEPKADLYV
YQNGEETTYA YVVNLNFLDP SPGNYYYFIE ADSGKVLNKY NKLDHVANED KSPVKQEAPK
QDAKAVVKPV TGTNKVGTGK GVLGDTKSLN TTLSGSSYYL QDNTRGATIF TYDAKNRSTL
PGTLWADADN VFNAAYDAAA VDAHYYAGRT YDYYKATFNR NSINDAGAPL KSTVHYGSKY
NNAFWNGSQM VYGDGDGVTF TSLSGGIDVI GHELTHAVTE NSSDLIYQNE SGALNEAISD
IFGTLVEFYD NRNPDWEIGE DIYTPGKAGD ALRSMSDPAK YGDPDHYSKR YTGSSDNGGV
HTNSGIINKQ AYLLANGGTH SGVTVTGIGK DKLGAIYYRA NTQYFTQSTT FSQARAGAVQ
AAADLYGASS AEVNAVKQSF SAVGVN