Gene pE33L466_0375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0375 
Symbolnpr 
ID3399868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp377350 
End bp379020 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content36% 
IMG OID637660192 
Productneutral protease 
Protein accessionYP_245856 
Protein GI67078236 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CTGTTATTAC ATTACTTGCT GCAGGAACAA TGTTAGGTGC GCCTTTTTCA 
ACTGCATTTG CAGAAGAACA AGCATCTCAA AAAGAAGTAA TGGATAAAAT GGAAGTACAA
CAAAAGAATT GGAATGAGGA ACAAGGAAGT CCATCATTTC TTTCAGGGGA ATTATCTGAT
AAGAAAGTAG AAACTCAAAA AGCAGTAAAA GAGTTTCTTG AAGAAAATAA AGAATTATTT
AAGGTAAATC CACAAACTGA TCTAACGCTT AAAGAAGTGA AATCTGATGA TTTAGGTATG
AAACATTATG TTTATACAAG GTCTGTAAAT AAGGTGCCTG TTGATGGTGC GCAATTCATT
GTTCATACAG ATAAAGAGGG CAAGGTAACA ACAGTAAACG GAGATGTTCA TCCATCTGCT
GCGGAAAATT TGAAAGGTGA TACAGAAGCA AAGATTACAA AAGAAACAGC CCTTTCAAGT
GCCTGGAAAC ATATTAAACT TACAAAAAAT GATACATTAG TAAAAGAGGA TGGAAATACA
TTAGACCAAG TAAAAGAAAA CTTAGAATCT ACAAATGAAA AAGCAGATTT AGTTGTATAT
GAAAAAGACG GAGAATATTA TCTAACGTTT AAAGTGCAAC TGCAATTTAT CAAACCCTAC
GGAGCTAATT GGCAGATCTA TGTGAATGCG GAAGATGGAA AAATTATAGA TTCATATAAC
GCAGTTACAG ATGCAGAGAG TGCGCAAAAA GGGTATGGAC AAGGGGTATT AGGGGATCGA
AAAGAACTGA ATACAACCTT TGATAGTGTA AAGGGGAAAT ACTATTTAAA AGACACAACA
AAGCCTATGA ATGGAGGATA TATTGAAACA TTTACGGTAA ATCATAGTGA TGCAGATTAT
CCAGTTAACT ATCGTCTCCT GGATGATGAT AATGCTTGGA TAAACAAAAA TCAAGGACCA
GCAGTCGATG CTCATTATCA TGCAGGAAAA GTCTATGACT ACTATAAAAA TATTCACAAT
CGTAACAGTA TTGATGGAAA AGGGAAAACA ATTCGTTCTG GTGTGAATTA TGGAGTAAAT
GTAAACAATG CATTTTGGAA TGGACAGCAA ATGATTTATG GAGATGGTGA TGGGCGCATA
TTCTCTCCAC TTTCAGGTTC CCTTGATGTT GTTGCGCATG AATTAACTCA TGCCGTGACA
CAGTATTCAG CTGATCTTCG TTATGTAAAT CAATCAGGTG CGTTAAATGA ATCGTTCTCT
GATGTATTTG GATATTTTGT TGATCCAACA AATTGGGATT TAGGAGAGGC TGTATATACG
CCTGGTATTT CTGGAGATGC ACTTCGCAGT TTATCAAATC CTGAAAAATA TGGCCAACCT
TCTCATATGA GGGATTATCA ATACCTTCCG GCAACTGAAG AAGGCGATAA CGGTGGGGTA
CATATTAATA GTGGTATTCC GAATAAGGCT GCATATCTAA CAATTAATGC TATTGGTAAA
GAAAAAGCAG AAAAAATCTA TTATCGCGCG TTAACAACAT ATTTAACACC AACAAGTGAC
TTTAAACAAG CTCGTACAGC TTTATTACAA TCCGCAGCTG ATTATGATGG CTATGGTAGT
GCAACCTATA AAGCAGTAGA AAACGCTTGG AATCAAGTGG GCGTAAAGTA A
 
Protein sequence
MKKTVITLLA AGTMLGAPFS TAFAEEQASQ KEVMDKMEVQ QKNWNEEQGS PSFLSGELSD 
KKVETQKAVK EFLEENKELF KVNPQTDLTL KEVKSDDLGM KHYVYTRSVN KVPVDGAQFI
VHTDKEGKVT TVNGDVHPSA AENLKGDTEA KITKETALSS AWKHIKLTKN DTLVKEDGNT
LDQVKENLES TNEKADLVVY EKDGEYYLTF KVQLQFIKPY GANWQIYVNA EDGKIIDSYN
AVTDAESAQK GYGQGVLGDR KELNTTFDSV KGKYYLKDTT KPMNGGYIET FTVNHSDADY
PVNYRLLDDD NAWINKNQGP AVDAHYHAGK VYDYYKNIHN RNSIDGKGKT IRSGVNYGVN
VNNAFWNGQQ MIYGDGDGRI FSPLSGSLDV VAHELTHAVT QYSADLRYVN QSGALNESFS
DVFGYFVDPT NWDLGEAVYT PGISGDALRS LSNPEKYGQP SHMRDYQYLP ATEEGDNGGV
HINSGIPNKA AYLTINAIGK EKAEKIYYRA LTTYLTPTSD FKQARTALLQ SAADYDGYGS
ATYKAVENAW NQVGVK