Gene Information |
Locus tag | pE33L466_0375 |
Symbol | npr |
ID | 3399868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_007103 |
Strand | + |
Start bp | 377350 |
End bp | 379020 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637660192 |
Product | neutral protease |
Protein accession | YP_245856 |
Protein GI | 67078236 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
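The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(377350, 379020)
print(length)                     # 1671
print(protein_length_aa(length))  # 556
```

Both values match the Gene Length and Protein Length rows of this record.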
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTGTTATTAC ATTACTTGCT GCAGGAACAA TGTTAGGTGC GCCTTTTTCA ACTGCATTTG CAGAAGAACA AGCATCTCAA AAAGAAGTAA TGGATAAAAT GGAAGTACAA CAAAAGAATT GGAATGAGGA ACAAGGAAGT CCATCATTTC TTTCAGGGGA ATTATCTGAT AAGAAAGTAG AAACTCAAAA AGCAGTAAAA GAGTTTCTTG AAGAAAATAA AGAATTATTT AAGGTAAATC CACAAACTGA TCTAACGCTT AAAGAAGTGA AATCTGATGA TTTAGGTATG AAACATTATG TTTATACAAG GTCTGTAAAT AAGGTGCCTG TTGATGGTGC GCAATTCATT GTTCATACAG ATAAAGAGGG CAAGGTAACA ACAGTAAACG GAGATGTTCA TCCATCTGCT GCGGAAAATT TGAAAGGTGA TACAGAAGCA AAGATTACAA AAGAAACAGC CCTTTCAAGT GCCTGGAAAC ATATTAAACT TACAAAAAAT GATACATTAG TAAAAGAGGA TGGAAATACA TTAGACCAAG TAAAAGAAAA CTTAGAATCT ACAAATGAAA AAGCAGATTT AGTTGTATAT GAAAAAGACG GAGAATATTA TCTAACGTTT AAAGTGCAAC TGCAATTTAT CAAACCCTAC GGAGCTAATT GGCAGATCTA TGTGAATGCG GAAGATGGAA AAATTATAGA TTCATATAAC GCAGTTACAG ATGCAGAGAG TGCGCAAAAA GGGTATGGAC AAGGGGTATT AGGGGATCGA AAAGAACTGA ATACAACCTT TGATAGTGTA AAGGGGAAAT ACTATTTAAA AGACACAACA AAGCCTATGA ATGGAGGATA TATTGAAACA TTTACGGTAA ATCATAGTGA TGCAGATTAT CCAGTTAACT ATCGTCTCCT GGATGATGAT AATGCTTGGA TAAACAAAAA TCAAGGACCA GCAGTCGATG CTCATTATCA TGCAGGAAAA GTCTATGACT ACTATAAAAA TATTCACAAT CGTAACAGTA TTGATGGAAA AGGGAAAACA ATTCGTTCTG GTGTGAATTA TGGAGTAAAT GTAAACAATG CATTTTGGAA TGGACAGCAA ATGATTTATG GAGATGGTGA TGGGCGCATA TTCTCTCCAC TTTCAGGTTC CCTTGATGTT GTTGCGCATG AATTAACTCA TGCCGTGACA CAGTATTCAG CTGATCTTCG TTATGTAAAT CAATCAGGTG CGTTAAATGA ATCGTTCTCT GATGTATTTG GATATTTTGT TGATCCAACA AATTGGGATT TAGGAGAGGC TGTATATACG CCTGGTATTT CTGGAGATGC ACTTCGCAGT TTATCAAATC CTGAAAAATA TGGCCAACCT TCTCATATGA GGGATTATCA ATACCTTCCG GCAACTGAAG AAGGCGATAA CGGTGGGGTA CATATTAATA GTGGTATTCC GAATAAGGCT GCATATCTAA CAATTAATGC TATTGGTAAA GAAAAAGCAG AAAAAATCTA TTATCGCGCG TTAACAACAT ATTTAACACC AACAAGTGAC TTTAAACAAG CTCGTACAGC TTTATTACAA TCCGCAGCTG ATTATGATGG CTATGGTAGT GCAACCTATA AAGCAGTAGA AAACGCTTGG AATCAAGTGG GCGTAAAGTA A
Protein sequence | MKKTVITLLA AGTMLGAPFS TAFAEEQASQ KEVMDKMEVQ QKNWNEEQGS PSFLSGELSD KKVETQKAVK EFLEENKELF KVNPQTDLTL KEVKSDDLGM KHYVYTRSVN KVPVDGAQFI VHTDKEGKVT TVNGDVHPSA AENLKGDTEA KITKETALSS AWKHIKLTKN DTLVKEDGNT LDQVKENLES TNEKADLVVY EKDGEYYLTF KVQLQFIKPY GANWQIYVNA EDGKIIDSYN AVTDAESAQK GYGQGVLGDR KELNTTFDSV KGKYYLKDTT KPMNGGYIET FTVNHSDADY PVNYRLLDDD NAWINKNQGP AVDAHYHAGK VYDYYKNIHN RNSIDGKGKT IRSGVNYGVN VNNAFWNGQQ MIYGDGDGRI FSPLSGSLDV VAHELTHAVT QYSADLRYVN QSGALNESFS DVFGYFVDPT NWDLGEAVYT PGISGDALRS LSNPEKYGQP SHMRDYQYLP ATEEGDNGGV HINSGIPNKA AYLTINAIGK EKAEKIYYRA LTTYLTPTSD FKQARTALLQ SAADYDGYGS ATYKAVENAW NQVGVK
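The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is stored in whitespace-separated blocks of ten characters, as displayed above, and it uses the codon assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs only in the start codons it permits). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, chosen for this example only.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering; translation table 11
# uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence shown above.
head = clean_sequence("ATGAAAAAAA CTGTTATTAC ATTACTTGCT")
tail = clean_sequence("AATCAAGTGG GCGTAAAGTA A")
print(translate(head))  # MKKTVITLLA
print(translate(tail))  # NQVGVK
```

The two fragments reproduce the first ten and last six residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence is one way to cross-check the GC content field.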