Gene Information |
Locus tag | pE33L466_0380 |
Symbol | npr |
ID | 3399873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_007103 |
Strand | + |
Start bp | 385514 |
End bp | 387220 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637660197 |
Product | neutral protease |
Protein accession | YP_245861 |
Protein GI | 67078241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
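The length fields in the table above can be cross-checked from the coordinates. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values, but the record itself does not state them. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 385514
END_BP = 387220


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1707 568
```

Under these assumptions the results agree with the Gene Length (1707 bp) and Protein Length (568 aa) rows.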
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000306257 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATA AGAAGAATTT CGTGAAGATA GGATTAACTA CAGGAGTAAT GTTATCGGTG ATTATGCCCT ATGGAGATGC ATATGCGGCA ACAGAAGATT TAAAAGTGGA AACAAAGGAA GATACGTTCC GAACAGGTAA TTTAACAGTA CCTTCTCAAA AATCGGCAGA AAATGTAGCA AAAGATGCGC TAAAAGGAAA AACAGAACAA GCATTATCAT CAAAGCAAGT TAATACGGAA TCAAAAGTAA ATTATAATGT TACGCAAAGT CGTAAATCTT ATGATGGTAC TACATTGGTA CGTCTTCAAC AAACATATGA AGGACGCGAT GTATACGGAT ATCAACTAAC AGCACATATT AATGATGATG GTGTACTTAC GAGTGTTTCG GGGGATAGTG CCCAAGATCT ACAACAACAA GAAGATTTGA AACAACCTAT TACTCTATCA GAAGAGGATG CAAAGAAGCA GCTTTTTAAA ATCTATGGGG ATAATCTTAC ATTTGTTGAA GAACCAGAAA TTAAACAAGT GGTATATGTA GATGAAAATA CAAATAAAGC TACAAGCGCA TACCAAATTA CTTTTAGTGC ATCTACACCT GAATATGTAT CGGGTACGGT ATTAATTGAT GCTTTTGTTG GGGATCTATT AAAAGAACTC GTTCAAAAAT TGGGTATACA AGTAGACAGC AGTATTGTTC AATCCGCAAC ATCAAATAAA TCACAAGATC CTTCTAAATT AACAGGCACA GGAAAAGATG ACTTAGGTAT GAATCGTACA TTTGGAATTT CACAACGAAG TGATGGAACG TACACTCTTG CAGATTATTC TCGTGGTAAG GGAATTGAAA CGTATACTGC TAATTATAAA GATTATAATA ATTATAGAAG AAATATATGG GGTTATTTGG ATGATTTAGT AACAAGTAAT TCTACAAATT TTACAGATCC TAAAGCAGTC AGTGCACATT ATTTAGCAAC GAAAGTATAT GATTTTTATC AAGAAAAATA TAGCCGAAAC AGCTTTGATA ATAATGGACA AAAAGTAATT TCTGTCGTTC ATGGCTGGAA TACAAATGGT ACGAATAAAG GAAATCCTAA GCAATGGTTT AATGCATTTA GTAATGGGGC TATGCTGGTA TACGGAGATC CAATTGTTAG AGCATTTGAT GTGGCAGGAC ATGAGTTTAC ACATGCGGTT ACGAGAAATG AGTCTGGACT TGAGTACGCA GGGGAAGCTG GTGCAATTAA TGAAGCAATA TCTGATATTT TAGGAGTAGC GGTTGAGAAG TATGCAAATA ACGGGAAATT TAATTGGACA ATGGGAGAAC AATCAGGTCG TATTTTTAGA GATATGAAAA ATCCATCATC TATCTCTTCT AGATATCCAG AAGATTATAG ACATTATAAC AATTTACCTA TTGATGCTGC CCATGATCAT GGTGGTGTAC ACACGAACTC TAGTATTATT AATAAAGTAG CTTATTTGAT TGCTAGTGGT GGAAATCATA ACGGAGTAAA TGTACAAGGC ATTGGAGAAG ATAAAATGTT TGATATTTTC TATTATGCAA ATACGGATGA ATTAAATATG ACTTCTGACT TTAAAGAATT AAAAGAAGCT TGTATTCGTG TAGCAACGAA CTTATATGGT AAGGATTCAT CAGAAGTACA AGCTGTCCAA CAAGCCTTTA AAGCAGCTTA TATTTAA
|
Protein sequence | MKNKKNFVKI GLTTGVMLSV IMPYGDAYAA TEDLKVETKE DTFRTGNLTV PSQKSAENVA KDALKGKTEQ ALSSKQVNTE SKVNYNVTQS RKSYDGTTLV RLQQTYEGRD VYGYQLTAHI NDDGVLTSVS GDSAQDLQQQ EDLKQPITLS EEDAKKQLFK IYGDNLTFVE EPEIKQVVYV DENTNKATSA YQITFSASTP EYVSGTVLID AFVGDLLKEL VQKLGIQVDS SIVQSATSNK SQDPSKLTGT GKDDLGMNRT FGISQRSDGT YTLADYSRGK GIETYTANYK DYNNYRRNIW GYLDDLVTSN STNFTDPKAV SAHYLATKVY DFYQEKYSRN SFDNNGQKVI SVVHGWNTNG TNKGNPKQWF NAFSNGAMLV YGDPIVRAFD VAGHEFTHAV TRNESGLEYA GEAGAINEAI SDILGVAVEK YANNGKFNWT MGEQSGRIFR DMKNPSSISS RYPEDYRHYN NLPIDAAHDH GGVHTNSSII NKVAYLIASG GNHNGVNVQG IGEDKMFDIF YYANTDELNM TSDFKELKEA CIRVATNLYG KDSSEVQAVQ QAFKAAYI
|
| |
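The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not the database's own pipeline: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for internal codons), and alternative start codons permitted by table 11 are not handled, since this gene begins with ATG. Whitespace is stripped so that the 10-base groups used in this record can be pasted in directly.

```python
# Codon table built from the conventional compact form (bases in TCAG order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAAGAATA AGAAGAATTT CGTGAAGATA"))  # prints: MKNKKNFVKI
```

The printed residues match the first group of the protein sequence above. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content row.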