Gene pE33L466_0380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0380 
Symbolnpr 
ID3399873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp385514 
End bp387220 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content34% 
IMG OID637660197 
Productneutral protease 
Protein accessionYP_245861 
Protein GI67078241 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000306257 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATA AGAAGAATTT CGTGAAGATA GGATTAACTA CAGGAGTAAT GTTATCGGTG 
ATTATGCCCT ATGGAGATGC ATATGCGGCA ACAGAAGATT TAAAAGTGGA AACAAAGGAA
GATACGTTCC GAACAGGTAA TTTAACAGTA CCTTCTCAAA AATCGGCAGA AAATGTAGCA
AAAGATGCGC TAAAAGGAAA AACAGAACAA GCATTATCAT CAAAGCAAGT TAATACGGAA
TCAAAAGTAA ATTATAATGT TACGCAAAGT CGTAAATCTT ATGATGGTAC TACATTGGTA
CGTCTTCAAC AAACATATGA AGGACGCGAT GTATACGGAT ATCAACTAAC AGCACATATT
AATGATGATG GTGTACTTAC GAGTGTTTCG GGGGATAGTG CCCAAGATCT ACAACAACAA
GAAGATTTGA AACAACCTAT TACTCTATCA GAAGAGGATG CAAAGAAGCA GCTTTTTAAA
ATCTATGGGG ATAATCTTAC ATTTGTTGAA GAACCAGAAA TTAAACAAGT GGTATATGTA
GATGAAAATA CAAATAAAGC TACAAGCGCA TACCAAATTA CTTTTAGTGC ATCTACACCT
GAATATGTAT CGGGTACGGT ATTAATTGAT GCTTTTGTTG GGGATCTATT AAAAGAACTC
GTTCAAAAAT TGGGTATACA AGTAGACAGC AGTATTGTTC AATCCGCAAC ATCAAATAAA
TCACAAGATC CTTCTAAATT AACAGGCACA GGAAAAGATG ACTTAGGTAT GAATCGTACA
TTTGGAATTT CACAACGAAG TGATGGAACG TACACTCTTG CAGATTATTC TCGTGGTAAG
GGAATTGAAA CGTATACTGC TAATTATAAA GATTATAATA ATTATAGAAG AAATATATGG
GGTTATTTGG ATGATTTAGT AACAAGTAAT TCTACAAATT TTACAGATCC TAAAGCAGTC
AGTGCACATT ATTTAGCAAC GAAAGTATAT GATTTTTATC AAGAAAAATA TAGCCGAAAC
AGCTTTGATA ATAATGGACA AAAAGTAATT TCTGTCGTTC ATGGCTGGAA TACAAATGGT
ACGAATAAAG GAAATCCTAA GCAATGGTTT AATGCATTTA GTAATGGGGC TATGCTGGTA
TACGGAGATC CAATTGTTAG AGCATTTGAT GTGGCAGGAC ATGAGTTTAC ACATGCGGTT
ACGAGAAATG AGTCTGGACT TGAGTACGCA GGGGAAGCTG GTGCAATTAA TGAAGCAATA
TCTGATATTT TAGGAGTAGC GGTTGAGAAG TATGCAAATA ACGGGAAATT TAATTGGACA
ATGGGAGAAC AATCAGGTCG TATTTTTAGA GATATGAAAA ATCCATCATC TATCTCTTCT
AGATATCCAG AAGATTATAG ACATTATAAC AATTTACCTA TTGATGCTGC CCATGATCAT
GGTGGTGTAC ACACGAACTC TAGTATTATT AATAAAGTAG CTTATTTGAT TGCTAGTGGT
GGAAATCATA ACGGAGTAAA TGTACAAGGC ATTGGAGAAG ATAAAATGTT TGATATTTTC
TATTATGCAA ATACGGATGA ATTAAATATG ACTTCTGACT TTAAAGAATT AAAAGAAGCT
TGTATTCGTG TAGCAACGAA CTTATATGGT AAGGATTCAT CAGAAGTACA AGCTGTCCAA
CAAGCCTTTA AAGCAGCTTA TATTTAA
 
Protein sequence
MKNKKNFVKI GLTTGVMLSV IMPYGDAYAA TEDLKVETKE DTFRTGNLTV PSQKSAENVA 
KDALKGKTEQ ALSSKQVNTE SKVNYNVTQS RKSYDGTTLV RLQQTYEGRD VYGYQLTAHI
NDDGVLTSVS GDSAQDLQQQ EDLKQPITLS EEDAKKQLFK IYGDNLTFVE EPEIKQVVYV
DENTNKATSA YQITFSASTP EYVSGTVLID AFVGDLLKEL VQKLGIQVDS SIVQSATSNK
SQDPSKLTGT GKDDLGMNRT FGISQRSDGT YTLADYSRGK GIETYTANYK DYNNYRRNIW
GYLDDLVTSN STNFTDPKAV SAHYLATKVY DFYQEKYSRN SFDNNGQKVI SVVHGWNTNG
TNKGNPKQWF NAFSNGAMLV YGDPIVRAFD VAGHEFTHAV TRNESGLEYA GEAGAINEAI
SDILGVAVEK YANNGKFNWT MGEQSGRIFR DMKNPSSISS RYPEDYRHYN NLPIDAAHDH
GGVHTNSSII NKVAYLIASG GNHNGVNVQG IGEDKMFDIF YYANTDELNM TSDFKELKEA
CIRVATNLYG KDSSEVQAVQ QAFKAAYI