Gene BCZK3635 details

Gene Information

Locus tag: BCZK3635
Symbol: pepQ
ID: 3026891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 3775357
End bp: 3776427
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 35%
IMG OID: 637547851
Product: proline dipeptidase
Protein accession: YP_085217
Protein GI: 52141612
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
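
The coordinates and lengths in the record above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3775357, 3776427)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both values agree with the Gene Length and Protein Length fields of the record.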


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTTAA AAATTAATAA AATCCAAAAT CAACTACAGA ACTATGAAAT TGACGGGTTA 
CTCATTACAA AAAAAGAAAA TCGCCAATAT GCGACAGGCT TTACAGGTAG TGCTGGTGTT
GTCTTAATCT CTGCGGATGC AGCTGTTTTT ATAACTGATT TCCGCTATGT AGACCAAGCG
AATTCACAAA TAAAAAATGC TGAAATTATT ATGCATAAAG GAAATTTAGA AAAAGAAATT
GCAAATCAAG TATCGAAATT AAACATTCAA AAACTTGGAA TTGAAGAAAA TAATATGACA
TTGCAACAAT ATAAAAACTT ACAAAAATAT GTACATACGG AAATGGTTCA AGTGTGTGAA
ATCATTGAAA ACATTCGTCT TATTAAAGAC ACTCATGAAA TAGAAACAAT GAAAATCGCA
GCTAATATTG CGGACGAAGC ATTTCACCAC ATCATTACGT TTCTAAAACC AGGAATAAGT
GAAAATGATG TACGAGATGA GTTAGAATTT TTCATGCGAA AAAAAGGGGC TACGTCCTCT
TCATTCCAAA TCATTGTAGC TTCTGGCGTT CGTTCTTCAC TTCCTCATGG AGTTGCATCA
AATAAAATAA TTGAACGAGG CGACATCGTT ACATTAGATT TCGGTGCACT TTACGACGGA
TATTGTTCCG ATATAACACG TACTGTAGCA ATCGGGGAAC CACCAGAAGA GTTCAAAAAA
ATATACAGTG TTGTACGCGA AGCATTAAAA CGCGGGACTG AAGCAATTAA GCCTGGAGAA
ACTGCGAAAC GTATCGATGA TATAACGAGA AACTATATTA TAGAACATGG ATACGGTCAA
TATTTTGGAC ATTCTACTGG TCATGGTCTT GGATTAGAAA TTCATGAACC ACTTCGCCTA
TCCCAAGAAA GTAAGGCTAT TTTAGAAGAA GGTATGGTCG TTACCATTGA ACCAGGTATT
TACATACCAA ACTGGGGCGG TTGTAGAATT GAAGATGATA TCGTCATTAC AGAAGATGGA
TATGAAGTTA TTACAAAATC AAATAGAGAT CTAATTATAA TCCCTTGTTA A
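
The gene sequence above is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation such as GC content. The sketch below shows one way to do that. The helper names are hypothetical, and only the first displayed line of the sequence is used here, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGACTTTAA AAATTAATAA AATCCAAAAT CAACTACAGA ACTATGAAAT TGACGGGTTA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # 0.25 (this line only, not the full gene)
```

Passing the full 1071 bp sequence through the same two functions gives the value to compare against the GC content field of the record.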
 
Protein sequence
MTLKINKIQN QLQNYEIDGL LITKKENRQY ATGFTGSAGV VLISADAAVF ITDFRYVDQA 
NSQIKNAEII MHKGNLEKEI ANQVSKLNIQ KLGIEENNMT LQQYKNLQKY VHTEMVQVCE
IIENIRLIKD THEIETMKIA ANIADEAFHH IITFLKPGIS ENDVRDELEF FMRKKGATSS
SFQIIVASGV RSSLPHGVAS NKIIERGDIV TLDFGALYDG YCSDITRTVA IGEPPEEFKK
IYSVVREALK RGTEAIKPGE TAKRIDDITR NYIIEHGYGQ YFGHSTGHGL GLEIHEPLRL
SQESKAILEE GMVVTIEPGI YIPNWGGCRI EDDIVITEDG YEVITKSNRD LIIIPC
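
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino acid string NCBI uses for its genetic codes (bases ordered T, C, A, G). It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The function name is hypothetical, and only the first 30 bases of the gene are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a whitespace-free DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the display spaces removed.
print(translate("ATGACTTTAAAAATTAATAAAATCCAAAAT"))  # MTLKINKIQN
```

The output matches the first ten residues of the listed protein sequence. The same function applied to the last nine bases, CCTTGTTAA, returns PC, which matches the final two residues before the stop codon.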