Gene BCZK1401 details


Gene Information

Locus tag: BCZK1401
Symbol:
ID: 3023224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 1490505
End bp: 1491767
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 35%
IMG OID: 637545633
Product: TPR repeat-containing protein
Protein accession: YP_082999
Protein GI: 52143829
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
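
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above
length = gene_length_bp(1490505, 1491767)
print(length, protein_length_aa(length))  # prints "1263 420"
```

Under these assumptions the computed values agree with the listed Gene Length (1263 bp) and Protein Length (420 aa).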


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0553383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAGT TTGAACAAGC TGTTTCATAT ATTGAAAATG GTGAAGCGGA AAAAGGATTA 
CAATTATTAA AAGAGCAATT AAAAATTGCG AATGATGAAG AGAAGTATGA TATTGCTCGC
TATTATCATA CTCTTGGATT TACGGATGAA GCGTTAGCTA TTACAGAAGA TTTGCGTTTA
TTGTATCCAG AAGAAAGTGA ATTCACTGTA TTTTTAGCAG AATTATATAT TGATCTAGAC
AAAGAAGATG AAGCGATTGA AGTGCTTCAT GATATTCCAG AAAATGATGA TTTATATGTT
CAATCGTTAT TACTAGTTGC GGATTTATTC CAAATGCAAG GTTTTGATGA TGTAGCAGAA
CAAAAACTAT TAAAGGCGAA AGAAATGATG CCTGACGAAC CTGTCATTAC GTTTGGATTA
GCAGAGTTAT ATAGTAGTAA AGGTGAAGAA CAAAAGGCAA TCACTTATTA TGAGTCGCTA
TTAGCGGAAC ATAAAGTAAT GGGTGGTGTT GTCATTGCAC TACGCCTTGG AGAAACGTTA
AGTGCGATTG GAAATTGGGA AGAGGCGATT TCTTACTACG AAGCAGGTTT AGAAGAACAA
AAAGATATCC ACTCATTGTT TGGATATGCC TTCACATTAT ACCAAGGTGA AGAATACCAA
AGAGCAATTG GTGCTTGGCA AGAACTAAAA GAATTAGATC CTGAGTATGC ATCCCTTTAC
ATGTATTTAG CGAAAAGCTA TGAAAAAGAA GGAATGCTTC AAGAAAGCTA TGAAACACTT
CATGAAGGAA TTAAAGTAGA TGAACTTTCT GTACCATTTT ATGTAGAATT AGCGAACATT
GCAGCGAAAT TAGGGAAAAT AGCGGAAGCA GAGGAAGTGC TTCAAAAAGC GCTTGAGTTA
GATCCAGGAC ATTTAGGTGC AACATTAAAA TATGCATATA TCTTAAAGGA ACAAGAAAAG
TATGAAGAGC TAATTGCCGT TGTAGAGCGT GCTATCGATA GTGGAGAACC AGATACACAA
CTACTTTGGG ATCTTGCGTT TGCAAAAAAA CAATTAGAAA TGTATTCTGA TGCATTAAAA
CACTATGAAA GTGCATATAC TTCTTTTAAG AATCATCCAG ACTTCTTGGA AGAGTACGGT
TATTTCTTAT TGGAAGAAGG TATGCGAAAA GAGGCGAAAG AAGTATTTAC TCAGTTAATA
CAACTAGACC CGACACAAAT TCATATTGAA GAATTGTTAT ATAATTTAGA GGATTTTTCA
TAA
 
Protein sequence
MQKFEQAVSY IENGEAEKGL QLLKEQLKIA NDEEKYDIAR YYHTLGFTDE ALAITEDLRL 
LYPEESEFTV FLAELYIDLD KEDEAIEVLH DIPENDDLYV QSLLLVADLF QMQGFDDVAE
QKLLKAKEMM PDEPVITFGL AELYSSKGEE QKAITYYESL LAEHKVMGGV VIALRLGETL
SAIGNWEEAI SYYEAGLEEQ KDIHSLFGYA FTLYQGEEYQ RAIGAWQELK ELDPEYASLY
MYLAKSYEKE GMLQESYETL HEGIKVDELS VPFYVELANI AAKLGKIAEA EEVLQKALEL
DPGHLGATLK YAYILKEQEK YEELIAVVER AIDSGEPDTQ LLWDLAFAKK QLEMYSDALK
HYESAYTSFK NHPDFLEEYG YFLLEEGMRK EAKEVFTQLI QLDPTQIHIE ELLYNLEDFS
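
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first row of the gene sequence is used as a sample, so the printed value describes that row and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence shown above, used as a short sample
first_row = "ATGCAAAAGT TTGAACAAGC TGTTTCATAT ATTGAAAATG GTGAAGCGGA AAAAGGATTA"
dna = clean_sequence(first_row)
print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a figure to compare against the listed GC content.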