Gene BCZK0472 details

Gene Information

Locus tag: BCZK0472
Symbol:
ID: 3026514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 554148
End bp: 555278
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 35%
IMG OID: 637544689
Product: IS605 family transposase
Protein accession: YP_082079
Protein GI: 52144749
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
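
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1131 bp) and that the protein length excludes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(554148, 555278)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```

Both values match the Gene Length and Protein Length fields of the record.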


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATTGG CGAAGAAAGT TAGACTGATT CCAACGCCTG AACAAGAAAA GGTGCTTAGA 
AACCATGCTG GTGCTGCAAG ATTCGCTTAT AACTATTGTA AAAGAATGAG TGATAGATAC
TATAAGCTAT TTGGAAAATC TGTTTCCCAG TTAGCTTTAC AGAAACGATT TACAAAGATC
AAGAAGCGAA AGAGATATGA GTGGTTAAAA TACATTAATG CACAAGTTCC CAAACAGGCT
TCAAAAGATT TTGATACGGC GAGAAAACAT TCGTTCAAAA AGTACAAAAA TGGTTATCAC
ACTTCTTATA AATCCAAAAA AGATGTAATC CAAGGATTTT ATGCCAATTA TGAAAGACTG
GTTATAGGAA AGAAAGTCGT TCATATTCAG TCTATTGGAG AAGTGAAAAC AAGCCAACAA
CTACCAAGAA ATAAAAAACC ATCCAATCCA AGAGTTACCT TTGATGGTCG TCACTGGTGG
ATTAGTGTAG GGTTCCAAGA AGACTTTGAA TCACAAGAAC TAACGAATGA GTCGATTGGT
GTGGATGTTG GTTTAAAAGA ACTTTTTGTA GCTTCTAATG GTATGAAAGA ACGAAATATA
AACAAAGATG CCAAAGTTAA AAAACTTTTG AAAAGGAAAA AGTCAGCACA AAGAGATATG
TCTAGGAGAT TTAAAAAAGG TGTAACAATT CAATCTGCCG GATATGAAAA AGCTAGAGCG
GAGCACCTGC GGTTATCTAG GAAAATTACG AATATCCGAA ATAACCATAT CCATCAAGCA
ACAGCAAAAT TGGTGAAAAC CAAACCAATG AGGATTGTTG TGGAAGACTT ACCTATCTCA
AACCTGTTAA AAAACAAAAA ACTATCGAAA GCATTCTTAT TTCAAAAATT AAACTTCTTC
TTTCAATGTT TATCATACAA GTGCGAGAAA TATGGCATTG CGTATGTAAA AGCTGATAAA
TGGTTCGCCT CAAGCAAGAT TTGTTCATGT TGCGGAGTAA AATACGACCA TTCAGTTCAA
CCAGAAGGAC AATGGAGTTT AAAAATACGT GAGTGGTGTT GTGCTTCATG CAATAGCCAT
CACGATCGAG ATGTAAATGC TGCGATGAAT TTATCAAGAT GGGTAAAATA A
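
A GC content figure such as the one in the Gene Information section can be recomputed from a nucleotide sequence. The sketch below assumes GC content is defined as the share of G and C among the A/C/G/T bases. How the database rounds the value is not stated in the record, and `gc_percent` is a hypothetical helper name. Only the first row of the gene sequence is used here, so the result will not necessarily equal the value reported for the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 60 bases of the gene sequence, copied with their spacing.
first_row = "ATGATATTGG CGAAGAAAGT TAGACTGATT CCAACGCCTG AACAAGAAAA GGTGCTTAGA"
print(round(gc_percent(first_row)))
```

Passing the complete gene sequence instead of `first_row` gives the figure to compare against the GC content field.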
 
Protein sequence
MILAKKVRLI PTPEQEKVLR NHAGAARFAY NYCKRMSDRY YKLFGKSVSQ LALQKRFTKI 
KKRKRYEWLK YINAQVPKQA SKDFDTARKH SFKKYKNGYH TSYKSKKDVI QGFYANYERL
VIGKKVVHIQ SIGEVKTSQQ LPRNKKPSNP RVTFDGRHWW ISVGFQEDFE SQELTNESIG
VDVGLKELFV ASNGMKERNI NKDAKVKKLL KRKKSAQRDM SRRFKKGVTI QSAGYEKARA
EHLRLSRKIT NIRNNHIHQA TAKLVKTKPM RIVVEDLPIS NLLKNKKLSK AFLFQKLNFF
FQCLSYKCEK YGIAYVKADK WFASSKICSC CGVKYDHSVQ PEGQWSLKIR EWCCASCNSH
HDRDVNAAMN LSRWVK
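
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. Because this gene starts with ATG, no special handling of alternative start codons is included, which is a simplification. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon-to-amino-acid assignments in the compact layout used for genetic
# code tables: bases ordered T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Whitespace is removed first, so rows can be pasted as printed.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_row = "ATGATATTGG CGAAGAAAGT TAGACTGATT CCAACGCCTG AACAAGAAAA GGTGCTTAGA"
last_row = "CACGATCGAG ATGTAAATGC TGCGATGAAT TTATCAAGAT GGGTAAAATA A"

print(translate(first_row))  # MILAKKVRLIPTPEQEKVLR
print(translate(last_row))   # HDRDVNAAMNLSRWVK*
```

The two outputs match the first 20 and last 16 residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon that is not counted in the protein length.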