Gene BCZK0217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0217 
Symbol 
ID3022540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp232307 
End bp233479 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content39% 
IMG OID637544392 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_081832 
Protein GI52144997 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTATC GTCACATGGG GGAGCTACCT CATAAACGAC ATGTACAATT CCGTAAAAAA 
GATGGATCAC TTTATCGTGA ACAGGTAATG GGAACAAAAG GTTTTTCTGG TACGCAATCT
ATTTTGTATC ATCATTATAT GCCAACGGAA GTAGGGCATG CGGCATTATC GCATTCTTGT
CAGTTGCAGT ATGAAGAAGA TGTTGCTCTT TCTCATCGTC ACTTTCGCAC GAAAGAGAAT
AAAAAAAGTG GTGATGCAGT AAGTGGCAGA AACTTTATAC TTGGAAATGA GGATTTATTA
ATCGGAGTAG TGACTCCGAC AGAAAAAATG GATTATTTCT ACCGTAATGG TGATGGTGAT
GAAATGTTGT TTGTCCATTA CGGAACAGGA AAAATTGAAA CAATGTTCGG AACGATTCAC
TATCGAAAAG GTGATTATGT AATAATCCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT
GAAGGAGAGA CTAAGTTTCT TGTTGTAGAG GCAAATAGTC AAATTACAAC ACCGCGTCGC
TACCGTAATG AATATGGACA ATTGTTAGAG CATAGCCCGT TTTGTGAGAG AGATATTCGT
GGCCCGGAAA AATTAGAGAC ATATGATGAA AAAGGTGAGT TTGTCGTAAT GACAAAGTCG
CGAGGATATA TGCATAAACA TGTTTTAGGA CACCATCCGT TAGATGTAGT TGGATGGGAT
GGTTATTTAT ATCCTTGGGT CTTTAATGTA GAGGATTTTG AACCAATTAC AGGTCGTATT
CATCAGCCAC CTCCAGTACA TCAAACGTTC GAGGGTCACA ATTTTGTTAT TTGTTCTTTC
GTACCACGTT TATATGACTA TCATCCAGAA TCTATTCCGG CACCGTATTA TCATAGTAAC
GTGAATAGTG ATGAAGTACT GTACTATGTA GAAGGTAACT TTATGAGCCG AAAAGGGGTG
GAGGAAGGGT CTATTACACT TCATCCGAGC GGCATTCCTC ATGGGCCACA TCCTGGGAAA
ACAGAGGCGA GTATAGGGAA AAAAGAAACG CTTGAATTAG CTGTTATGAT AGATACATTC
CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAAACAGAAG ATGAAAAATA TATGTATAGC
TGGATTGAAG AGGGATCGTA TACTGTGAAA TAA
 
Protein sequence
MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHAALSHSC 
QLQYEEDVAL SHRHFRTKEN KKSGDAVSGR NFILGNEDLL IGVVTPTEKM DYFYRNGDGD
EMLFVHYGTG KIETMFGTIH YRKGDYVIIP IGTIYRVIPD EGETKFLVVE ANSQITTPRR
YRNEYGQLLE HSPFCERDIR GPEKLETYDE KGEFVVMTKS RGYMHKHVLG HHPLDVVGWD
GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN
VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF
RPLRIVKQAH ETEDEKYMYS WIEEGSYTVK