Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK0215 |
Symbol | hppD |
ID | 3024857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 230200 |
End bp | 231318 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637544390 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_081830 |
Protein GI | 52144999 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAA AATCTATGGA TACGCTAGCT GCACAAATGG AGGACTTTTT TCCAGTACGT GATGTAGATC ATTTGGAATT TTACGTAGGA AATGCAAAGC AATCAAGTTA TTATCTTGCG AGAGCATTCG GATTCAAAAT TGTGGCTTAC TCTGGATTAG AAACTGGTAA TCGTGAAAAA GTATCTTATG TTCTTGTGCA AAAAAATATG CGTTTTGTTG TGTCTGGGGC TTTAAGTAGT GACAATCGTA TTGCAGAGTT TGTAAAGACT CATGGTGATG GCGTGAAAGA TGTGGCATTA CTTGTTGACG ATGTTGATAA AGCATACTCA GAAGCAGTGA AACGTGGTGC CGTCGCAATT GCTCCGCCTG TAGAGTTAAC AGATGAGAAC GGTACATTGA AAAAAGCAGT TATTGGTACG TATGGTGATA CAATTCATAC GCTTGTAGAG CGTAAAAATT ATAAAGGGAC ATTTATGCCA GGATTCCAAA AGGCTGAGTT TGATATTCCA TTTGAAGAGT CAGGTTTAAT TGCTGTAGAC CATGTAGTTG GTAATGTTGA AAAGATGGAA GAGTGGGTTA GTTATTACGA GAACGTTATG GGCTTTAAAC AAATGATTCA TTTTGATGAT GATGATATTA GTACAGAGTA TTCAGCATTA ATGTCGAAGG TTATGACAAA TGGAAGTCGT ATTAAGTTCC CTATTAACGA GCCAGCAGAT GGAAAGAGAA AATCACAAAT TCAAGAATAT CTAGAGTTCT ATAATGGAGC AGGTGTACAG CATCTTGCTT TACTAACAAA TGACATTGTT AAAACAGTAG AAGCGCTACG TGCAAATGGT GTGGAGTTTT TAGATACACC AGATACTTAT TATGATGAGT TAACTGCACG AGTTGGAAAA ATTGATGAGG AAATTGATAA GTTGAAAGAA TTAAAGATTT TAGTAGATCG CGATGATGAA GGATACTTAC TACAAATCTT TACGAAACCA ATTGTAGATC GTCCAACTTT ATTTATTGAA ATCATTCAGC GTAAAGGTTC TCGTGGATTT GGAGAAGGAA ACTTTAAAGC GTTATTCGAA TCAATTGAAA GAGAACAAGA GCGTCGCGGG AATTTATAA
|
Protein sequence | MKQKSMDTLA AQMEDFFPVR DVDHLEFYVG NAKQSSYYLA RAFGFKIVAY SGLETGNREK VSYVLVQKNM RFVVSGALSS DNRIAEFVKT HGDGVKDVAL LVDDVDKAYS EAVKRGAVAI APPVELTDEN GTLKKAVIGT YGDTIHTLVE RKNYKGTFMP GFQKAEFDIP FEESGLIAVD HVVGNVEKME EWVSYYENVM GFKQMIHFDD DDISTEYSAL MSKVMTNGSR IKFPINEPAD GKRKSQIQEY LEFYNGAGVQ HLALLTNDIV KTVEALRANG VEFLDTPDTY YDELTARVGK IDEEIDKLKE LKILVDRDDE GYLLQIFTKP IVDRPTLFIE IIQRKGSRGF GEGNFKALFE SIEREQERRG NL
|
| |