Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B5070 |
Symbol | hppD |
ID | 7181799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 215923 |
End bp | 217041 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643548019 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002443763 |
Protein GI | 218895352 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.160788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAA AATCTATGGA TACGCTAGCT GCACAAATGG AGGACTTTTT TCCAGTACGT GATGTAGATC ATTTGGAATT TTACGTAGGG AATGCAAAGC AATCGAGTTA TTATCTTGCG AGAGCATTCG GATTCAAAAT TGTGGCTTAC TCTGGATTAG AAACTGGAAA CCGTGAAAAG GTATCTTATG TTCTTGTGCA AAAAAATATG CGTTTCGTTG TGTCTGGAGC TTTAAGTAGT GAAAATCGTA TTGCAGAGTT TGTAAAGACT CATGGTGATG GCGTGAAGGA TGTGGCACTA CTTGTTGATG ATGTTGATAA AGCATACTCA GAAGCAGTGA AACGTGGTGC CGTCGCAATT GCTCCACCAG AGGAATTAAC AGATGAGGAC GGTACATTGA AAAAAGCAGT TATTGGTACG TATGGTGATA CAATTCATAC GCTTGTAGAG CGTAAAAATT ATAAAGGGGC ATTTATGCCA GGATTCCAAA AGGTAGAGTT TAATATTCCA TTTGAAGAGT CTGGTTTAAT TGCTGTCGAT CATGTAGTTG GTAATGTTGA AAAAATGGAA GAGTGGGTTA GTTATTACGA GAATGTCATG GGCTTTAAAC AAATGATTCA TTTTGATGAT GACGATATTA GTACAGAGTA TTCGGCGTTA ATGTCGAAAG TTATGACGAA TGGAAGCCGT ATTAAGTTTC CGATTAACGA ACCAGCAGAC GGAAAGAGAA AGTCACAAAT TCAAGAGTAT CTAGAATTCT ATAATGGAGC TGGTGTACAG CATCTTGCTT TATTAACAAG TGATATTGTT AAAACAGTTG AAGCGCTTCG TGCAAATGGG GTGGAGTTTT TAGATACACC TGATACTTAT TATGATGAGT TAACTGCACG AGTTGGAAAA ATCGATGAAG AAATTGATAA GCTAAAAGAA TTAAAGATCT TAGTAGATCG TGATGATGAA GGTTACTTAC TACAAATCTT TACGAAACCA ATTGTAGATC GCCCGACTTT ATTTATTGAA ATCATTCAAC GTAAAGGTTC TCGTGGATTT GGTGAAGGAA ACTTTAAAGC GTTATTCGAA TCAATTGAAA GAGAACAAGA GCGTCGCGGA AACTTATAA
|
Protein sequence | MKQKSMDTLA AQMEDFFPVR DVDHLEFYVG NAKQSSYYLA RAFGFKIVAY SGLETGNREK VSYVLVQKNM RFVVSGALSS ENRIAEFVKT HGDGVKDVAL LVDDVDKAYS EAVKRGAVAI APPEELTDED GTLKKAVIGT YGDTIHTLVE RKNYKGAFMP GFQKVEFNIP FEESGLIAVD HVVGNVEKME EWVSYYENVM GFKQMIHFDD DDISTEYSAL MSKVMTNGSR IKFPINEPAD GKRKSQIQEY LEFYNGAGVQ HLALLTSDIV KTVEALRANG VEFLDTPDTY YDELTARVGK IDEEIDKLKE LKILVDRDDE GYLLQIFTKP IVDRPTLFIE IIQRKGSRGF GEGNFKALFE SIEREQERRG NL
|
| |