Gene BCAH820_0253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_0253 
SymbolhppD 
ID7187999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp227486 
End bp228604 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content36% 
IMG OID643553659 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002449268 
Protein GI218901434 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones233 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA AATCTATGGA TACGCTAGCT GCACAAATGG AGGACTTTTT TCCAGTACGT 
GATGTAGATC ATTTGGAATT TTACGTAGGA AATGCAAAGC AATCAAGTTA TTATCTTGCG
AGAGCATTCG GATTCAAAAT TGTGGCTTAC TCTGGATTAG AAACTGGTAA TCGTGAAAAA
GTATCTTATG TTCTTGTGCA AAAAAACATG CGTTTTGTTG TGTCTGGGGC TTTAAGTAGT
GACAATCGTA TTGCAGAGTT TGTAAAGACT CATGGTGATG GCGTGAAAGA TGTAGCATTA
CTTGTTGACG ATGTTGATAA AGCATACTCA GAAGCAGTGA AACGTGGTGC CGTCGCAATT
GCTCCGCCTG TAGAGTTAAC AGATGAGAAC GGTACATTGA AAAAAGCAGT TATTGGTACG
TATGGTGATA CAATTCATAC GCTTGTAGAG CGTAAAAATT ATAAAGGGAC ATTTATGCCA
GGATTCCAAA AGGCTGAGTT TGATATTCCA TTTGAAGAGT CAGGTTTAAT TGCTGTAGAC
CATGTAGTTG GTAATGTTGA AAAGATGGAA GAGTGGGTTA GTTATTACGA GAACGTTATG
GGCTTTAAAC AAATGATCCA TTTTGATGAT GATGATATTA GTACAGAGTA TTCAGCATTA
ATGTCGAAGG TTATGACAAA TGGAAGTCGT ATTAAGTTCC CTATTAACGA GCCAGCAGAT
GGAAAGAGAA AATCACAAAT TCAAGAATAT CTAGAGTTCT ATAATGGAGC AGGTGTACAG
CATCTTGCTT TACTAACAAA TGACATTGTT AAAACAGTAG AAGCGCTACG TGCAAATGGT
GTGGAGTTTT TAGATACACC AGATACTTAT TATGATGAGT TAACTGCACG AGTTGGAAAA
ATTGATGAGG AAATTGATAA GTTGAAAGAA TTAAAGATTT TAGTAGATCG CGATGATGAA
GGATACTTAC TACAAATCTT TACGAAACCA ATTGTAGATC GTCCAACTTT ATTTATTGAA
ATCATTCAGC GTAAAGGTTC TCGTGGATTT GGAGAAGGAA ACTTTAAAGC GTTATTCGAA
TCAATTGAAA GAGAACAAGA GCGTCGCGGG AATTTATAA
 
Protein sequence
MKQKSMDTLA AQMEDFFPVR DVDHLEFYVG NAKQSSYYLA RAFGFKIVAY SGLETGNREK 
VSYVLVQKNM RFVVSGALSS DNRIAEFVKT HGDGVKDVAL LVDDVDKAYS EAVKRGAVAI
APPVELTDEN GTLKKAVIGT YGDTIHTLVE RKNYKGTFMP GFQKAEFDIP FEESGLIAVD
HVVGNVEKME EWVSYYENVM GFKQMIHFDD DDISTEYSAL MSKVMTNGSR IKFPINEPAD
GKRKSQIQEY LEFYNGAGVQ HLALLTNDIV KTVEALRANG VEFLDTPDTY YDELTARVGK
IDEEIDKLKE LKILVDRDDE GYLLQIFTKP IVDRPTLFIE IIQRKGSRGF GEGNFKALFE
SIEREQERRG NL