Gene BCG9842_B5070 details

Gene Information

Locus tag: BCG9842_B5070
Symbol: hppD
ID: 7181799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 215923
End bp: 217041
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 37%
IMG OID: 643548019
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_002443763
Protein GI: 218895352
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
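
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; both are assumptions that happen to agree with the listed values, not statements taken from the database. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon is counted in the gene length but not in the protein.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(215923, 217041)
print(length_bp, protein_length(length_bp))
```

With the record's Start bp and End bp, this reproduces the listed Gene Length (1119 bp) and Protein Length (372 aa).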


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.160788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAA AATCTATGGA TACGCTAGCT GCACAAATGG AGGACTTTTT TCCAGTACGT 
GATGTAGATC ATTTGGAATT TTACGTAGGG AATGCAAAGC AATCGAGTTA TTATCTTGCG
AGAGCATTCG GATTCAAAAT TGTGGCTTAC TCTGGATTAG AAACTGGAAA CCGTGAAAAG
GTATCTTATG TTCTTGTGCA AAAAAATATG CGTTTCGTTG TGTCTGGAGC TTTAAGTAGT
GAAAATCGTA TTGCAGAGTT TGTAAAGACT CATGGTGATG GCGTGAAGGA TGTGGCACTA
CTTGTTGATG ATGTTGATAA AGCATACTCA GAAGCAGTGA AACGTGGTGC CGTCGCAATT
GCTCCACCAG AGGAATTAAC AGATGAGGAC GGTACATTGA AAAAAGCAGT TATTGGTACG
TATGGTGATA CAATTCATAC GCTTGTAGAG CGTAAAAATT ATAAAGGGGC ATTTATGCCA
GGATTCCAAA AGGTAGAGTT TAATATTCCA TTTGAAGAGT CTGGTTTAAT TGCTGTCGAT
CATGTAGTTG GTAATGTTGA AAAAATGGAA GAGTGGGTTA GTTATTACGA GAATGTCATG
GGCTTTAAAC AAATGATTCA TTTTGATGAT GACGATATTA GTACAGAGTA TTCGGCGTTA
ATGTCGAAAG TTATGACGAA TGGAAGCCGT ATTAAGTTTC CGATTAACGA ACCAGCAGAC
GGAAAGAGAA AGTCACAAAT TCAAGAGTAT CTAGAATTCT ATAATGGAGC TGGTGTACAG
CATCTTGCTT TATTAACAAG TGATATTGTT AAAACAGTTG AAGCGCTTCG TGCAAATGGG
GTGGAGTTTT TAGATACACC TGATACTTAT TATGATGAGT TAACTGCACG AGTTGGAAAA
ATCGATGAAG AAATTGATAA GCTAAAAGAA TTAAAGATCT TAGTAGATCG TGATGATGAA
GGTTACTTAC TACAAATCTT TACGAAACCA ATTGTAGATC GCCCGACTTT ATTTATTGAA
ATCATTCAAC GTAAAGGTTC TCGTGGATTT GGTGAAGGAA ACTTTAAAGC GTTATTCGAA
TCAATTGAAA GAGAACAAGA GCGTCGCGGA AACTTATAA
 
Protein sequence
MKQKSMDTLA AQMEDFFPVR DVDHLEFYVG NAKQSSYYLA RAFGFKIVAY SGLETGNREK 
VSYVLVQKNM RFVVSGALSS ENRIAEFVKT HGDGVKDVAL LVDDVDKAYS EAVKRGAVAI
APPEELTDED GTLKKAVIGT YGDTIHTLVE RKNYKGAFMP GFQKVEFNIP FEESGLIAVD
HVVGNVEKME EWVSYYENVM GFKQMIHFDD DDISTEYSAL MSKVMTNGSR IKFPINEPAD
GKRKSQIQEY LEFYNGAGVQ HLALLTSDIV KTVEALRANG VEFLDTPDTY YDELTARVGK
IDEEIDKLKE LKILVDRDDE GYLLQIFTKP IVDRPTLFIE IIQRKGSRGF GEGNFKALFE
SIEREQERRG NL
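
The gene and protein sequences can be related to each other, and to the GC content field, with a short script. This is a minimal sketch in plain Python rather than the method the database itself uses. It assumes the sequence is pasted as shown here, in space-separated blocks of ten. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the sketch always reads the first codon literally (correct here, since the gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, dropping one trailing stop if present."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence above.
print(translate("ATGAAACAAA AATCTATGGA TACGCTAGCT"))
```

Applied to the first 30 bases, this yields the first block of the protein sequence (MKQKSMDTLA). Passing the full gene sequence to gc_fraction and translate allows the GC content and protein fields to be checked the same way.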