Gene BCG9842_B3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B3878 
Symbol 
ID7182619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp1349536 
End bp1350552 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content35% 
IMG OID643549185 
Producthistidinol-phosphatase 
Protein accessionYP_002444855 
Protein GI218896444 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family 
TIGRFAM ID[TIGR01856] histidinol phosphate phosphatase HisJ family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAG ATTATCATCT TCACTTAGAA GAAGGACCGT ATTCAATAAG ATGGCTTGCT 
AAAATAAATG AAGCGTTGGA ATGTTATGAA CCGCTTCAAG AAAGGCATTC TATTGATTGG
CTTATGAAAA CACAAGAGCG TTTGCAAAAG CGTGTGAAGG AAGGGCCATT TACAAAAGAA
TGGATGGATC TCTATTTAGA AGAAGCTGTG CGAAAAGGAA TAAAAGAAGT GGGAATTGTT
GATCATCTAT ATCGTTTTCA TGAAGCGAAA GGATATTATG AAAAATATGT AGATATTAGT
GATTCTAAGC TTGGTCGTAT ACAGAAGGAA TGGTTAGATC AAGTAAGGGT AGTTTCACTG
TATGATTTTA CAAAGGCAAT TGAAGAAGCG AAAGAACGAT GGAGTAAAAG AGGTGTCACA
CTTAAACTTG GAATTGAAGC GGATTATTTT CTTGATTGTG AAGGAGAGTT AAAAGAATTA
TTAGCGCTAG GAGACTTTGA TTATGTAATT GGTTCCGTTC ATTTTCTAAA TGGCTGGGGA
TTTGATAATC CGGATACGAA AGAATATTTT GAGGTGCATG ACCTACGCAC ATTATACGAT
ACGTTTTTTA AAACAGTTGA GAGTGCGATT CATACAGAGT TATTTGATAT TATCGCTCAT
CTTGATAATA TAAAAGTATT TAATTATCGA TTGGATGAGA ATGAACAGCT TTCTTATTAT
AAGGAAATTG CCTGTGCGTT AGTAGAAACG AATACGGCAA CAGAAATAAA TGCAGGACTG
TACTATCGTT ACCCTGTTCG TGAGATGTGC CCAAGTCCAC TATATTTACA AGTATTAGCT
AAGTATGGTG TTCCAATTAC GATTTCTTCG GATGCCCATT ATCCAAATGA TTTAGGGAAT
TATGTACAAG AAAATGTACA AACATTACGA GCTCACGGTG TTACTCAGGT CGCAACATTT
ACGAAGCGAG CAAGAGTAAT GAGGTTGCTT GAAGAAGAAG TAACAAATTT AAAATGA
 
Protein sequence
MKVDYHLHLE EGPYSIRWLA KINEALECYE PLQERHSIDW LMKTQERLQK RVKEGPFTKE 
WMDLYLEEAV RKGIKEVGIV DHLYRFHEAK GYYEKYVDIS DSKLGRIQKE WLDQVRVVSL
YDFTKAIEEA KERWSKRGVT LKLGIEADYF LDCEGELKEL LALGDFDYVI GSVHFLNGWG
FDNPDTKEYF EVHDLRTLYD TFFKTVESAI HTELFDIIAH LDNIKVFNYR LDENEQLSYY
KEIACALVET NTATEINAGL YYRYPVREMC PSPLYLQVLA KYGVPITISS DAHYPNDLGN
YVQENVQTLR AHGVTQVATF TKRARVMRLL EEEVTNLK