Gene BCG9842_B1557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1557 
SymbolhutU 
ID7184384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3587471 
End bp3589129 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content41% 
IMG OID643551484 
Producturocanate hydratase 
Protein accessionYP_002447154 
Protein GI218898743 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0743786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.0292483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAG TACAACAAAC AATTCGCGCG CCAAGAGGTA CAGAGTTACA AACGAAAGGG 
TGGGTGCAAG AAGCTGCACT TCGTATGTTA ATGAACAATT TAGATCCTGA AGTTGCTGAA
AAACCAGAAG AATTAGTTGT ATATGGCGGA ATTGGCCGTG CAGCTCGTAA CTGGGAAAGC
TACAATGCAA TTGTAGATTC ATTAAAAACG TTAGAAAGCG ATGAAACGTT ACTTGTTCAA
TCAGGAAAAC CAGTTGCCAT TTTTAAATCA CATGAAGATG CACCGCGCGT TCTGTTAGCG
AACTCAAACT TAGTACCAAA ATGGGCGAAT TGGGATCACT TCCGAGAACT AGAGAAAAAA
GGACTTATGA TGTACGGACA AATGACAGCT GGTAGCTGGA TTTACATTGG AACACAAGGG
ATTCTACAAG GAACATATGA AACATTTGGT GAGGCAGCTC GTCAACATTT CGATGGTTCA
TTAAAAGGTA CATTAACACT TACTGCTGGT TTAGGTGGTA TGGGTGGTGC ACAACCTCTT
GCTGTAACGA TGAATGGCGG TGTTGTCATT GCTATTGATG TAGATAAGCG CAGCATCGAT
CGTCGTATTG AAAAGAGATA TTGTGATAAG TATACAGAAT CATTAGAAGA AGCATTGGCT
ATTGCAAACG AGTATAAAGA GAAGAAAGAG CCTATTTCAA TTGGATTATT AGGTAATGCA
GCAGAAATTT TACCTGAGTT AGTAAATCGT AATATTATCC CTGACTTAGT TACGGACCAA
ACATCTGCTC ATGATCCATT AAACGGTTAT ATTCCAGTAG GTTATACGTT AGAAGAGGCA
GCGAAACTTC GTGAAGAAGA CCCAGAACGT TACGTACAAT TATCAAAAGA AAGTATGACA
AAGCACGTAG AAGCAATGCT TGCGATGCAA GAAAAAGGCG CAATTACATT TGATTATGGA
AATAACATTC GCCAAGTTGC TTTCGATGAA GGTTTGAAAA ATGCTTTCGA TTTCCCAGGA
TTTGTTCCAG CATTTATTCG TCCATTATTC TGTGAAGGAA AAGGACCATT CCGCTGGGTA
GCTCTTTCTG GTGATCCAGA AGATATTTAT AAAACAGATG AAGTAATTTT ACGAGAATTC
GCTGACAATG AGCATTTATG TAACTGGATT CGTATGGCGC GTCAACAAGT GGAGTTCCAA
GGGCTTCCAT CACGTATTTG TTGGCTAGGT TACGGTGAGC GTGCGAAGTT TGGCCGCATC
ATTAATGAAA TGGTGGCAAA TGGTGAATTA TCAGCACCGA TCGTTATTGG TCGTGACCAT
TTAGATTGCG GTTCAGTAGC ATCTCCAAAC CGTGAAACAG AAGCGATGAA AGACGGTAGT
GATGCAGTAG CAGACTGGCC AATTTTAAAT GCATTAATTA ATAGTGTAAA CGGTGCGAGT
TGGGTATCTG TTCACCACGG TGGCGGCGTT GGTATGGGTT ATTCACTTCA CGCTGGAATG
GTTATTGTTG CAGATGGAAC AGAAGCAGCA GCAAAACGTA TTGAGCGCGT ATTAACTTCT
GACCCTGGTA TGGGTATTGT TCGTCACGTT GATGCAGGAT ATGACTTAGC AGTGGAAACT
GCGAAAGAAA AAGGCGTTAA CATTCCAATG ATGAAATAA
 
Protein sequence
MEKVQQTIRA PRGTELQTKG WVQEAALRML MNNLDPEVAE KPEELVVYGG IGRAARNWES 
YNAIVDSLKT LESDETLLVQ SGKPVAIFKS HEDAPRVLLA NSNLVPKWAN WDHFRELEKK
GLMMYGQMTA GSWIYIGTQG ILQGTYETFG EAARQHFDGS LKGTLTLTAG LGGMGGAQPL
AVTMNGGVVI AIDVDKRSID RRIEKRYCDK YTESLEEALA IANEYKEKKE PISIGLLGNA
AEILPELVNR NIIPDLVTDQ TSAHDPLNGY IPVGYTLEEA AKLREEDPER YVQLSKESMT
KHVEAMLAMQ EKGAITFDYG NNIRQVAFDE GLKNAFDFPG FVPAFIRPLF CEGKGPFRWV
ALSGDPEDIY KTDEVILREF ADNEHLCNWI RMARQQVEFQ GLPSRICWLG YGERAKFGRI
INEMVANGEL SAPIVIGRDH LDCGSVASPN RETEAMKDGS DAVADWPILN ALINSVNGAS
WVSVHHGGGV GMGYSLHAGM VIVADGTEAA AKRIERVLTS DPGMGIVRHV DAGYDLAVET
AKEKGVNIPM MK