Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1557 |
Symbol | hutU |
ID | 7184384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3587471 |
End bp | 3589129 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643551484 |
Product | urocanate hydratase |
Protein accession | YP_002447154 |
Protein GI | 218898743 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0743786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0292483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAG TACAACAAAC AATTCGCGCG CCAAGAGGTA CAGAGTTACA AACGAAAGGG TGGGTGCAAG AAGCTGCACT TCGTATGTTA ATGAACAATT TAGATCCTGA AGTTGCTGAA AAACCAGAAG AATTAGTTGT ATATGGCGGA ATTGGCCGTG CAGCTCGTAA CTGGGAAAGC TACAATGCAA TTGTAGATTC ATTAAAAACG TTAGAAAGCG ATGAAACGTT ACTTGTTCAA TCAGGAAAAC CAGTTGCCAT TTTTAAATCA CATGAAGATG CACCGCGCGT TCTGTTAGCG AACTCAAACT TAGTACCAAA ATGGGCGAAT TGGGATCACT TCCGAGAACT AGAGAAAAAA GGACTTATGA TGTACGGACA AATGACAGCT GGTAGCTGGA TTTACATTGG AACACAAGGG ATTCTACAAG GAACATATGA AACATTTGGT GAGGCAGCTC GTCAACATTT CGATGGTTCA TTAAAAGGTA CATTAACACT TACTGCTGGT TTAGGTGGTA TGGGTGGTGC ACAACCTCTT GCTGTAACGA TGAATGGCGG TGTTGTCATT GCTATTGATG TAGATAAGCG CAGCATCGAT CGTCGTATTG AAAAGAGATA TTGTGATAAG TATACAGAAT CATTAGAAGA AGCATTGGCT ATTGCAAACG AGTATAAAGA GAAGAAAGAG CCTATTTCAA TTGGATTATT AGGTAATGCA GCAGAAATTT TACCTGAGTT AGTAAATCGT AATATTATCC CTGACTTAGT TACGGACCAA ACATCTGCTC ATGATCCATT AAACGGTTAT ATTCCAGTAG GTTATACGTT AGAAGAGGCA GCGAAACTTC GTGAAGAAGA CCCAGAACGT TACGTACAAT TATCAAAAGA AAGTATGACA AAGCACGTAG AAGCAATGCT TGCGATGCAA GAAAAAGGCG CAATTACATT TGATTATGGA AATAACATTC GCCAAGTTGC TTTCGATGAA GGTTTGAAAA ATGCTTTCGA TTTCCCAGGA TTTGTTCCAG CATTTATTCG TCCATTATTC TGTGAAGGAA AAGGACCATT CCGCTGGGTA GCTCTTTCTG GTGATCCAGA AGATATTTAT AAAACAGATG AAGTAATTTT ACGAGAATTC GCTGACAATG AGCATTTATG TAACTGGATT CGTATGGCGC GTCAACAAGT GGAGTTCCAA GGGCTTCCAT CACGTATTTG TTGGCTAGGT TACGGTGAGC GTGCGAAGTT TGGCCGCATC ATTAATGAAA TGGTGGCAAA TGGTGAATTA TCAGCACCGA TCGTTATTGG TCGTGACCAT TTAGATTGCG GTTCAGTAGC ATCTCCAAAC CGTGAAACAG AAGCGATGAA AGACGGTAGT GATGCAGTAG CAGACTGGCC AATTTTAAAT GCATTAATTA ATAGTGTAAA CGGTGCGAGT TGGGTATCTG TTCACCACGG TGGCGGCGTT GGTATGGGTT ATTCACTTCA CGCTGGAATG GTTATTGTTG CAGATGGAAC AGAAGCAGCA GCAAAACGTA TTGAGCGCGT ATTAACTTCT GACCCTGGTA TGGGTATTGT TCGTCACGTT GATGCAGGAT ATGACTTAGC AGTGGAAACT GCGAAAGAAA AAGGCGTTAA CATTCCAATG ATGAAATAA
|
Protein sequence | MEKVQQTIRA PRGTELQTKG WVQEAALRML MNNLDPEVAE KPEELVVYGG IGRAARNWES YNAIVDSLKT LESDETLLVQ SGKPVAIFKS HEDAPRVLLA NSNLVPKWAN WDHFRELEKK GLMMYGQMTA GSWIYIGTQG ILQGTYETFG EAARQHFDGS LKGTLTLTAG LGGMGGAQPL AVTMNGGVVI AIDVDKRSID RRIEKRYCDK YTESLEEALA IANEYKEKKE PISIGLLGNA AEILPELVNR NIIPDLVTDQ TSAHDPLNGY IPVGYTLEEA AKLREEDPER YVQLSKESMT KHVEAMLAMQ EKGAITFDYG NNIRQVAFDE GLKNAFDFPG FVPAFIRPLF CEGKGPFRWV ALSGDPEDIY KTDEVILREF ADNEHLCNWI RMARQQVEFQ GLPSRICWLG YGERAKFGRI INEMVANGEL SAPIVIGRDH LDCGSVASPN RETEAMKDGS DAVADWPILN ALINSVNGAS WVSVHHGGGV GMGYSLHAGM VIVADGTEAA AKRIERVLTS DPGMGIVRHV DAGYDLAVET AKEKGVNIPM MK
|
| |