Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_3711 |
Symbol | hutU |
ID | 2819187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 3412886 |
End bp | 3414544 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637790445 |
Product | urocanate hydratase |
Protein accession | YP_020344 |
Protein GI | 47528995 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0342728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAG TTAAACAAAC AATTCGCGCG CCAAGAGGTA CTGAGTTACA AACGAAAGGG TGGGTGCAAG AAGCTGCACT TCGTATGTTA ATGAACAATT TAGATCCTGA AGTTGCTGAA AAACCAGAAG AATTAGTTGT ATATGGCGGA ATTGGCCGTG CAGCTCGTAA CTGGGAAAGC TATCAGGCGA TTGTAGATTC ATTAAAAACG TTAGAAAGCG ATGAAACTTT ACTTGTTCAA TCAGGAAAAC CAGTTGCTAT TTTTAAATCA CATGAAGATG CGCCTCGCGT TCTTTTAGCG AACTCAAACT TAGTACCGAA GTGGGCGAAC TGGGATCACT TCCGTGAACT AGAGAAAAAA GGTCTTATGA TGTACGGACA AATGACGGCA GGTAGCTGGA TTTACATCGG AACACAAGGT ATTTTACAAG GAACTTATGA AACGTTTGGT GAAGCGGCGC GTCAACATTT CGGTGGTTCA TTAAAAGGCA CATTAACACT TACTGCTGGT TTAGGTGGTA TGGGTGGTGC ACAACCTCTT GCTGTAACGA TGAACGGCGG TGTTGTTATT GCTATTGATG TTGATAAGCG CAGCATCGAT CGTCGTATTG AAAAGAGATA CTGTGATATG TATACAGAAT CATTAGAAGA AGCGTTAGCG GTTGCGAACG AGTATAAAGA GAAGAAAGAA CCGATTTCTA TTGGTTTATT AGGAAATGCG GCAGAAATTT TACCAGAACT AGTGAAGCGC AATATTACGC CAGACTTGGT TACAGATCAA ACATCTGCTC ATGATCCATT AAACGGTTAT ATTCCAGTAG GCTACACGTT AGAAGAAGCA GCAAAACTTC GTGAAGAAGA TCCAGAACGC TACGTACAAT TATCAAAAGA AAGCATGACA AAACATGTGG AAGCAATGCT TGCTATGCAA GAAAAAGGCG CAATTACATT TGATTACGGA AATAACATTC GCCAAGTTGC TTTTGATGAA GGTTTGAAAA ATGCATTCGA TTTCCCAGGA TTCGTTCCAG CATTTATCCG TCCATTATTC TGCGAAGGAA AAGGACCATT CCGCTGGGTA GCACTTTCTG GTGACCCAGA AGATATTTAT AAAACAGACG AAGTAATTTT ACGTGAGTTC GCGGATAATG AGCATTTATG TAACTGGATT CGTATGGCTC GTCAGCAAGT TGAATTCCAA GGCCTTCCAT CACGTATTTG TTGGCTTGGT TACGGTGAGC GTGCGAAATT TGGCCGCATC ATTAATGAAA TGGTTGCAAA TGGTGAATTA TCAGCACCAA TCGTTATCGG TCGTGACCAT TTAGATTGCG GATCAGTAGC ATCTCCAAAC CGTGAAACAG AAGCGATGAA AGACGGTAGT GATTCAGTAG CTGACTGGCC AATCTTAAAT GCATTAATTA ATAGTGTAAA CGGTGCAAGC TGGGTATCTG TTCACCACGG TGGTGGCGTT GGTATGGGTT ATTCACTTCA TGCAGGAATG GTTATCGTTG CAGATGGAAC AGAAGCAGCA GCAAAACGTA TTGAGCGCGT ATTAACTTCT GACCCTGGTA TGGGTGTTGT TCGTCACGTT GATGCAGGAT ATGACTTAGC TGTGGAAACT GCGAAAGAAA AAGGCGTTAA CATTCCAATG ATGAAATAA
|
Protein sequence | MEKVKQTIRA PRGTELQTKG WVQEAALRML MNNLDPEVAE KPEELVVYGG IGRAARNWES YQAIVDSLKT LESDETLLVQ SGKPVAIFKS HEDAPRVLLA NSNLVPKWAN WDHFRELEKK GLMMYGQMTA GSWIYIGTQG ILQGTYETFG EAARQHFGGS LKGTLTLTAG LGGMGGAQPL AVTMNGGVVI AIDVDKRSID RRIEKRYCDM YTESLEEALA VANEYKEKKE PISIGLLGNA AEILPELVKR NITPDLVTDQ TSAHDPLNGY IPVGYTLEEA AKLREEDPER YVQLSKESMT KHVEAMLAMQ EKGAITFDYG NNIRQVAFDE GLKNAFDFPG FVPAFIRPLF CEGKGPFRWV ALSGDPEDIY KTDEVILREF ADNEHLCNWI RMARQQVEFQ GLPSRICWLG YGERAKFGRI INEMVANGEL SAPIVIGRDH LDCGSVASPN RETEAMKDGS DSVADWPILN ALINSVNGAS WVSVHHGGGV GMGYSLHAGM VIVADGTEAA AKRIERVLTS DPGMGVVRHV DAGYDLAVET AKEKGVNIPM MK
|
| |