Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1282 |
Symbol | |
ID | 7976063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1332066 |
End bp | 1333724 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644798226 |
Product | urocanate hydratase |
Protein accession | YP_002949399 |
Protein GI | 239826775 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0391092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAACGA AACACCGGCC AGTGCAAGCA TACACCGGTT CTACTTTGCA TGCGAAAGGC TGGATTCAAG AGGCCGCGTT ACGAATGCTG AACAACAACT TACATCCAGA GGTGGCGGAG CGCCCGGAAG ATTTGGTTGT CTACGGCGGC ATCGGCAAAG CGGCGCGCAA TTGGGAGTGC TACGGGGCGA TTGTCGAAAC GCTATTAAAC TTAGAAAACG ATGAAACGCT GCTCATTCAG TCGGGAAAGC CAGTCGCGGT ATTTAAAACG CACACGGATG CGCCAAGGGT GTTAATCGCC AACTCGAATC TTGTGCCTGC TTGGGCGACG TGGGATCATT TCCACGAACT TGATAAAAAA GGATTAATCA TGTATGGCCA AATGACGGCA GGAAGCTGGA TTTACATTGG CAGCCAAGGC ATCGTGCAAG GCACGTACGA AACGTTTGCC GAGGTGGCGC GCCAGCACTA TGGCGGCACG CTAAAAGGAA CAATTACGGT GACGGCAGGT CTTGGCGGCA TGGGGGGAGC ACAGCCGCTC GCCGTCACGT TAAACGGCGG CGTCTGCATT GCCGTCGAAG TGGACCCAGC CCGCATCCAG CGCCGCATTG ACACGAAATA TTTAGACACG ATGACCGATC GTCTCGATGT GGCGATCCAG ATGGCGAAAA GGGCGAAGGA AGAAGGAAAA GCGCTATCGA TCGGCCTGCT TGGCAACGCG GCAGAAGTGC TGCCGAAAAT GATCGAAATC GGCTTTATTC CGGACGTGTT GACAGACCAG ACGTCCGCCC ACGATCCGCT TAACGGCTAC ATTCCGGCGG GCATGACGCT CGAGGAAGCG GCTGAGCTGC GCCAGCGCGA TCCGAAGCAG TATATCCGCC GCGCCAAACA ATCGATCGCC GAACATGTCA AAGCGATGCT CGCCATGCAG AAACAAGGCT CAGTGACATT CGATTACGGC AACAATATCC GCCAAGTCGC GAAAGATGAA GGAGTGGAAG AGGCGTTTAA TTTTCCAGGT TTTGTTCCCG CCTACATCCG TCCGCTCTTT TGCGAAGGGA AAGGGCCGTT CCGCTGGGTG GCCCTGTCAG GAGATCCGGA AGACATCTAC AAAACCGATG AAGTCATTTT GCGCGAATTC AGCGACAACC AACATTTGTG CAACTGGATC CGCATGGCGC GGGAAAAAAT CCAGTTTCAA GGGCTGCCGG CGCGCATCTG CTGGCTCGGC TACGGCGAAC GGGCGAAATT TGGCAAAATC ATTAACGACA TGGTGGCGAA AGGCGAGCTG AAAGCGCCGA TCGTCATCGG CCGCGATCAT TTGGATTCCG GTTCTGTCGC CTCGCCAAAC CGCGAAACGG AAGGAATGAA AGACGGCAGT GACGCGATCG CCGACTGGCC GATTTTAAAC GCGCTTCTTA ACGCGGTTGG CGGTGCAAGC TGGGTATCGG TGCACCATGG CGGCGGCGTC GGCATGGGCT ATTCGATTCA TGCGGGAATG GTCATTGTCG CCGATGGCAC GAAAGAAGCG GAAAAACGGC TCGAGCGCGT CTTGACGACC GACCCGGGCC TTGGCGTTGT CCGCCACGCT GATGCTGGCT ATGAACTCGC TATCAAAACG GCGAAAGAAA AAGGCATCCA TATGCCGATG CTGAAATAA
|
Protein sequence | MVTKHRPVQA YTGSTLHAKG WIQEAALRML NNNLHPEVAE RPEDLVVYGG IGKAARNWEC YGAIVETLLN LENDETLLIQ SGKPVAVFKT HTDAPRVLIA NSNLVPAWAT WDHFHELDKK GLIMYGQMTA GSWIYIGSQG IVQGTYETFA EVARQHYGGT LKGTITVTAG LGGMGGAQPL AVTLNGGVCI AVEVDPARIQ RRIDTKYLDT MTDRLDVAIQ MAKRAKEEGK ALSIGLLGNA AEVLPKMIEI GFIPDVLTDQ TSAHDPLNGY IPAGMTLEEA AELRQRDPKQ YIRRAKQSIA EHVKAMLAMQ KQGSVTFDYG NNIRQVAKDE GVEEAFNFPG FVPAYIRPLF CEGKGPFRWV ALSGDPEDIY KTDEVILREF SDNQHLCNWI RMAREKIQFQ GLPARICWLG YGERAKFGKI INDMVAKGEL KAPIVIGRDH LDSGSVASPN RETEGMKDGS DAIADWPILN ALLNAVGGAS WVSVHHGGGV GMGYSIHAGM VIVADGTKEA EKRLERVLTT DPGLGVVRHA DAGYELAIKT AKEKGIHMPM LK
|
| |