Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1484 |
Symbol | |
ID | 5054242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1342669 |
End bp | 1344519 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640469024 |
Product | urocanate hydratase |
Protein accession | YP_001153693 |
Protein GI | 145591691 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTTC CGAGTAAATA CAAGGGGAGG CCCATTGAGG AGCTCATTTC TGCGGGGTAT TACAATCCTG AGACCCGCTC TGTAAAGGCA ATAAAGGGGT ACGACTTCCA CGTCTGGAGT AAAGATTGGC AGATAGAAGG AGTTCTGAGA ATGTTGTTCC ACGTCTTAGA TCCTGAGGTC GCAAAAGATC CCAAAAATCT CATAGTATAC GGCGGGAGCG GCAAAGCCGC GAGGAGTTGG GATGATTTCG AGGCTATTGT AGACACACTG ATATCTATGG ATAGGGAGGA TACTTTGGTA ATACAATCTG GCCAGCCTGT GGCTGTGTTT AAAACCGATT TGCGTGCCCC ACGTGTTTTG ATGAGTAACG CCGTTTTAGT GCCTAAGTGG GCTGATTGGA AGTATTTCTG GGAGCTGGAG GCGCGGGGGC TTATCTCGTA TCACCAAATG ACCGCGGGGT GTTGGGCCTA TATCGGGACA CAAGGGATCC TACAGGGGAC TTACGAGACT ATTGGCTTTG CTGCTGAGAG GCACTTCGGC GGCTCTCTTG AGGGTAGACT AGTAGTAAGC GCCGGGCTTG GAGAAATGGG CGGGGCCCAG CCTCTGGCAA TTAAAATGCT AGGTGGCGTC GCGCTGATAG CCGATGTGGA TCGTAGGATG ATCGAGAGGA GGATAGCGAC GGGCTATTTA GATACTTGGA CTGACAATGT GGACAAAGCC ATTGACATGG CTTTAAGAGC CAAGGAGAAG CGCGAGGCGA TTAGCATCGG CGTGTTGGCA AATGCCGTTG ATTTGCATGA GAAGCTTGTA AAGGAACAGA TAGTGCCCGA TCTTGTCACT GATCAAACAC CTGCCCACGA CCCCCTCGCC TATGTGCCTG CTGGCCTCAC TGTGGAGGAG GCCGAGAGGC TTAGGAAATT AGACCCTGAT AGATACGTAC AACTCTCTAA GCGGTCTATG GCGAGGCATG TGGAGCTTTT GCTAACTCAC CTAATGCGCG GCGCCGTGGT TTTTGAATAT GGGAATAACC TCAGGAAACA AGCCTACGAC GCGGGGGTTG AGCAGGCGTT TAAAATACCT GGGCAGATGG AGTATCTAAG ACCTATGTTT GAAGAAGGGA GGGGACCATT TAGATGGACG AGCCTTGTGG GGGAGCCAAA AGATATCTAC AAGCTCGACG ATGTGATTCT TACCGTCTAC AGCAGGAACT GGAGACTTGT AAGGTGGATT CAAAACGCCA AGAAGTATGT CAAGTTCCAG GGGTTGCCCG CAAGAGTGGT TTACCTAGGA TATGGGGAAC GCGCAGAATT TGGGAAAATC GTAAGCGAGA TGGTTAGGAG AGGCGAGTTA TCTGGCCCAA TTTGGTTTGG TAGAGACCAC TTAGACACTG GTTCTGTGGC TTCCCCGTTT AGAGAAACTG AGGGGATGCT GGACGGTAGC GACGCCGTAG GAGATTGGCC TGTGCTAAAC TACGCTCTTA ACACCGCGGT GGGAGCTACT TGGACGTGCT TCCACCACGG AGGCGGCGTT GGGATTGGCT ATTCTCTTCA CTGTGGATTT GGCATGGTGG TCGATGGTAC ACAGCTGGCG GAGGAGAAGG CCTTGAGGGT GTTCACAGTA GATCCCGGGA TTGGAGTCGT GAGGCACGCC CATGCGGGGT ATCCAAGAGC CTTAAAAACT GCTCTTACGA AAGGGGTTAG AATTCCGATA CATAAGAGGC TAGAAGAAAA ATCGTTGCGT GTAGTTGAAG AAGCTTGGCG CGAGGGGAGA ATAAGCGAAT ACACCTACAA GAGGGTAAAG GAGGAGTGGA AGGAATACGA GGAGGTTAAG AAAAACTTAG AGAAACCGTA G
|
Protein sequence | MSVPSKYKGR PIEELISAGY YNPETRSVKA IKGYDFHVWS KDWQIEGVLR MLFHVLDPEV AKDPKNLIVY GGSGKAARSW DDFEAIVDTL ISMDREDTLV IQSGQPVAVF KTDLRAPRVL MSNAVLVPKW ADWKYFWELE ARGLISYHQM TAGCWAYIGT QGILQGTYET IGFAAERHFG GSLEGRLVVS AGLGEMGGAQ PLAIKMLGGV ALIADVDRRM IERRIATGYL DTWTDNVDKA IDMALRAKEK REAISIGVLA NAVDLHEKLV KEQIVPDLVT DQTPAHDPLA YVPAGLTVEE AERLRKLDPD RYVQLSKRSM ARHVELLLTH LMRGAVVFEY GNNLRKQAYD AGVEQAFKIP GQMEYLRPMF EEGRGPFRWT SLVGEPKDIY KLDDVILTVY SRNWRLVRWI QNAKKYVKFQ GLPARVVYLG YGERAEFGKI VSEMVRRGEL SGPIWFGRDH LDTGSVASPF RETEGMLDGS DAVGDWPVLN YALNTAVGAT WTCFHHGGGV GIGYSLHCGF GMVVDGTQLA EEKALRVFTV DPGIGVVRHA HAGYPRALKT ALTKGVRIPI HKRLEEKSLR VVEEAWREGR ISEYTYKRVK EEWKEYEEVK KNLEKP
|
| |