Gene Pars_1484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1484 
Symbol 
ID5054242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1342669 
End bp1344519 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content51% 
IMG OID640469024 
Producturocanate hydratase 
Protein accessionYP_001153693 
Protein GI145591691 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTTC CGAGTAAATA CAAGGGGAGG CCCATTGAGG AGCTCATTTC TGCGGGGTAT 
TACAATCCTG AGACCCGCTC TGTAAAGGCA ATAAAGGGGT ACGACTTCCA CGTCTGGAGT
AAAGATTGGC AGATAGAAGG AGTTCTGAGA ATGTTGTTCC ACGTCTTAGA TCCTGAGGTC
GCAAAAGATC CCAAAAATCT CATAGTATAC GGCGGGAGCG GCAAAGCCGC GAGGAGTTGG
GATGATTTCG AGGCTATTGT AGACACACTG ATATCTATGG ATAGGGAGGA TACTTTGGTA
ATACAATCTG GCCAGCCTGT GGCTGTGTTT AAAACCGATT TGCGTGCCCC ACGTGTTTTG
ATGAGTAACG CCGTTTTAGT GCCTAAGTGG GCTGATTGGA AGTATTTCTG GGAGCTGGAG
GCGCGGGGGC TTATCTCGTA TCACCAAATG ACCGCGGGGT GTTGGGCCTA TATCGGGACA
CAAGGGATCC TACAGGGGAC TTACGAGACT ATTGGCTTTG CTGCTGAGAG GCACTTCGGC
GGCTCTCTTG AGGGTAGACT AGTAGTAAGC GCCGGGCTTG GAGAAATGGG CGGGGCCCAG
CCTCTGGCAA TTAAAATGCT AGGTGGCGTC GCGCTGATAG CCGATGTGGA TCGTAGGATG
ATCGAGAGGA GGATAGCGAC GGGCTATTTA GATACTTGGA CTGACAATGT GGACAAAGCC
ATTGACATGG CTTTAAGAGC CAAGGAGAAG CGCGAGGCGA TTAGCATCGG CGTGTTGGCA
AATGCCGTTG ATTTGCATGA GAAGCTTGTA AAGGAACAGA TAGTGCCCGA TCTTGTCACT
GATCAAACAC CTGCCCACGA CCCCCTCGCC TATGTGCCTG CTGGCCTCAC TGTGGAGGAG
GCCGAGAGGC TTAGGAAATT AGACCCTGAT AGATACGTAC AACTCTCTAA GCGGTCTATG
GCGAGGCATG TGGAGCTTTT GCTAACTCAC CTAATGCGCG GCGCCGTGGT TTTTGAATAT
GGGAATAACC TCAGGAAACA AGCCTACGAC GCGGGGGTTG AGCAGGCGTT TAAAATACCT
GGGCAGATGG AGTATCTAAG ACCTATGTTT GAAGAAGGGA GGGGACCATT TAGATGGACG
AGCCTTGTGG GGGAGCCAAA AGATATCTAC AAGCTCGACG ATGTGATTCT TACCGTCTAC
AGCAGGAACT GGAGACTTGT AAGGTGGATT CAAAACGCCA AGAAGTATGT CAAGTTCCAG
GGGTTGCCCG CAAGAGTGGT TTACCTAGGA TATGGGGAAC GCGCAGAATT TGGGAAAATC
GTAAGCGAGA TGGTTAGGAG AGGCGAGTTA TCTGGCCCAA TTTGGTTTGG TAGAGACCAC
TTAGACACTG GTTCTGTGGC TTCCCCGTTT AGAGAAACTG AGGGGATGCT GGACGGTAGC
GACGCCGTAG GAGATTGGCC TGTGCTAAAC TACGCTCTTA ACACCGCGGT GGGAGCTACT
TGGACGTGCT TCCACCACGG AGGCGGCGTT GGGATTGGCT ATTCTCTTCA CTGTGGATTT
GGCATGGTGG TCGATGGTAC ACAGCTGGCG GAGGAGAAGG CCTTGAGGGT GTTCACAGTA
GATCCCGGGA TTGGAGTCGT GAGGCACGCC CATGCGGGGT ATCCAAGAGC CTTAAAAACT
GCTCTTACGA AAGGGGTTAG AATTCCGATA CATAAGAGGC TAGAAGAAAA ATCGTTGCGT
GTAGTTGAAG AAGCTTGGCG CGAGGGGAGA ATAAGCGAAT ACACCTACAA GAGGGTAAAG
GAGGAGTGGA AGGAATACGA GGAGGTTAAG AAAAACTTAG AGAAACCGTA G
 
Protein sequence
MSVPSKYKGR PIEELISAGY YNPETRSVKA IKGYDFHVWS KDWQIEGVLR MLFHVLDPEV 
AKDPKNLIVY GGSGKAARSW DDFEAIVDTL ISMDREDTLV IQSGQPVAVF KTDLRAPRVL
MSNAVLVPKW ADWKYFWELE ARGLISYHQM TAGCWAYIGT QGILQGTYET IGFAAERHFG
GSLEGRLVVS AGLGEMGGAQ PLAIKMLGGV ALIADVDRRM IERRIATGYL DTWTDNVDKA
IDMALRAKEK REAISIGVLA NAVDLHEKLV KEQIVPDLVT DQTPAHDPLA YVPAGLTVEE
AERLRKLDPD RYVQLSKRSM ARHVELLLTH LMRGAVVFEY GNNLRKQAYD AGVEQAFKIP
GQMEYLRPMF EEGRGPFRWT SLVGEPKDIY KLDDVILTVY SRNWRLVRWI QNAKKYVKFQ
GLPARVVYLG YGERAEFGKI VSEMVRRGEL SGPIWFGRDH LDTGSVASPF RETEGMLDGS
DAVGDWPVLN YALNTAVGAT WTCFHHGGGV GIGYSLHCGF GMVVDGTQLA EEKALRVFTV
DPGIGVVRHA HAGYPRALKT ALTKGVRIPI HKRLEEKSLR VVEEAWREGR ISEYTYKRVK
EEWKEYEEVK KNLEKP