Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0620 |
Symbol | |
ID | 5056195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 551458 |
End bp | 554082 |
Gene Length | 2625 bp |
Protein Length | 874 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468178 |
Product | aconitate hydratase |
Protein accession | YP_001152863 |
Protein GI | 145590861 |
COG category | [C] Energy production and conversion |
COG ID | [COG1048] Aconitase A |
TIGRFAM ID | [TIGR01341] aconitate hydratase 1 |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.343199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.507089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGA AGTTTGAAGT GAGGGGCGTT GCCTATAGGT ACTACCCCCT GAAATCTCTG GAAAGGGAGG GCTACGACGT GGGGCGACTT CCCTACTCCA TTAGAGTGCT TTTAGAAAAC GTCCTGCGCA ACATGGACGG TAGGGACATC ACCAAGGAGC ACCTGGAGAG GCTCGCCCGG TGGAACCCCA AGGCGCCAGA GGGGGAGGTG GCCATTAAGA TATCTAGAGT CGTGATGCAG GACTACACCG GCGTCCCCGC CATCGTAGAC CTTGCCACTA TGCGGGAAAT CGCGAAGAAG ATGGGGAGAG ACCCCACAGT AGTTAACCCA CAAGTCCCCG TCGACCTCAT CATCGATCAC TCTGTGCAGG TGGACTTCTG GGGGTCAAGG GAGGCGCTTA GGCTAAACCT TGACCTCGAG ATTAAGAGGA ATAGGGAGAG GTATAGGTTT TTGAAGTGGG CACAGCAAGC CTTTAAGAAC CTCCGCGTCT TCCCGCCGGG GACGGGCATT ATCCACCAGG TGAACCTCGA GTACCTGGCC AAGGTTGTGA TGACCGACGG CGACCTTGCC TTCTTTGAGA CTCTCGTGGG CATGGACAGC CACACCACGA TGATAAACGG GCTGGGAGTA GTGGGCTGGG GAGTAGGCGG CGTGGAGGCG GAGGCCGCCA TGCTGGGCGA GCCCATAACT ATCAAGGTGC CGAGAGTGGT GGGGGTCCAC CTATACGGCG AGCTGAGGCC CGGAGTCACC GCAACAGACG TGGTGCTGGC AATAACCGAG TTCCTCAGGA AGGTCAACGT GGTTGACGCC TTTGTCGAGT TCTTCGGCGA GGGGGTGAAG AAGCTGTCCG TCCCAGACCG CGCCACGATT GCCAACATGG CGCCGGAGTA CGGCTCCACC ACTGGCCTCT TCCCCGTAGA CGAGAACACC CTTTCCTATC TAAGAGCCAC GGGTCGGCCG GAGGCCCACA TAGCGCTGGT GAGGAAGTAC TACGAGCTCC AGGGAGTCTT CGGCGGCGTC GAGGGGGCGG AGTACAGCCA AGTGGTGGAC TTCGACCTCT CAGCCGTGGA GAGGAACGTG GCTGGCCCCA CGCTACCGTG GCAGAGAACT AGCCTAGCGG ATGTGCCGAA GAGCTTCGCG GTGTTTTTAC AAGAGCGTAA GAAGAGGACT GCCAGGAAGG CGGTGGAGAT TGAGATTGAC GGCAGGAGGG CGGAGTTCGG CGATGGAGAT GTGGTGATCG CCGCCATAAC TAGCTGCACC AACACCAGCA ACCCCTACCT CCTCGTGGCG GCGGGTCTGG TGGCCAAGAG GGCGGTTGAA CTTGGCTTGA GACCGCCGCC TTTCGTAAAG ACGAGCTTCG CCCCGGGGTC GAGGGCAGTC GCCGACTTGC TGGAGAGAAG CGGGTTGCAG AAGTACTTGG ACCAGCTTGG CTTTAGTGTA GTGGCCTTTG GCTGTACCAC ATGTATCGGC AACTCGGGAC CCCTCCCTGA GCCCGTCTCT AGGGCCATTA AGCAACACGA CATATTAGCA ACGGCCGTTT TGTCGGGCAA TAGGAACTTC GAGGCAAGGG TCCACCCAGA TGTCCGCGCT GCCTACCTCG CCTCGCCGCC ACTTGTAGTG GCCTACGCTC TCGCCGGCAA CGTGTGGAAA AACCTAGAGA AAGACCCCCT AGGCCACGCA AGCGATGGGA GGCCGGTCTA CCTAAAAGAT CTGTGGCCAA GCCCCGAGGA GGTGAACAGA GTGGTGGAGG AGTGGCTAGA TCCGAAAATA TACGTCGAGA AGTACGGCAA GGTCGGCGAG CTGGTCCCCG AGTGGCAGGC ACTTGAGGCG CCAGGCGGCA TACTTTACGA CTGGCGGCCG GACGACACCT ACATACAGCC CTCGCCGCTC TTCGAAGGCG AGGTAAAAGT GAGCGACATC ACCGGGGCGA GGCCTCTGCT GATCCTAGGC GACAGCATCA CCACAGACCA CATCTCCCCA GCCGGCGGAA TAACCCAGGA CAACCCCGCC GGGCAGTACC TAATGTCCCT GGGAGTGAAG CCCGCAGACT TCAACACCTT CGGCGCGAGG AGGGGCAACT GGCAGGTGAT GGTTAGGGGC ACCTTCTCCA GCAAGGGGTA CAGGAACAAG ATAGGAAACC TAGAGGGTGG GCTCACCGTC AAGTTCCCCG AGGGCAAAGT CTTGACTGTA TACGAGGCGG CAGAAGCCTA CAAGAAAGAG GGCACGCCGG TTATTGTAGT CGCTGGCAAG AACTACGGCG CTGGGTCGAG CCGCGATTGG GCCGCCAAGG GGCCAAAGCT CTTGGGCGTA AGGGCGGTAA TCGCGGAGAG CTTCGAAAGG ATACACAGGT CAAACCTCAC GATGGTGGGA ATTATACCAA TCCAGCTACC GCCAGGCGTA ACAGTGGACA GCCTCGGCTT AGACGGCACT GAGACCTTCG ACATAATGGG GCTGTCGGAG CTCGCACCTG GGAAGGAAGT CGTCATAAGG ATACACAGAA AAGACGGCCG CGTCGACGAG GTTAAGGCAA GGCTAGCCGT CTACACGTGG GCAGAGGTGG AATACATCAA ACACGGCGGA ATACTCCCCT ACGTCTTAAA GAAGCTGTTT CAGAAAACGT TTTAA
|
Protein sequence | MAEKFEVRGV AYRYYPLKSL EREGYDVGRL PYSIRVLLEN VLRNMDGRDI TKEHLERLAR WNPKAPEGEV AIKISRVVMQ DYTGVPAIVD LATMREIAKK MGRDPTVVNP QVPVDLIIDH SVQVDFWGSR EALRLNLDLE IKRNRERYRF LKWAQQAFKN LRVFPPGTGI IHQVNLEYLA KVVMTDGDLA FFETLVGMDS HTTMINGLGV VGWGVGGVEA EAAMLGEPIT IKVPRVVGVH LYGELRPGVT ATDVVLAITE FLRKVNVVDA FVEFFGEGVK KLSVPDRATI ANMAPEYGST TGLFPVDENT LSYLRATGRP EAHIALVRKY YELQGVFGGV EGAEYSQVVD FDLSAVERNV AGPTLPWQRT SLADVPKSFA VFLQERKKRT ARKAVEIEID GRRAEFGDGD VVIAAITSCT NTSNPYLLVA AGLVAKRAVE LGLRPPPFVK TSFAPGSRAV ADLLERSGLQ KYLDQLGFSV VAFGCTTCIG NSGPLPEPVS RAIKQHDILA TAVLSGNRNF EARVHPDVRA AYLASPPLVV AYALAGNVWK NLEKDPLGHA SDGRPVYLKD LWPSPEEVNR VVEEWLDPKI YVEKYGKVGE LVPEWQALEA PGGILYDWRP DDTYIQPSPL FEGEVKVSDI TGARPLLILG DSITTDHISP AGGITQDNPA GQYLMSLGVK PADFNTFGAR RGNWQVMVRG TFSSKGYRNK IGNLEGGLTV KFPEGKVLTV YEAAEAYKKE GTPVIVVAGK NYGAGSSRDW AAKGPKLLGV RAVIAESFER IHRSNLTMVG IIPIQLPPGV TVDSLGLDGT ETFDIMGLSE LAPGKEVVIR IHRKDGRVDE VKARLAVYTW AEVEYIKHGG ILPYVLKKLF QKTF
|
| |