Gene Information |
Locus tag | Pars_2021 |
Symbol | |
ID | 5054032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1807959 |
End bp | 1810130 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469571 |
Product | aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding |
Protein accession | YP_001154220 |
Protein GI | 145592218 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
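The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1807959, 1810130)
print(length)                     # 2172
print(protein_length_aa(length))  # 723
```

Under these assumptions the result matches the Gene Length (2172 bp) and Protein Length (723 aa) fields of this record.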
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACG TCGGCAGACC GATACCCAGG TTTGAGGACG ACGTAATTCT CAGCGGACGG GCGCAGTACG TCGACGACAT AGTCCTTCCT GGAATGTTGT ACGCGGGATT TGTCCGCTCC CCCTACGCCC ACGCCAGAGT CCTTAGGGTT GATCTCTCCG ACGCGGCTAA ACAAAAGGGA GTTGTGGCGG TGTTCGGGCC GGAGGAGATG GGCTTCGCCC CTGGGGGCAA GGTGAGATAC CAGGGAGAGG CCGTGGCCAT GGTCGTGGCT GGTGACCGCT ATCTTTTATA CGACGCGTTA GAGAAGGTAG TAGTGGATTA CGAGCCCCTC CCGGCGGTGT TAGACGTCTT TGAGGCCTTG AGGCCAGGAG CGCCGTTGGT AGACGAAAAC CTCGGCACTA ATATAGCACA TGAAGAGGTG TATGAAGGAG GCGATGTTGA CAGTGCAATG AGAGAGGCTG AGGTCAAGAT AGAGGAGAGG CTTACAATAC AACGAGTAGT GCCGGCGGCT ATGGAGCCCC GGGGGGTGGT GGCGGCTTAT GACGGCGATA TGCTGACTAT TTGGAGCTCT ACCCAAGTGC CTTTTGATAT AAGAAAAGAA GTGGCCAAGG CGCTTGACAT TCCCCTTGTG AAAGTAAGAG CGGTACAGCC CTTTGTGGGC GGCGCCTTCG GCTCAAAACT GATAGTCTAC CCCGAGGAGA TATGGGTCTC CAAGGCGGCG TATTTATTGA AAAGGCCTGT GAAGTGGGTT GCAACTAGAA GCGAGGATTT CAAAACGACT ACTCACGGCA GGGCGTTAAT ACTAGATTAC AGAGTAGGCG CCACGCGCGA CGGGAGGATT TTAGCTATTG AGGGGACTGT ATATGCCGAC GCAGGGGCTT ATTACTGGGG GGAGGGGCTG GCCGATACGG CCGCGAGAAT GCTCCCGGGG CCTTACGATA TACGCAACGG CAGAGTTAAA GCCGTTGCAG TGTTGACTAA TAAAACTCCG CTTAGCGCGT ACAGGGGGGC CGGCAGGCCC GAGGCCACGT TTTTTATTGA AAGAATTATG GACCGCCTCG CCGACGAGCT CGGCATAGAC AGAGTGGAGA TTAGGGAGAG GAATTTAATT CGACAGCTGC CCTATACAAA TGTCTTTGGC ATTACGTACG ACACCGGCGA CTACCTCACC ACGTTTAAAC AAGGGCTAGA GAGGCTGGGC TATTCCCAGC TTAAACAGTG GGCTGAGGAG GAGCTTAAAC GCGGACGCGT CGTAGGAGTC GGCTTCTCGG TATACGTAGA GATTACAACA TTTGGTTACG AAACGGCAAT TCTCAGAGCT GAGAGAGACG GCACTTTCAC GTTGTACACG GCCCTCACGC CGCACGGCCA GGGCCTGGCC ACTGCCCTGG CTCAAATAGT CGCCGAGGAG TTAGACGTGC CAATTGAGTC TGTTAAAGTC GTCTGGGGGG ACACAGCCCT GATATCTGAC GGCATTGGGA CTATGGGCAG CCGGTCAATA ACAGCTGGGG GCTCGGCGGC AATACTTGCG GCCAGGAGGC TGAAAGAGGA GCTTTTAAAA GCGGCGCGGA AAGTGTTGGG GTGCGACCCG GAGTACAGCG GCGGGAAGTT TAGTTGCGGG GGCAAGTCCG CTACAGTTAA AGACGTAGTT AGAGCAGTGT ACAGAGGAGA GGCGGAGGCC CAGCTCACTG TAGAGGCTAT TTACCACGCA GACTCAACCT TTCCATTCGG CGTGCATTTG GCCGTGGTAG AGCTGGATCC CGAGACCGGC TTTGTCAAGC CCATGCTCTA CAAGTCCTAT 
GACGACGTGG GCGTTGTGGT CAATCCGTTA CTGGCGTCAG GCCAGATCAC CGGCGGCGCG TTGCAGGGAA TAGCCCAGGC GCTGTATGAA GAGGTCGTTT ACGACGAGAG CGGCAATTTA ATTACCTCAA ACCTTGCCTT TTATTACGTC CCCACGGCGG CGGAGGCCCC GAAGTACGAG GTATACTTCG CCGAGAGGCC CCACCCCTCT AGGCACCTCA CCGGCACTAA GGGCATCGGC GAGGCCGCCA CCATTGCCTC AACCCCCGCC GTCGTCTCGG CGGTCGAGGA CGCGTTGAGG AGAATCAAGC CGGGGGTCAG AATAGAGAAA ACCCCAGTCA CGCCAGAGGA CGTCTGGCGC ATGCTGAGGT GA
|
Protein sequence | MKYVGRPIPR FEDDVILSGR AQYVDDIVLP GMLYAGFVRS PYAHARVLRV DLSDAAKQKG VVAVFGPEEM GFAPGGKVRY QGEAVAMVVA GDRYLLYDAL EKVVVDYEPL PAVLDVFEAL RPGAPLVDEN LGTNIAHEEV YEGGDVDSAM REAEVKIEER LTIQRVVPAA MEPRGVVAAY DGDMLTIWSS TQVPFDIRKE VAKALDIPLV KVRAVQPFVG GAFGSKLIVY PEEIWVSKAA YLLKRPVKWV ATRSEDFKTT THGRALILDY RVGATRDGRI LAIEGTVYAD AGAYYWGEGL ADTAARMLPG PYDIRNGRVK AVAVLTNKTP LSAYRGAGRP EATFFIERIM DRLADELGID RVEIRERNLI RQLPYTNVFG ITYDTGDYLT TFKQGLERLG YSQLKQWAEE ELKRGRVVGV GFSVYVEITT FGYETAILRA ERDGTFTLYT ALTPHGQGLA TALAQIVAEE LDVPIESVKV VWGDTALISD GIGTMGSRSI TAGGSAAILA ARRLKEELLK AARKVLGCDP EYSGGKFSCG GKSATVKDVV RAVYRGEAEA QLTVEAIYHA DSTFPFGVHL AVVELDPETG FVKPMLYKSY DDVGVVVNPL LASGQITGGA LQGIAQALYE EVVYDESGNL ITSNLAFYYV PTAAEAPKYE VYFAERPHPS RHLTGTKGIG EAATIASTPA VVSAVEDALR RIKPGVRIEK TPVTPEDVWR MLR
|
| |
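The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline used to produce this record: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with the standard code. All function names are hypothetical. The two fragments are copied from the start and end of the gene sequence shown above.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "ATGAAGTACG TCGGCAGACC GATACCCAGG"  # first three blocks of the gene
last_12 = "ATGCTGAGGT GA"                      # final twelve bases, ending in TGA

print(translate(first_30))  # MKYVGRPIPR
print(translate(last_12))   # MLR
```

The two outputs agree with the first ten and last three residues of the protein sequence. Applying gc_fraction to the full gene sequence would give the value to compare against the GC content field.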