Gene Pars_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2021 
Symbol 
ID5054032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1807959 
End bp1810130 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content56% 
IMG OID640469571 
Productaldehyde oxidase and xanthine dehydrogenase, molybdopterin binding 
Protein accessionYP_001154220 
Protein GI145592218 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACG TCGGCAGACC GATACCCAGG TTTGAGGACG ACGTAATTCT CAGCGGACGG 
GCGCAGTACG TCGACGACAT AGTCCTTCCT GGAATGTTGT ACGCGGGATT TGTCCGCTCC
CCCTACGCCC ACGCCAGAGT CCTTAGGGTT GATCTCTCCG ACGCGGCTAA ACAAAAGGGA
GTTGTGGCGG TGTTCGGGCC GGAGGAGATG GGCTTCGCCC CTGGGGGCAA GGTGAGATAC
CAGGGAGAGG CCGTGGCCAT GGTCGTGGCT GGTGACCGCT ATCTTTTATA CGACGCGTTA
GAGAAGGTAG TAGTGGATTA CGAGCCCCTC CCGGCGGTGT TAGACGTCTT TGAGGCCTTG
AGGCCAGGAG CGCCGTTGGT AGACGAAAAC CTCGGCACTA ATATAGCACA TGAAGAGGTG
TATGAAGGAG GCGATGTTGA CAGTGCAATG AGAGAGGCTG AGGTCAAGAT AGAGGAGAGG
CTTACAATAC AACGAGTAGT GCCGGCGGCT ATGGAGCCCC GGGGGGTGGT GGCGGCTTAT
GACGGCGATA TGCTGACTAT TTGGAGCTCT ACCCAAGTGC CTTTTGATAT AAGAAAAGAA
GTGGCCAAGG CGCTTGACAT TCCCCTTGTG AAAGTAAGAG CGGTACAGCC CTTTGTGGGC
GGCGCCTTCG GCTCAAAACT GATAGTCTAC CCCGAGGAGA TATGGGTCTC CAAGGCGGCG
TATTTATTGA AAAGGCCTGT GAAGTGGGTT GCAACTAGAA GCGAGGATTT CAAAACGACT
ACTCACGGCA GGGCGTTAAT ACTAGATTAC AGAGTAGGCG CCACGCGCGA CGGGAGGATT
TTAGCTATTG AGGGGACTGT ATATGCCGAC GCAGGGGCTT ATTACTGGGG GGAGGGGCTG
GCCGATACGG CCGCGAGAAT GCTCCCGGGG CCTTACGATA TACGCAACGG CAGAGTTAAA
GCCGTTGCAG TGTTGACTAA TAAAACTCCG CTTAGCGCGT ACAGGGGGGC CGGCAGGCCC
GAGGCCACGT TTTTTATTGA AAGAATTATG GACCGCCTCG CCGACGAGCT CGGCATAGAC
AGAGTGGAGA TTAGGGAGAG GAATTTAATT CGACAGCTGC CCTATACAAA TGTCTTTGGC
ATTACGTACG ACACCGGCGA CTACCTCACC ACGTTTAAAC AAGGGCTAGA GAGGCTGGGC
TATTCCCAGC TTAAACAGTG GGCTGAGGAG GAGCTTAAAC GCGGACGCGT CGTAGGAGTC
GGCTTCTCGG TATACGTAGA GATTACAACA TTTGGTTACG AAACGGCAAT TCTCAGAGCT
GAGAGAGACG GCACTTTCAC GTTGTACACG GCCCTCACGC CGCACGGCCA GGGCCTGGCC
ACTGCCCTGG CTCAAATAGT CGCCGAGGAG TTAGACGTGC CAATTGAGTC TGTTAAAGTC
GTCTGGGGGG ACACAGCCCT GATATCTGAC GGCATTGGGA CTATGGGCAG CCGGTCAATA
ACAGCTGGGG GCTCGGCGGC AATACTTGCG GCCAGGAGGC TGAAAGAGGA GCTTTTAAAA
GCGGCGCGGA AAGTGTTGGG GTGCGACCCG GAGTACAGCG GCGGGAAGTT TAGTTGCGGG
GGCAAGTCCG CTACAGTTAA AGACGTAGTT AGAGCAGTGT ACAGAGGAGA GGCGGAGGCC
CAGCTCACTG TAGAGGCTAT TTACCACGCA GACTCAACCT TTCCATTCGG CGTGCATTTG
GCCGTGGTAG AGCTGGATCC CGAGACCGGC TTTGTCAAGC CCATGCTCTA CAAGTCCTAT
GACGACGTGG GCGTTGTGGT CAATCCGTTA CTGGCGTCAG GCCAGATCAC CGGCGGCGCG
TTGCAGGGAA TAGCCCAGGC GCTGTATGAA GAGGTCGTTT ACGACGAGAG CGGCAATTTA
ATTACCTCAA ACCTTGCCTT TTATTACGTC CCCACGGCGG CGGAGGCCCC GAAGTACGAG
GTATACTTCG CCGAGAGGCC CCACCCCTCT AGGCACCTCA CCGGCACTAA GGGCATCGGC
GAGGCCGCCA CCATTGCCTC AACCCCCGCC GTCGTCTCGG CGGTCGAGGA CGCGTTGAGG
AGAATCAAGC CGGGGGTCAG AATAGAGAAA ACCCCAGTCA CGCCAGAGGA CGTCTGGCGC
ATGCTGAGGT GA
 
Protein sequence
MKYVGRPIPR FEDDVILSGR AQYVDDIVLP GMLYAGFVRS PYAHARVLRV DLSDAAKQKG 
VVAVFGPEEM GFAPGGKVRY QGEAVAMVVA GDRYLLYDAL EKVVVDYEPL PAVLDVFEAL
RPGAPLVDEN LGTNIAHEEV YEGGDVDSAM REAEVKIEER LTIQRVVPAA MEPRGVVAAY
DGDMLTIWSS TQVPFDIRKE VAKALDIPLV KVRAVQPFVG GAFGSKLIVY PEEIWVSKAA
YLLKRPVKWV ATRSEDFKTT THGRALILDY RVGATRDGRI LAIEGTVYAD AGAYYWGEGL
ADTAARMLPG PYDIRNGRVK AVAVLTNKTP LSAYRGAGRP EATFFIERIM DRLADELGID
RVEIRERNLI RQLPYTNVFG ITYDTGDYLT TFKQGLERLG YSQLKQWAEE ELKRGRVVGV
GFSVYVEITT FGYETAILRA ERDGTFTLYT ALTPHGQGLA TALAQIVAEE LDVPIESVKV
VWGDTALISD GIGTMGSRSI TAGGSAAILA ARRLKEELLK AARKVLGCDP EYSGGKFSCG
GKSATVKDVV RAVYRGEAEA QLTVEAIYHA DSTFPFGVHL AVVELDPETG FVKPMLYKSY
DDVGVVVNPL LASGQITGGA LQGIAQALYE EVVYDESGNL ITSNLAFYYV PTAAEAPKYE
VYFAERPHPS RHLTGTKGIG EAATIASTPA VVSAVEDALR RIKPGVRIEK TPVTPEDVWR
MLR