Gene Pars_0925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0925 
Symbol 
ID5055712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp817467 
End bp820655 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content56% 
IMG OID640468481 
Productmolybdopterin oxidoreductase 
Protein accessionYP_001153157 
Protein GI145591155 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTTA CTAGACGCGA TGTTTTAAAA ACCGGCGTCG CCATAGGTAT AGCGGGGGGG 
CTAGCTGGAT TTGCGATAAA GAGCGTAGCC GAAACTACAG CGGCGCCGCA GTCCGAGTCT
AAGGCAACTA TCGTCTCTAT ACCCTCTATA TGCGGTATGT GTATGGCGCA GTGCGCCATT
TACATAGATG TAGTTAACGG TAAGCCGGTG CGTATTAGGC CTAATACAAA CGCCCCAACC
AGCGCGATTG GGATATGTGC CAGGGGGGTC TCAGGCACGT TTAACACGTG GCTAAACCCC
GACGTCATTA AGAAGCCCAT GGCTAGGAAA GCCCTTGTCG ACTGGGCCCA GGGCAAAATC
TCGTGGGAAG AAGTCAAGAG GCAGATAGCG CAGAGCCGCG GCAGGTACGA CGACATGGTG
GAGGTGGATT GGAACACCGC TATCGAAATC ATCGCGAAGA AGCTTAAGGA GCTTGCCGAC
AACAACGAGC GTCAAGCCTT CACCTTCCTC TTCGGCGCCT GGGGGCCAAC CGCATCTATG
CGCGCCGGGG TGCCCATATC CAGATTCGCC GACACTTTCG GCGGCGGGCA GATCACCTTC
GACAACCCCT ACTGCACTTA CCCCCGCTAC CTCGGCCACT GGCTGACTTG GGGCCATGGC
CACCAAGCCC ACGTATCTTG CATAGATTAC GGCGAGGCGG AGGCCATACT GGTAGTTAGG
AGGAACGTGA TAGGCGCAGG CGTCGTGACG GAGACATGGC GCTTCATGGA GGCTGTGAAG
AGGGGAGCGA TGCTGGTGGT GTTGAGTCCC GTCTTCGACG AGACCGCCTC ATATGCCACT
GTCTGGCTAC CTGTAAAGCC CGGCACAGAC CTCGCAGTCC TCTTGGCATT TATCAAATAC
GTGCTTGACA ACGGATGTTA TGTAGAGCCG TATCTCAGGA CTTATACAAA CGCGCCGTTT
TTAATAAAAG AAGATGGCTT GCCGTTGCTC GCCTCCGAGG TTGCTTGGGA CAAGTACGGA
GTGCAGGCGC CTTCCGGCTT TGCCTACGTG GTCTGGGACA CGGCTACAAA CGCCCCCGCC
CCCGACAACG CAGCGAGGCA AGCCTCTTTG TTTGGACAGT ATGAGGTACA GCTTAAAGAT
GGGACAGTGG CCAAGGTGAA GACCGCGTTG ACTATACTTA AGGAGTGGGT TGACGCAAAC
CTCGCGGCAC TCGCTAAGAA GCACGGGGTG GGCGACTACA TGGAGGCCGT GGCCAAAGAG
GCGGATGTTG ATGTAAACGA CTTAAGGAGG GCTGCCAAGA TTGTGTCTCA GTACCGCGCG
GTGGCTCCCA TAGGCTGGCA CGACCCGCGT TACAGCAACT CGCCGCAGAC TTGGAGGGCA
GTGGGCGTCT TGATGGCCCT CCTGGGCAGA ATACAACAGC CCGGCGGCTT ATTCCTATTG
ACCCACTTGA TAATGCCCTA CGCAGATGTG TATAACAAGG TGATGAAGTA TACTAAGAAA
GACGTGCCCT ACAAAACAAT ACGGGGAATG ACGTTTTCTG AATACGTCTC TTCAAACATG
TCTGCTGTGT ATGTAATTCC TATCGCGCCG CCGTTGCCTG GTCCTAGCGA CCGCGGCGCC
CCGCCAGTGC CGACGCTTGT GGAGAAATGG GCTGAGGAGG CTGAAAAGCA GGGCTACCTC
TACCCGTACG ACACAGTCCA GGCGCTTTAC GAGAGCGTTG TCTACGGCAA GCCGTTTAAG
ACAAAGGTGG TCTTCATCAC GGGGTCTAAC CCAATTCCGC AGATCGGCAA CAGTAAACTG
GTGGAGGAGA TTTTCCGCAA CCTCGACCTT GTCATTGTCC ACGACATACA GTTCAACGAC
ACGACGGCCT TCGCCGACGT GATTCTGCCC GACCTCCCCT ACCTAGAGAG AATGGACCTA
GCGCTCCCAG GCCCGTTCTC TCCGTTCCCA GCCATCTCCG TGCGGTTCCC CTGGTATTAC
GAGGAGTATA AGGCGAGGCT ACAGCAGGGG GAGAAGCCGG GCGAGTTGGA CAAGAAGTTC
AGATCGCGCA ACGGGAGGAC AATTTTCGAG GTTTTGTTGA TGATTGCCAG GAGGCTACAG
CAGATGGGAG TCAAGGCGAG GGACGGCACT GACTGGTCCC AGAACATGCC CGTGGGGATG
ATAACGGAAG ACGGCATATT CCCCATCCCC AACTTGATGA ACTTCATAAA CGCCACGTTT
AGGAGGATTA GGATTATTGA CGAGAACGGC CAGGTAAGGG CGCCGACTGT TGACGATTTG
TATAAGATGG GCGGCTACAT GGTGTTAGTC CCCACGGGCA GATTAGAAAC TGTAGTAGAC
GAGAGGTGGA GCCAAGCGCT TGGGCGGGAG GTGAGGGTGA GGGTGCATGT GTTTAAGCCT
GTCCAGTATA CAGTAGACAA GGAGGCTTGG CTGTGGCGCG TCGTCCACTA CAACTCCCCC
ATTACCCAGG GCCTGGCGCC GTTGCCGACG CCGAGTGGGA AAGTCGAGAT ATACAGCATC
AACTTGGCAT ACGACGTCAA GAGGGTATTC GGCAAGCCTG CGACCTCTAT CGACCCGTCT
GACCTTGGGG GGACTAAAAG CGGTGTTGAC CCCTTGTTCT CGCCTGTGCC GCTCTACGCC
GGCATGGCTA GGCCGGACTA CATGTGGGCT ACCGGCCCGC CAACGCCAGA CATCAAGGTG
AACGGCCTAG TGCCGCCGGA GCCGCCGAAG AGGCTGTTGC TGGTTTACAG GCACGGGCCC
TATACCCATA CCCACAGCCA TACGCAGAAT AACATGTTGC TTAACACGCT GACTCCTGAC
GAGTTGCTGA TGGCGTGGAT CCACCCGGAC ACCGCGGCTA AGCTAGGGGT GAACGACGGC
GACGTGATAG AGGTGAGACC AGCGGCGCCG AAAGTCCTTG AACAGTTAAA GGCAGTGGGC
GTAGGCGAGG TACCTGCGGC GAGGTTTAAG GTGAGGGTTA CTCCGATGGT TAGACCAGAC
ATCATTGCGA TATACCACTA CTGGCTTGTG CCGAGGGGGA GGCTTAGGGC TAAGGCGGAG
AAGCTTGTAA ACCTCCGCTC CGGCTACAGC GACGACAACT ACCTCGGCCC AATGTTGGCT
GGGAGGCTTG GGACGCCCGG CGCCATGGGC AACACAGTAG TAGAGGTGAG TAAGGTGGGT
GGGCTATGA
 
Protein sequence
MSLTRRDVLK TGVAIGIAGG LAGFAIKSVA ETTAAPQSES KATIVSIPSI CGMCMAQCAI 
YIDVVNGKPV RIRPNTNAPT SAIGICARGV SGTFNTWLNP DVIKKPMARK ALVDWAQGKI
SWEEVKRQIA QSRGRYDDMV EVDWNTAIEI IAKKLKELAD NNERQAFTFL FGAWGPTASM
RAGVPISRFA DTFGGGQITF DNPYCTYPRY LGHWLTWGHG HQAHVSCIDY GEAEAILVVR
RNVIGAGVVT ETWRFMEAVK RGAMLVVLSP VFDETASYAT VWLPVKPGTD LAVLLAFIKY
VLDNGCYVEP YLRTYTNAPF LIKEDGLPLL ASEVAWDKYG VQAPSGFAYV VWDTATNAPA
PDNAARQASL FGQYEVQLKD GTVAKVKTAL TILKEWVDAN LAALAKKHGV GDYMEAVAKE
ADVDVNDLRR AAKIVSQYRA VAPIGWHDPR YSNSPQTWRA VGVLMALLGR IQQPGGLFLL
THLIMPYADV YNKVMKYTKK DVPYKTIRGM TFSEYVSSNM SAVYVIPIAP PLPGPSDRGA
PPVPTLVEKW AEEAEKQGYL YPYDTVQALY ESVVYGKPFK TKVVFITGSN PIPQIGNSKL
VEEIFRNLDL VIVHDIQFND TTAFADVILP DLPYLERMDL ALPGPFSPFP AISVRFPWYY
EEYKARLQQG EKPGELDKKF RSRNGRTIFE VLLMIARRLQ QMGVKARDGT DWSQNMPVGM
ITEDGIFPIP NLMNFINATF RRIRIIDENG QVRAPTVDDL YKMGGYMVLV PTGRLETVVD
ERWSQALGRE VRVRVHVFKP VQYTVDKEAW LWRVVHYNSP ITQGLAPLPT PSGKVEIYSI
NLAYDVKRVF GKPATSIDPS DLGGTKSGVD PLFSPVPLYA GMARPDYMWA TGPPTPDIKV
NGLVPPEPPK RLLLVYRHGP YTHTHSHTQN NMLLNTLTPD ELLMAWIHPD TAAKLGVNDG
DVIEVRPAAP KVLEQLKAVG VGEVPAARFK VRVTPMVRPD IIAIYHYWLV PRGRLRAKAE
KLVNLRSGYS DDNYLGPMLA GRLGTPGAMG NTVVEVSKVG GL