Gene Information |
Locus tag | Pars_0925 |
Symbol | |
ID | 5055712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 817467 |
End bp | 820655 |
Gene Length | 3189 bp |
Protein Length | 1062 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640468481 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001153157 |
Protein GI | 145591155 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
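The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Pars_0925.
start_bp, end_bp = 817467, 820655

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp)  # 3189, matching "Gene Length"
print(length_aa)  # 1062, matching "Protein Length"
```

Because the gene is on the minus strand, the stored sequence would be the reverse complement of the replicon between these coordinates; the length arithmetic is unaffected by strand.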
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCACTTA CTAGACGCGA TGTTTTAAAA ACCGGCGTCG CCATAGGTAT AGCGGGGGGG CTAGCTGGAT TTGCGATAAA GAGCGTAGCC GAAACTACAG CGGCGCCGCA GTCCGAGTCT AAGGCAACTA TCGTCTCTAT ACCCTCTATA TGCGGTATGT GTATGGCGCA GTGCGCCATT TACATAGATG TAGTTAACGG TAAGCCGGTG CGTATTAGGC CTAATACAAA CGCCCCAACC AGCGCGATTG GGATATGTGC CAGGGGGGTC TCAGGCACGT TTAACACGTG GCTAAACCCC GACGTCATTA AGAAGCCCAT GGCTAGGAAA GCCCTTGTCG ACTGGGCCCA GGGCAAAATC TCGTGGGAAG AAGTCAAGAG GCAGATAGCG CAGAGCCGCG GCAGGTACGA CGACATGGTG GAGGTGGATT GGAACACCGC TATCGAAATC ATCGCGAAGA AGCTTAAGGA GCTTGCCGAC AACAACGAGC GTCAAGCCTT CACCTTCCTC TTCGGCGCCT GGGGGCCAAC CGCATCTATG CGCGCCGGGG TGCCCATATC CAGATTCGCC GACACTTTCG GCGGCGGGCA GATCACCTTC GACAACCCCT ACTGCACTTA CCCCCGCTAC CTCGGCCACT GGCTGACTTG GGGCCATGGC CACCAAGCCC ACGTATCTTG CATAGATTAC GGCGAGGCGG AGGCCATACT GGTAGTTAGG AGGAACGTGA TAGGCGCAGG CGTCGTGACG GAGACATGGC GCTTCATGGA GGCTGTGAAG AGGGGAGCGA TGCTGGTGGT GTTGAGTCCC GTCTTCGACG AGACCGCCTC ATATGCCACT GTCTGGCTAC CTGTAAAGCC CGGCACAGAC CTCGCAGTCC TCTTGGCATT TATCAAATAC GTGCTTGACA ACGGATGTTA TGTAGAGCCG TATCTCAGGA CTTATACAAA CGCGCCGTTT TTAATAAAAG AAGATGGCTT GCCGTTGCTC GCCTCCGAGG TTGCTTGGGA CAAGTACGGA GTGCAGGCGC CTTCCGGCTT TGCCTACGTG GTCTGGGACA CGGCTACAAA CGCCCCCGCC CCCGACAACG CAGCGAGGCA AGCCTCTTTG TTTGGACAGT ATGAGGTACA GCTTAAAGAT GGGACAGTGG CCAAGGTGAA GACCGCGTTG ACTATACTTA AGGAGTGGGT TGACGCAAAC CTCGCGGCAC TCGCTAAGAA GCACGGGGTG GGCGACTACA TGGAGGCCGT GGCCAAAGAG GCGGATGTTG ATGTAAACGA CTTAAGGAGG GCTGCCAAGA TTGTGTCTCA GTACCGCGCG GTGGCTCCCA TAGGCTGGCA CGACCCGCGT TACAGCAACT CGCCGCAGAC TTGGAGGGCA GTGGGCGTCT TGATGGCCCT CCTGGGCAGA ATACAACAGC CCGGCGGCTT ATTCCTATTG ACCCACTTGA TAATGCCCTA CGCAGATGTG TATAACAAGG TGATGAAGTA TACTAAGAAA GACGTGCCCT ACAAAACAAT ACGGGGAATG ACGTTTTCTG AATACGTCTC TTCAAACATG TCTGCTGTGT ATGTAATTCC TATCGCGCCG CCGTTGCCTG GTCCTAGCGA CCGCGGCGCC CCGCCAGTGC CGACGCTTGT GGAGAAATGG GCTGAGGAGG CTGAAAAGCA GGGCTACCTC TACCCGTACG ACACAGTCCA GGCGCTTTAC GAGAGCGTTG TCTACGGCAA GCCGTTTAAG ACAAAGGTGG TCTTCATCAC GGGGTCTAAC CCAATTCCGC AGATCGGCAA CAGTAAACTG 
GTGGAGGAGA TTTTCCGCAA CCTCGACCTT GTCATTGTCC ACGACATACA GTTCAACGAC ACGACGGCCT TCGCCGACGT GATTCTGCCC GACCTCCCCT ACCTAGAGAG AATGGACCTA GCGCTCCCAG GCCCGTTCTC TCCGTTCCCA GCCATCTCCG TGCGGTTCCC CTGGTATTAC GAGGAGTATA AGGCGAGGCT ACAGCAGGGG GAGAAGCCGG GCGAGTTGGA CAAGAAGTTC AGATCGCGCA ACGGGAGGAC AATTTTCGAG GTTTTGTTGA TGATTGCCAG GAGGCTACAG CAGATGGGAG TCAAGGCGAG GGACGGCACT GACTGGTCCC AGAACATGCC CGTGGGGATG ATAACGGAAG ACGGCATATT CCCCATCCCC AACTTGATGA ACTTCATAAA CGCCACGTTT AGGAGGATTA GGATTATTGA CGAGAACGGC CAGGTAAGGG CGCCGACTGT TGACGATTTG TATAAGATGG GCGGCTACAT GGTGTTAGTC CCCACGGGCA GATTAGAAAC TGTAGTAGAC GAGAGGTGGA GCCAAGCGCT TGGGCGGGAG GTGAGGGTGA GGGTGCATGT GTTTAAGCCT GTCCAGTATA CAGTAGACAA GGAGGCTTGG CTGTGGCGCG TCGTCCACTA CAACTCCCCC ATTACCCAGG GCCTGGCGCC GTTGCCGACG CCGAGTGGGA AAGTCGAGAT ATACAGCATC AACTTGGCAT ACGACGTCAA GAGGGTATTC GGCAAGCCTG CGACCTCTAT CGACCCGTCT GACCTTGGGG GGACTAAAAG CGGTGTTGAC CCCTTGTTCT CGCCTGTGCC GCTCTACGCC GGCATGGCTA GGCCGGACTA CATGTGGGCT ACCGGCCCGC CAACGCCAGA CATCAAGGTG AACGGCCTAG TGCCGCCGGA GCCGCCGAAG AGGCTGTTGC TGGTTTACAG GCACGGGCCC TATACCCATA CCCACAGCCA TACGCAGAAT AACATGTTGC TTAACACGCT GACTCCTGAC GAGTTGCTGA TGGCGTGGAT CCACCCGGAC ACCGCGGCTA AGCTAGGGGT GAACGACGGC GACGTGATAG AGGTGAGACC AGCGGCGCCG AAAGTCCTTG AACAGTTAAA GGCAGTGGGC GTAGGCGAGG TACCTGCGGC GAGGTTTAAG GTGAGGGTTA CTCCGATGGT TAGACCAGAC ATCATTGCGA TATACCACTA CTGGCTTGTG CCGAGGGGGA GGCTTAGGGC TAAGGCGGAG AAGCTTGTAA ACCTCCGCTC CGGCTACAGC GACGACAACT ACCTCGGCCC AATGTTGGCT GGGAGGCTTG GGACGCCCGG CGCCATGGGC AACACAGTAG TAGAGGTGAG TAAGGTGGGT GGGCTATGA
|
Protein sequence | MSLTRRDVLK TGVAIGIAGG LAGFAIKSVA ETTAAPQSES KATIVSIPSI CGMCMAQCAI YIDVVNGKPV RIRPNTNAPT SAIGICARGV SGTFNTWLNP DVIKKPMARK ALVDWAQGKI SWEEVKRQIA QSRGRYDDMV EVDWNTAIEI IAKKLKELAD NNERQAFTFL FGAWGPTASM RAGVPISRFA DTFGGGQITF DNPYCTYPRY LGHWLTWGHG HQAHVSCIDY GEAEAILVVR RNVIGAGVVT ETWRFMEAVK RGAMLVVLSP VFDETASYAT VWLPVKPGTD LAVLLAFIKY VLDNGCYVEP YLRTYTNAPF LIKEDGLPLL ASEVAWDKYG VQAPSGFAYV VWDTATNAPA PDNAARQASL FGQYEVQLKD GTVAKVKTAL TILKEWVDAN LAALAKKHGV GDYMEAVAKE ADVDVNDLRR AAKIVSQYRA VAPIGWHDPR YSNSPQTWRA VGVLMALLGR IQQPGGLFLL THLIMPYADV YNKVMKYTKK DVPYKTIRGM TFSEYVSSNM SAVYVIPIAP PLPGPSDRGA PPVPTLVEKW AEEAEKQGYL YPYDTVQALY ESVVYGKPFK TKVVFITGSN PIPQIGNSKL VEEIFRNLDL VIVHDIQFND TTAFADVILP DLPYLERMDL ALPGPFSPFP AISVRFPWYY EEYKARLQQG EKPGELDKKF RSRNGRTIFE VLLMIARRLQ QMGVKARDGT DWSQNMPVGM ITEDGIFPIP NLMNFINATF RRIRIIDENG QVRAPTVDDL YKMGGYMVLV PTGRLETVVD ERWSQALGRE VRVRVHVFKP VQYTVDKEAW LWRVVHYNSP ITQGLAPLPT PSGKVEIYSI NLAYDVKRVF GKPATSIDPS DLGGTKSGVD PLFSPVPLYA GMARPDYMWA TGPPTPDIKV NGLVPPEPPK RLLLVYRHGP YTHTHSHTQN NMLLNTLTPD ELLMAWIHPD TAAKLGVNDG DVIEVRPAAP KVLEQLKAVG VGEVPAARFK VRVTPMVRPD IIAIYHYWLV PRGRLRAKAE KLVNLRSGYS DDNYLGPMLA GRLGTPGAMG NTVVEVSKVG GL