Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2070 |
Symbol | |
ID | 5055993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1850392 |
End bp | 1852248 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640469619 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_001154268 |
Protein GI | 145592266 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.360567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0416579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTTC TAGCCTGCAC CCGGGACTGC TACGACACGT GCATTTTCCA CGTGGTGAGA GATGGGGAGA TACGCCTAGT CCCAATCAGC GATTTTCCCA CTCTAGGCTT CACCTGCGCC CGGGGCATGG CAGATGTGCG GAGGCTAAAC TCGCCGAGGA GGATAAAGAC GCCGTTGCTC AGAGGAGAGC GGCAAGTCGT GGAGGCGAGC TGGGGCCGCG CCCTTGGGGA GCTGGCGGCG AGGATCAAGG AGGCGGACCC CCAACGCGTA ATACACATAG ACTACGACGG CAACCAAGGC CTCCTCACCT GGTACTACCC CGCAAGGCTC TTCAACGCCC TGGGGACAGC CTCTACCGAC TACTCCATCT GTAGCGCCGA GGGGCACGAG GCTATAAAGC TACACTGGGG CCGCTCCTAC GGCGCCATGC CGGAGGAGCT GGGGAGGAGG CCCGTGGTGT TCTGGGCGCT TGACGCCTCC ACGTCGTTTA TCCACGGCTG GGCCCTCGCC AAAAGGGGTA GAAACCCCAC TGCCGCCGTA GACGTGGTGT GGACCAGGAC CATGAAGGCA GTGGATCTGC CAGTGCTGGT GCGGCCTGGG ACAGACGTGG TTTTGGCCCT CGGCGTCGCC AGGGAGATAA TTGAGAGGGG GGCCTACGAC AGGGAGTTCG TGGAGAAATA CACCTACGGC TTCCACCTCT TTAGGGAGTA CGTCCAGAAG TTCACTCCTC AGTACGTCGA GGATGAGGCC GGCGTCCCCC GGGATATCTT CTACAAGCTG GTAGACATCT ATCTGAGAAG GCCCGTGACG GTTATAGGCT TTGCCATTGG GAGAACCGAG AACGGCGGCG ACGCCGCTAG GGCCATATCG CTAATACACG CCCTTCTGGG AGACCCCGCC GGCTTCTACT ACTCCAACTC GGGGGCCTGG GGCATCGACT TCGCCTACCT GCGCGGGTTG CACGTGGCTA AGCCGAGCAG GGTAGTCCCT ATGGGGGTTG TGGGCGGCGT CATCGAGGAG TTTAGGGTGG TGTACGTCTG GAACGCCAAC CCCGTCCTCA CGCTCCCCCA GGGAGACAGA ATCGCCAAGG CGGCGGAGAG GGGCGACATA ACCCTCGCCG TGCATGCCCC GTTGCTGGAC GAAACCGCCG AGGCGGCGCA CATCGTGTTG CCGGCGCCGC TGTACTTGGA AAAAGACGAC GTGATCTACA GCTACTGGCA CAACTACCTC GTCTACAACG CCGCAGTGGC TGAGCCCCCC GGGGATGCGA GGAGGGAGAC CTGGGTCGTG AAGAAGCTCG CCGAGCTTCT GGGAGTTGGC GACCACCCCC TCTTGCGGGA AGACCCCTGG GACGCCGTGG ATATTGCCAT AAGGGGAACC GGCGTTACCT TAAAAGAGCT GAGGGAGCGC CAGCTGGTTA AGCTCAAGGC GCCGGACTAC TATAAGTTCC CCACGGCTAC GGGCAAGGTG GAGTTCTACA GCGCCACGGC GGAGCGCCGA GGCTTGCCGC CGCTTCCCCA GTACGCGCCG CCAAGGAGGG GCTACGTCTT GACCTTCCCG CCCCATACCC TATACACCAA TAGCCAGTTT AGAGACGTCT ACGGGGAGCC TGAGCCCGCC GTGTTGGTAA ACCCAAGCGA CTACGTGGGC GACTGCATTG TACTGTACAA CGAGGCGGGG GAGGTGAGGG TAAGAGCTAG GCCCAGCCCA GAAGTGCCCC GCGGCGTAGT CGCCTATCTG GGCATCGGCA AGGACCTCCG GGGGGAGCCC ATAAACAAGA TAGCAAGAGG CGAGCCGGGG CCCTACGGAG GCACCCCCAA GCTCTACACT ACCTATGTAC AAATGAGACC ATGTTAA
|
Protein sequence | MGLLACTRDC YDTCIFHVVR DGEIRLVPIS DFPTLGFTCA RGMADVRRLN SPRRIKTPLL RGERQVVEAS WGRALGELAA RIKEADPQRV IHIDYDGNQG LLTWYYPARL FNALGTASTD YSICSAEGHE AIKLHWGRSY GAMPEELGRR PVVFWALDAS TSFIHGWALA KRGRNPTAAV DVVWTRTMKA VDLPVLVRPG TDVVLALGVA REIIERGAYD REFVEKYTYG FHLFREYVQK FTPQYVEDEA GVPRDIFYKL VDIYLRRPVT VIGFAIGRTE NGGDAARAIS LIHALLGDPA GFYYSNSGAW GIDFAYLRGL HVAKPSRVVP MGVVGGVIEE FRVVYVWNAN PVLTLPQGDR IAKAAERGDI TLAVHAPLLD ETAEAAHIVL PAPLYLEKDD VIYSYWHNYL VYNAAVAEPP GDARRETWVV KKLAELLGVG DHPLLREDPW DAVDIAIRGT GVTLKELRER QLVKLKAPDY YKFPTATGKV EFYSATAERR GLPPLPQYAP PRRGYVLTFP PHTLYTNSQF RDVYGEPEPA VLVNPSDYVG DCIVLYNEAG EVRVRARPSP EVPRGVVAYL GIGKDLRGEP INKIARGEPG PYGGTPKLYT TYVQMRPC
|
| |