Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0776 |
Symbol | |
ID | 5055914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 691728 |
End bp | 692693 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640468335 |
Product | thioredoxin-disulfide reductase |
Protein accession | YP_001153014 |
Protein GI | 145591012 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01292] thioredoxin-disulfide reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.624753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGCGG ATTACGATGT GATTATTGTG GGGGCGGGGA TAGCGGGTCT CTCGGCGGCT TTGTACGCGG CCAGGCAGAG GCTCAAGACT CTAGTCATTA GCAAGGATCT CGGCGGTCAG CTGAACATGA CGACTCTCAT CGAGAACTAC CCGGCGATCG CCAAGATCTC CGGCCCCGAG CTTGCCAAGC GCGTTGAACA ACAGGCTAGG GCCTTCGGCG CAGAGGCCAT CTTCGACGAG GTGAAGACGG TGGAGAAGCA GGGCGACGTC TTCGTGGTGA AGACGGAGGG GGGCGACGAG TACAAGGCGT TGGCAGTAGT CCTGGCCTTC GGCAAGACGC CGAAGGAGCT GAACGTCCCA GGGGAGGCCA AGTTCAAGAA CAAGGGCGTC TCCTACTGCA CCATTTGCGA CGCGCCGTTT TTCAAGGGCC AAGACGTGGC GCTGGTGAGC TGGGGGGACT TGGCCAGGGA GCCCGTAACT ATCTTGTCCT CCGTCTCGAA CAAATTCTAC TGGATTTTCC CAGGGGATAA GCCTATACAC GACGAAGAGT TTATAGAACA AGCCAAGAGG CTTGGCAAAG CCGTTTTTAT GCCTAACAGC GAGGTGGTGG AGATAAAGGG CGACGCCAAG GTTAAAGCCG TGGTGGTCAA GAACAGGAAG ACGGGCGAGA TCCAAGAGCT ACCCGTATCG GCAGTATTTA TTGAAGTAGG CTACGTCACG AAAAGCGACT TCGTAAAACA TCTTGTCGAT TTGAATGAGA GAGGCGAGAT AATAGCAGAT TGGGAAGGAA GGACAAAAAC ACCAGGCGTC TTCGCCGCTG GCGACATCGT TGCCTATCCC TACAAACAGG CGGTAATATC GGCCGCCATG GGCGTCGCCG CCGCCCTCTC AGCCACGGCG TACGTAATGA AGCTGAAGGG CAAGCCCGTA CACAGCCTAG TGGACTGGAG GGCCGAGAAA AAATAA
|
Protein sequence | MEADYDVIIV GAGIAGLSAA LYAARQRLKT LVISKDLGGQ LNMTTLIENY PAIAKISGPE LAKRVEQQAR AFGAEAIFDE VKTVEKQGDV FVVKTEGGDE YKALAVVLAF GKTPKELNVP GEAKFKNKGV SYCTICDAPF FKGQDVALVS WGDLAREPVT ILSSVSNKFY WIFPGDKPIH DEEFIEQAKR LGKAVFMPNS EVVEIKGDAK VKAVVVKNRK TGEIQELPVS AVFIEVGYVT KSDFVKHLVD LNERGEIIAD WEGRTKTPGV FAAGDIVAYP YKQAVISAAM GVAAALSATA YVMKLKGKPV HSLVDWRAEK K
|
| |