Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0241 |
Symbol | |
ID | 5056221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 215203 |
End bp | 216123 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640467820 |
Product | aldo/keto reductase |
Protein accession | YP_001152508 |
Protein GI | 145590506 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.292882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTATA GGGAGACGGT TGGGCTGAGG CTTTCCGAGA TCGGCTTCGG GGCGTGGGTT GTGGGGAGCG ACTTGTATAG GCTAGACGAC GACACAGCCA GGAGGCTGGT GAAGAGGGCC CTCGACCTCG GCATAAACCT CTTCGACACG GCCGACGTAT ACGGCCGCGG CCGGAGCGAG GAGCTCCTCG GCCAGTGGCT CAAGGGCTAT GACGTGGTCA TATCCACGAA GGTGGGCTAC GACTTCTACT CCGGCGCGAA GCCAGCCAGG AGGTACGACC CCCAGTATCT GGAATTCGCC GTGTCTAAGT CCGCGGAGAG GCTTGGCAGG AGGCCGGACA TCTTGATGCT CCACAACCCG CCGGCGGACG CGGTTAAGTC CGCGGCGGAG TATGCCTTGT CCAAAAGAGG AGTCTGGGCC GACAGGATAG GGGCCGCTCT GGGCCCGGAG ACCAACGTCC TCGCTGAGGG CCTCGCGGCG CTTGAGGCGG GATACGACGC CTTGATGTTC GTCTTTAACA TACTGGAGCA AGAGCCAGCT CTCGAGCTGG TCGGCCGCGG CGCAGGGAGG ATTCTCTTGG CGAGGGTCCC ACACGCCAGC GACGTGCTGA CCGACAGGTT CAAGCCGGAG TTCCCGCCGG AGGACCACCG CTCCCTCCGG AAGAAGGAGT GGCTCATAAA GGCCAGGAAG CTGGTGGAGG CCGAGGTAGC CCCCCTAGCC AAGGAGCTGG GCTACACCCT GGGGCAGTAC GCCCTCAAGT TCGTGCTCTC TTTCCCAGTC ACCAGCGTCT TGGTAACGGC CACCTCTGTA GAGGAGCTGG AGGAATACGC CGAGGCCTCC GACGGCAAAC CCCTACCCCG AGACCACCTA GAAGCGCTGA GGGAGTTTTG GACAAAACAC AGAGAGGAGC TAAGCGAGTA A
|
Protein sequence | MQYRETVGLR LSEIGFGAWV VGSDLYRLDD DTARRLVKRA LDLGINLFDT ADVYGRGRSE ELLGQWLKGY DVVISTKVGY DFYSGAKPAR RYDPQYLEFA VSKSAERLGR RPDILMLHNP PADAVKSAAE YALSKRGVWA DRIGAALGPE TNVLAEGLAA LEAGYDALMF VFNILEQEPA LELVGRGAGR ILLARVPHAS DVLTDRFKPE FPPEDHRSLR KKEWLIKARK LVEAEVAPLA KELGYTLGQY ALKFVLSFPV TSVLVTATSV EELEEYAEAS DGKPLPRDHL EALREFWTKH REELSE
|
| |