Gene Information |
Locus tag | Pars_0547 |
Symbol | |
ID | 5054275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 489883 |
End bp | 490968 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468109 |
Product | alcohol dehydrogenase |
Protein accession | YP_001152794 |
Protein GI | 145590792 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
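The coordinate and length fields above are tied together by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(489883, 490968)
print(length, protein_length(length))  # 1086 361
```

Both results agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields.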
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.986986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAGCCG CTTTATTAAC TGAATTCAAC AAGCCGCTTG AAATTAAGGA CGTAGAGGTT CCGAAGGTAG GCAAGGGGGA GGTTTTACTA CAAGTACTCG CGTGTGGCAT CTGTAGATCG GACTGGCACC TGTGGAGGGG AGATCCCTCC TTGGTCGCGT ATATGCAGTG GTCGGGCGGC AAACTGCCCA TTATCCCGGG ACATGAAGTG GCGGGGAGGG TGGTCGAGGT GGGGGAGGGG GTTAGCAATG TTGAAGTGGG AGACGTAGTG GTGGCCCCGG CGTCGTCAAC GGGGGATAAC AGAACTTGTA GGTACTGCAA GGAGGGGGCT TCGAACATAT GCGAACACCT TTGGATTCCG GGCTTCGGCA CACACGGATG CTACGCTGAA TATATGAAGG TGCCGGCCAG CTCGGTAGTA GACCTCGTTA AAGTCCCAGA AGGCGTCCCG CCTGAATACG CGGCTATAAC CGGGTGCGGT TTCGGCACTG CGTGGAACGC CCTAGTGGTT AAGAACGGCA TTAGGCCTGG CGAAACGCTA TTAATAACGG GAGCGGGGGG CATGGGCCTC AGCGCTTTGT TAATAGCCTC TGCCGCCGGG GCGAAAACCG TCGTGGTCGA TGTAAACCCC GCCTCAGTAG AAAAGGCGAA GAAAATGGGA GCAACTGCGG CATATCACTA CTCTGGACAT CCCCAGGAGC TCGCCAAGCT CGTTAACGAG GAGATCGTGA AGTCTTTTGG CATGGTCGAT GCTGTGTTCG ATTCCACGGG CAATCCCGAC GTCCTATCCG CGGTGTTGCC GGCAGTGCGG CCGCAGGGCA GGATACTGTT GGCGGGGCTC ATGATGAAGG GCAAGGAGAT CTGGCCGCTG GCCTCCGATA TAGTAGTCGC CAGAGAGTTG ACCATACAAG GAGTGTTGAT GCTACCGTCG CAGAAATACG ACGGGATATT TAAGCTTATA TCGGAGGGAA GGGTGAACCT TGAGCCTGTG ATCTACCGGA GGATATCTCT CGATGAGGTG AACGACGCAT ACGCCGAGAT GTCCCGTTTC AAAAACGCCG GCAGATTTGT AATTACTAAA TTTTAA
Protein sequence | MRAALLTEFN KPLEIKDVEV PKVGKGEVLL QVLACGICRS DWHLWRGDPS LVAYMQWSGG KLPIIPGHEV AGRVVEVGEG VSNVEVGDVV VAPASSTGDN RTCRYCKEGA SNICEHLWIP GFGTHGCYAE YMKVPASSVV DLVKVPEGVP PEYAAITGCG FGTAWNALVV KNGIRPGETL LITGAGGMGL SALLIASAAG AKTVVVDVNP ASVEKAKKMG ATAAYHYSGH PQELAKLVNE EIVKSFGMVD AVFDSTGNPD VLSAVLPAVR PQGRILLAGL MMKGKEIWPL ASDIVVAREL TIQGVLMLPS QKYDGIFKLI SEGRVNLEPV IYRRISLDEV NDAYAEMSRF KNAGRFVITK F
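The link between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (as shown, it begins with ATG), that the spaces are only grouping separators, and that the standard amino-acid assignments apply; translation table 11 uses the same assignments as table 1 and differs in its permitted start codons. All helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these (assumption
# stated above: only the set of allowed start codons differs from table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in this record.
fragment = "ATGAGAGCCG CTTTATTAAC TGAATTCAAC"
print(translate(fragment))  # MRAALLTEFN
```

The translated fragment matches the first ten residues of the protein sequence. Under the same assumptions, passing the full gene sequence to `translate` and `gc_percent` would be the way to compare against the listed protein and the GC content field.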