Gene Information |
Locus tag | Pars_1545 |
Symbol | |
ID | 5054034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1400632 |
End bp | 1401600 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469086 |
Product | alcohol dehydrogenase |
Protein accession | YP_001153751 |
Protein GI | 145591749 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
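The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1400632
END_BP = 1401600
GENE_LENGTH_BP = 969
PROTEIN_LENGTH_AA = 322


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span covers 969 bp, which is 323 codons, or 322 residues plus one stop codon.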
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0146504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCTG TACAGCTTGT CAAATTCGGC GAACCCGCGG AAGCGTTGAA GTTTGTTGAT CTTCCCGATC CTGTGCCGGG TCCCGGCGAC GTGGTCGTGA AAATAGAGGC CGCGGGGGTA TGTGGGAGGG ACTTAGTGGT GAGGAAGGGC GCCTTCCCCC ACGTGAAGCC GCCGATAGTT CCAGGACACG AAGGCGTGGG GAAAATAGTA GATGTAGGCC CCGGCGTGGA GAAGGATATT ATCGGCGAGA GGGTGTTCCT CTCCGGTATA TACGACGGCA CGTGCGAATA CTGTAAGAGA GGGCTTGAAA ATCTATGTAA AAACGCCGAG CTACTAGGCG AGTCGCGCAA CGGGACATAC GCCGAGTATG TGCTAGTCCC AGCAAAGTTC GCCCACCCAT TCCACGGCCT AGATCCAAGA GTTGCGGTCG TGGCCACATG CCCGCTGTCC ACAGCAGTGT ACGCGTTGAG ACACGTGGAC GTAGAGGGAA AAAAAGTACT GGTAGTAGGC GCAGGCGGAA CAGGTATCTA CATTGCACAG CTGGCTAAAG TAAGAGGCGC CGAGGTCTAC GTCTCAACCA GGTCGCCGGA CAAGGCAAGA GTTTTGAAAG AGTTGGGTAT CAACACGGCG CCGGAGGGCG AGAAGGACTT TGACGTCGTG GTGGATACGG TGGGAAGCCC CACACTGGAG CGCTCCCTCA AGCTGGCCAA GAGATCGGGC TCTGTCTTGG TCATCGGCAA CGTAACTGGA GAAAAGGCGT TGCTAAGCCC CGCGCTGATA ATTCTAAGAC AGTTGAAGGT AATAGGCAGC ATGGCCTTCC GGCCCTGGGA CATATACGAG GCGCTGGACA TACTGAAAAG AGGGCTAGTA AAGCCGCTCT ACACCGAGTA TAAGCTACAA GACGCCGCTA GGGCCCATGA GGATATGGAA AGAGGAGCGG TCATAGGCAG GGCCATCCTC GTGCCTTGA
|
Protein sequence | MKAVQLVKFG EPAEALKFVD LPDPVPGPGD VVVKIEAAGV CGRDLVVRKG AFPHVKPPIV PGHEGVGKIV DVGPGVEKDI IGERVFLSGI YDGTCEYCKR GLENLCKNAE LLGESRNGTY AEYVLVPAKF AHPFHGLDPR VAVVATCPLS TAVYALRHVD VEGKKVLVVG AGGTGIYIAQ LAKVRGAEVY VSTRSPDKAR VLKELGINTA PEGEKDFDVV VDTVGSPTLE RSLKLAKRSG SVLVIGNVTG EKALLSPALI ILRQLKVIGS MAFRPWDIYE ALDILKRGLV KPLYTEYKLQ DAARAHEDME RGAVIGRAIL VP
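The relationship between the two sequence fields can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and that the gene sequence is shown in coding orientation even though the gene lies on the minus strand. Only the first and last blocks of the gene sequence are used here, and the helper names are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard
# code, assumed here to match translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record's sequences into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last block of the gene sequence in this record.
GENE_START = "ATGAAGGCTG TACAGCTTGT CAAATTCGGC"
GENE_END = "GTGCCTTGA"

if __name__ == "__main__":
    print(translate(GENE_START))
    print(translate(GENE_END))
```

The first call yields MKAVQLVKFG, which matches the first block of the protein sequence, and the second yields VP, which matches its final two residues before the TGA stop codon.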