Gene Pars_1545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1545 
Symbol 
ID5054034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1400632 
End bp1401600 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content56% 
IMG OID640469086 
Productalcohol dehydrogenase 
Protein accessionYP_001153751 
Protein GI145591749 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0146504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTG TACAGCTTGT CAAATTCGGC GAACCCGCGG AAGCGTTGAA GTTTGTTGAT 
CTTCCCGATC CTGTGCCGGG TCCCGGCGAC GTGGTCGTGA AAATAGAGGC CGCGGGGGTA
TGTGGGAGGG ACTTAGTGGT GAGGAAGGGC GCCTTCCCCC ACGTGAAGCC GCCGATAGTT
CCAGGACACG AAGGCGTGGG GAAAATAGTA GATGTAGGCC CCGGCGTGGA GAAGGATATT
ATCGGCGAGA GGGTGTTCCT CTCCGGTATA TACGACGGCA CGTGCGAATA CTGTAAGAGA
GGGCTTGAAA ATCTATGTAA AAACGCCGAG CTACTAGGCG AGTCGCGCAA CGGGACATAC
GCCGAGTATG TGCTAGTCCC AGCAAAGTTC GCCCACCCAT TCCACGGCCT AGATCCAAGA
GTTGCGGTCG TGGCCACATG CCCGCTGTCC ACAGCAGTGT ACGCGTTGAG ACACGTGGAC
GTAGAGGGAA AAAAAGTACT GGTAGTAGGC GCAGGCGGAA CAGGTATCTA CATTGCACAG
CTGGCTAAAG TAAGAGGCGC CGAGGTCTAC GTCTCAACCA GGTCGCCGGA CAAGGCAAGA
GTTTTGAAAG AGTTGGGTAT CAACACGGCG CCGGAGGGCG AGAAGGACTT TGACGTCGTG
GTGGATACGG TGGGAAGCCC CACACTGGAG CGCTCCCTCA AGCTGGCCAA GAGATCGGGC
TCTGTCTTGG TCATCGGCAA CGTAACTGGA GAAAAGGCGT TGCTAAGCCC CGCGCTGATA
ATTCTAAGAC AGTTGAAGGT AATAGGCAGC ATGGCCTTCC GGCCCTGGGA CATATACGAG
GCGCTGGACA TACTGAAAAG AGGGCTAGTA AAGCCGCTCT ACACCGAGTA TAAGCTACAA
GACGCCGCTA GGGCCCATGA GGATATGGAA AGAGGAGCGG TCATAGGCAG GGCCATCCTC
GTGCCTTGA
 
Protein sequence
MKAVQLVKFG EPAEALKFVD LPDPVPGPGD VVVKIEAAGV CGRDLVVRKG AFPHVKPPIV 
PGHEGVGKIV DVGPGVEKDI IGERVFLSGI YDGTCEYCKR GLENLCKNAE LLGESRNGTY
AEYVLVPAKF AHPFHGLDPR VAVVATCPLS TAVYALRHVD VEGKKVLVVG AGGTGIYIAQ
LAKVRGAEVY VSTRSPDKAR VLKELGINTA PEGEKDFDVV VDTVGSPTLE RSLKLAKRSG
SVLVIGNVTG EKALLSPALI ILRQLKVIGS MAFRPWDIYE ALDILKRGLV KPLYTEYKLQ
DAARAHEDME RGAVIGRAIL VP