Gene Pars_0534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0534 
Symbol 
ID5055282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp480931 
End bp481917 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content67% 
IMG OID640468096 
Productalcohol dehydrogenase 
Protein accessionYP_001152781 
Protein GI145590779 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.83673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCTG TTGTCTTCCA CAGCCCTGGG CTTGAGAACC TCAGGCTGGA GGATCTCCCG 
AAGCCGCGGC CGGGGCCGGG CGAGGTTCTT GTCAGGGTTA AATACGTGGG GGTGAACCCC
ATCGACTACG CGGTGGTTTC CGGCTCGTAC AAGGCGTCGC CCATGCCCCA CATCCCGGGG
TGCGAGTTCG CTGGGGTGGT TGAGGAGGTG GGGCCCGGCG TCTCGGGGCC TGCGCCTGGC
ACGCCGGTGG CCGTCTACAA CCGCCTCTTT TGCGGCGCCT GTAGGCAGTG CCTCACCGGG
TGGACTCAGC TCTGCGAGGC CGGCGGCATA ATAGGCGTGG CGACCCAGGG TGGCATGGCT
GAATATGCCG TGGTGCCCTC CAGGAATGCG GAGCCTGTGA AGGCGGATCT GAGGGACGCC
GCCACGCTCC CCATAGGCGC GTTGACTGCC TACAACATGG CTCTGTGCGC CTCGATAGCC
CCCGGGGAGA GAGTCGCCGT TGTGGGCGCC ACGGGGAACG TGGGGACATA CGCAGTACAG
TTCGCCAAGA TCTTCGGCGG CGAGGTATAC GCTGTGACCA GGAGGAAGGA TGCCGCCGCG
GCAATGTTGC GGCAACTAGG CGCGGAGGTA GTCACGCCGG ACGAAGCCCG GGGGCTCGCC
CCCTTCGACG TGGTGCTGGA CCCAACGGGC GCCGCCAACT GGGGCCTCAG CATGTCTCTG
CTGGGCCGCG GCGGGCGGTA CGTCACAGCG GGGGCCCTAA CAGGCGCCGA AGTCTCTCTG
GACCTCAGGC GGGTGTTTGG ACAGCAGATC TCAGTGATAG GCTCCACCGG CGGCAGGAGG
GCGGACTTCA AGACGGTGGT GAGACTCCAC GAGGCGGGGA GGATAAGGGC GGTGATACAC
GCAGTGTATC CGCTGGCCGA CGCCGCCAAG GCCCTCGCCG GCCTCAGCTC GCCCGCGAGG
GTCGGCAAGA TCCTGCTGGA GGTATGA
 
Protein sequence
MRAVVFHSPG LENLRLEDLP KPRPGPGEVL VRVKYVGVNP IDYAVVSGSY KASPMPHIPG 
CEFAGVVEEV GPGVSGPAPG TPVAVYNRLF CGACRQCLTG WTQLCEAGGI IGVATQGGMA
EYAVVPSRNA EPVKADLRDA ATLPIGALTA YNMALCASIA PGERVAVVGA TGNVGTYAVQ
FAKIFGGEVY AVTRRKDAAA AMLRQLGAEV VTPDEARGLA PFDVVLDPTG AANWGLSMSL
LGRGGRYVTA GALTGAEVSL DLRRVFGQQI SVIGSTGGRR ADFKTVVRLH EAGRIRAVIH
AVYPLADAAK ALAGLSSPAR VGKILLEV