Gene Pars_0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0547 
Symbol 
ID5054275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp489883 
End bp490968 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content55% 
IMG OID640468109 
Productalcohol dehydrogenase 
Protein accessionYP_001152794 
Protein GI145590792 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.986986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCCG CTTTATTAAC TGAATTCAAC AAGCCGCTTG AAATTAAGGA CGTAGAGGTT 
CCGAAGGTAG GCAAGGGGGA GGTTTTACTA CAAGTACTCG CGTGTGGCAT CTGTAGATCG
GACTGGCACC TGTGGAGGGG AGATCCCTCC TTGGTCGCGT ATATGCAGTG GTCGGGCGGC
AAACTGCCCA TTATCCCGGG ACATGAAGTG GCGGGGAGGG TGGTCGAGGT GGGGGAGGGG
GTTAGCAATG TTGAAGTGGG AGACGTAGTG GTGGCCCCGG CGTCGTCAAC GGGGGATAAC
AGAACTTGTA GGTACTGCAA GGAGGGGGCT TCGAACATAT GCGAACACCT TTGGATTCCG
GGCTTCGGCA CACACGGATG CTACGCTGAA TATATGAAGG TGCCGGCCAG CTCGGTAGTA
GACCTCGTTA AAGTCCCAGA AGGCGTCCCG CCTGAATACG CGGCTATAAC CGGGTGCGGT
TTCGGCACTG CGTGGAACGC CCTAGTGGTT AAGAACGGCA TTAGGCCTGG CGAAACGCTA
TTAATAACGG GAGCGGGGGG CATGGGCCTC AGCGCTTTGT TAATAGCCTC TGCCGCCGGG
GCGAAAACCG TCGTGGTCGA TGTAAACCCC GCCTCAGTAG AAAAGGCGAA GAAAATGGGA
GCAACTGCGG CATATCACTA CTCTGGACAT CCCCAGGAGC TCGCCAAGCT CGTTAACGAG
GAGATCGTGA AGTCTTTTGG CATGGTCGAT GCTGTGTTCG ATTCCACGGG CAATCCCGAC
GTCCTATCCG CGGTGTTGCC GGCAGTGCGG CCGCAGGGCA GGATACTGTT GGCGGGGCTC
ATGATGAAGG GCAAGGAGAT CTGGCCGCTG GCCTCCGATA TAGTAGTCGC CAGAGAGTTG
ACCATACAAG GAGTGTTGAT GCTACCGTCG CAGAAATACG ACGGGATATT TAAGCTTATA
TCGGAGGGAA GGGTGAACCT TGAGCCTGTG ATCTACCGGA GGATATCTCT CGATGAGGTG
AACGACGCAT ACGCCGAGAT GTCCCGTTTC AAAAACGCCG GCAGATTTGT AATTACTAAA
TTTTAA
 
Protein sequence
MRAALLTEFN KPLEIKDVEV PKVGKGEVLL QVLACGICRS DWHLWRGDPS LVAYMQWSGG 
KLPIIPGHEV AGRVVEVGEG VSNVEVGDVV VAPASSTGDN RTCRYCKEGA SNICEHLWIP
GFGTHGCYAE YMKVPASSVV DLVKVPEGVP PEYAAITGCG FGTAWNALVV KNGIRPGETL
LITGAGGMGL SALLIASAAG AKTVVVDVNP ASVEKAKKMG ATAAYHYSGH PQELAKLVNE
EIVKSFGMVD AVFDSTGNPD VLSAVLPAVR PQGRILLAGL MMKGKEIWPL ASDIVVAREL
TIQGVLMLPS QKYDGIFKLI SEGRVNLEPV IYRRISLDEV NDAYAEMSRF KNAGRFVITK
F