Gene Pars_0453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0453 
Symbol 
ID5054621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp396108 
End bp398096 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content55% 
IMG OID640468018 
Product3-hydroxyacyl-CoA dehydrogenase, NAD-binding 
Protein accessionYP_001152703 
Protein GI145590701 
COG category[I] Lipid transport and metabolism 
COG ID[COG1024] Enoyl-CoA hydratase/carnithine racemase
[COG1250] 3-hydroxyacyl-CoA dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.840152 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGA TTGCGGTTAT TGGTGCCGGC ACAATGGGGC ATGGAATTGC TGAGCTTTTC 
GCCATCGCGG GCTATGAGGT TGCGTTGGTG GATATTGCTG AAGATTTTCT AAAAAAGGCT
TTACAAAATA TTGAGTGGTC TTTGAGGAAG CTTGCGGAGA AAGGGCAGGT TAAAGAAGAT
CCATCTGTTA TTCTTTCAAG AATTAAACCT ATTGTAAACG ACGTGTGTAA GGCGGTTGAG
GGGGCTGAGC TCATGGTCGA GGCCGCGGTT GAGAATATAG ACGTGAAAAA GAAGGTGTTT
TCGGAAGCTG ACCGCTGTGC TCCTCCAAAT GCCATACTTG CGACGAACAC CTCGTCGTTG
CCTATCACGG AGATAGCAGA GGCGGTGAGG CCAGAGAGGA AGCCGCTTGT CGTCGGGATG
CACTTCTTCA ACCCGCCTGT CCTGATGCCG CTTGTTGAGA TAATAAAGGG GGCTTACACA
AGCGACGAGA CTGTGAAAAA GATTGCCGAC TACGCCGCGA AGCTTGGCAA GCAGACTGTA
GTGGTGAATA AGGACGTGCC TGGGTTTATT GTAAATCGCA TACTTGCGAG GGTGAACGAG
GCTGCGTGTT GGATGGTCGC CCGCGGCGAG GCTACAATAC AAGAGGTGGA CTCCGCCCTG
ATGTACAAGG CGGGGTTGCC CATGGGCGCG TTTATACTCA TGGACTACAC CGGTATAGAC
GTCGTATGTT TTATAGGAGA CGCCATGTTG AAGCGCGGCT TCAAGTCGCA TCCATGTCCT
GTAATTACTC AGAAGTGTCA AGAGAAGAAG TTCGGCGTTA AGTCTGGCGA GGGCTTCTAC
AAGTACCCAG CGCCGGGGAA GTTCCAGTGG CCGGAGGTGC CCAAGTCGGC TGGTGACAAG
ATCGACGTGA CGTATCTACT AGCGCCAGCT GTAAACGAGG CTGCCTACTT ATTGCGAGAG
GGGATTGCCA CCAGAGAGGA TATCGATAAG GCGGTGAAGT TGGGCCTCAA CTGGCCGAAG
GGCCCGCTGG AATTTGCAGA TGAGCTGGGT ATAGACGCGG TGGTGAAGGC GTTGGAGACA
TGGAGGCAAA AAACTGGATA TGAAGAGTAC GAGCCGGATC CGTTGCTTAA GGAAATGGTT
TCGCAGGGTA AAACTGGCAA GAAAGCCGGC GAGGGGTTCT ACACTTACGT AAAGGCCGAG
GAGAAGAAGT TAGAGACTCT CATAGTGCGC TACGAGCCCG GCGTTGCGTG GATAATTCTG
AACAGGCCGG AGAGGCTCAA CGCCATTAAC CCCAAGATGG TCGAGGAACT CTGGAGGGTG
CTGGACGAGA TCGAGCAGAT GGACTACGAA AAGGTGAGGG TGGTGGTGAT CACGGGCGCA
GGCAGGGCGT TCTCCGCCGG GGCCGACGTG ACGGGCTTTA TGGGCGCCAC CCCCGTCACC
ATCTTCAAGG TGTCGAGGAA GCTTCAGATG CTGTACGAGA GACTGGAGTT GTTGGACAGG
CCGGTAATAT GCGGGCTGAA CGGCTACACG CTGGGCGGAG GCCTGGAGCT GGCCATGGCC
TGCGACTTCC GCATTGCGGC TGAGACCGCC GAGCTGGGCC AGCCGGAGAT AAACCTAGGC
TTCATACCCG GCGCCGGCGG CACTCAGCGC CTCGCTCGTC TAATCGGGAG GGATCGAGCC
AAGGAGCTGA TATTCACAGG CGACAGGATA CCGGCTAGGG AGGCGGAGAG GCTTGGCCTT
GTCCACAAGG TGGTGTCGCC TGACAAGCTT GAACAGGAGC TGAGGGCGTT CGCGGCTAAG
CTCGCCGAGA AGCCGCCGCT GGCTCTCGCC ATGGCGAAAT ATGCCATAAA CTTCGGCCTG
GAGGCGCCTC AATGGGTAGG GATGATGCTA GAGGCGACGC AGTTCGGCCT ACTGTTCAGC
ACAGAGGATG TAATAGAAGG CGTGTCGTCG TTTTTGCAGA AGAAAAAGCC GCAGTTCAAA
GGGAGATAG
 
Protein sequence
MPKIAVIGAG TMGHGIAELF AIAGYEVALV DIAEDFLKKA LQNIEWSLRK LAEKGQVKED 
PSVILSRIKP IVNDVCKAVE GAELMVEAAV ENIDVKKKVF SEADRCAPPN AILATNTSSL
PITEIAEAVR PERKPLVVGM HFFNPPVLMP LVEIIKGAYT SDETVKKIAD YAAKLGKQTV
VVNKDVPGFI VNRILARVNE AACWMVARGE ATIQEVDSAL MYKAGLPMGA FILMDYTGID
VVCFIGDAML KRGFKSHPCP VITQKCQEKK FGVKSGEGFY KYPAPGKFQW PEVPKSAGDK
IDVTYLLAPA VNEAAYLLRE GIATREDIDK AVKLGLNWPK GPLEFADELG IDAVVKALET
WRQKTGYEEY EPDPLLKEMV SQGKTGKKAG EGFYTYVKAE EKKLETLIVR YEPGVAWIIL
NRPERLNAIN PKMVEELWRV LDEIEQMDYE KVRVVVITGA GRAFSAGADV TGFMGATPVT
IFKVSRKLQM LYERLELLDR PVICGLNGYT LGGGLELAMA CDFRIAAETA ELGQPEINLG
FIPGAGGTQR LARLIGRDRA KELIFTGDRI PAREAERLGL VHKVVSPDKL EQELRAFAAK
LAEKPPLALA MAKYAINFGL EAPQWVGMML EATQFGLLFS TEDVIEGVSS FLQKKKPQFK
GR