Gene Pars_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1164 
Symbol 
ID5054547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1049932 
End bp1051839 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content62% 
IMG OID640468714 
Productaldehyde ferredoxin oxidoreductase 
Protein accessionYP_001153387 
Protein GI145591385 
COG category[C] Energy production and conversion 
COG ID[COG2414] Aldehyde:ferredoxin oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.19471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00826779 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGATTA AAGGAGTCAA CGGGGTTGTA GGTTTTGTAA ATTTATCTAC TGGCGAAGTG 
AAGAAGTTAC AAGTCCCAGC CGAAGTCTAC AGATACTTCC TCGGGGGCTA CGGCCTCGGG
GCCTGGTTCG TCTACCGGCA CATGCAACGG GGAGCCGACC CCCTGGGCCC CGGCAATGTG
CTTGGCATTG TCTCGGGGCT TCTAAACGCC GCGGGCATCC CCATGACTGG GAGGTCAGCG
GCGGTGGGGA AGTCGCCCCA GACAGGGGGC TGGGGAGACG CCAACGCCGG GGGAAGGTTC
GGGCCTAAGC TCATGGAGGC GGGGCTAGAC GCCATATTCG TAACAGGGGC CGCCGAGAAG
CCGGTGTACA TCCTCGTAGA GAACGGCAAC ATATACATCG AAAACGCCTC CGACCTCTGG
GGCAAGAACA CGAGGGAGAC GGAGGACGAG CTTAGACGTA GGCATAAGGG CTCCCAGGCT
GTGGTGATAG GCCCTGCTGG GGAGAGGTTA CAGCTGATAG CCGCAATAAT TAACGAGTAC
GGGAGGGCCT ACGGCCGCTC GGGCCTAGGC GCCGTCATGG GGAGCAAGAA GCTCAAGGCG
ATAGTGGCGG TGGGGGACAA GAAGCCAGAG GTGTACGACC GGGAGCTCGT GAGGCAGAAA
GTGATGGAGA TGGCCCAGAT GCTCAAGACC ACCAGGGCCA AGGCATATGA GATTTGGCAC
AAATACGGGA CGATATCCAC CCTGGACTCC AGCGTCATGT CCGGCGACAC TCCCATAAAA
AACTGGGCAG GCATAGGGCC AAGGGACTAC GGCGAGAAGA ACTTGGAGAA GTTCGGCACA
AACACTGTGA TAAGGGACAA CGTCAAGCCC TACGCCTGCG CCAGCTGCCC AGTGGCCTGC
GGCGCCTGGA TAAGGCGGGC CACGAGGTAC GGCGTGGTGG AGGGGCACAG GCTGGAGTAC
GAGACGGCCT CCCTCTTCGG CCCCGCGCTA CTCGACGCAG ACCAAGACTC GGTGGTGTAC
GCCGGGGAGC TGTGCAACCT CTACGGCCTA GACACAATAT CGACAGCAAG CGTAGTGGGG
TTCGCCTTTG AGCTCTACGA GAGGGGCATC CTCACGGAGA GGGACGTCAG CTTCCCCCTG
AGGTGGGGCG ACCCAGACGC CGTGGTCAAG CTGGTGGAGC TGATAGGGAA GGCCGAGGGC
ATCGGCAAAG TGCTCGGCCA AGGAGTCAAA AAAGCCGCGG AGATCTTGGG CAGAGGCGCG
GAACAGTACG CCATGCATGT CGGGGGACAG GAGCTACCAG CCCACCACCC CCAGTACCTA
CCAGGCCTCG CCATCGCCTA CACCGGCGAC CCCACCCCGG CGAGGCACAC CGCCGGCGGC
GTCCACTGGT GGGGAGAGGG AGGCAAGAGG CTCTGGGCCC CCTTCGACCT GGGGCAAGAC
CTCGGCGCCT CGCCCAAGTA CGAATACGGC GACAAGGGCC GCAAAGCCGC CCTCATCACA
ATGGCTGTCC AGGTGGAGAA CGCCCTGGGC TTCTGCCAGT TCTCCTCCTC AGTGATATAC
CGCACCCTGC CCTACGCCGA CCTAATATAC GCCGTCACTG GGATGAAGTA CACCCCCCAA
GAGCTGTTAA AAGTCGGCCA GAGAATACAA ACCCTAAGAC AGCTATTCAA CGTGAGAGAA
GGCGCAAAAC CACGCGAGTG GAGGCTCCCC AGGAGGGTAC TGGAGCCCTT CCCAGAGGGC
CCCCTCGCCG GGGTGAGGCT AACCGAGGAC GACGTGGAGA TGATGAGGGC GCAGTACTGG
GAAGCCCTGG GCTGGGACCC CGCCACGGGC TACCCCAGAA GAGCCACAGT ACAAGAGCTG
GGACTAGCAG AAATCGCGGC AGACGTCCTG GACCTACTAC CCCCCTAA
 
Protein sequence
MAIKGVNGVV GFVNLSTGEV KKLQVPAEVY RYFLGGYGLG AWFVYRHMQR GADPLGPGNV 
LGIVSGLLNA AGIPMTGRSA AVGKSPQTGG WGDANAGGRF GPKLMEAGLD AIFVTGAAEK
PVYILVENGN IYIENASDLW GKNTRETEDE LRRRHKGSQA VVIGPAGERL QLIAAIINEY
GRAYGRSGLG AVMGSKKLKA IVAVGDKKPE VYDRELVRQK VMEMAQMLKT TRAKAYEIWH
KYGTISTLDS SVMSGDTPIK NWAGIGPRDY GEKNLEKFGT NTVIRDNVKP YACASCPVAC
GAWIRRATRY GVVEGHRLEY ETASLFGPAL LDADQDSVVY AGELCNLYGL DTISTASVVG
FAFELYERGI LTERDVSFPL RWGDPDAVVK LVELIGKAEG IGKVLGQGVK KAAEILGRGA
EQYAMHVGGQ ELPAHHPQYL PGLAIAYTGD PTPARHTAGG VHWWGEGGKR LWAPFDLGQD
LGASPKYEYG DKGRKAALIT MAVQVENALG FCQFSSSVIY RTLPYADLIY AVTGMKYTPQ
ELLKVGQRIQ TLRQLFNVRE GAKPREWRLP RRVLEPFPEG PLAGVRLTED DVEMMRAQYW
EALGWDPATG YPRRATVQEL GLAEIAADVL DLLPP