Gene Pars_1301 details

Gene Information

Locus tag: Pars_1301
Symbol:
ID: 5056340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1175363
End bp: 1176679
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 56%
IMG OID: 640468847
Product: nickel-dependent hydrogenase small subunit
Protein accession: YP_001153516
Protein GI: 145591514
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
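
The coordinates and lengths in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1175363
END_BP = 1176679


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1317
print(protein_length(length))  # 438
```

Both results match the Gene Length (1317 bp) and Protein Length (438 aa) fields.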


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATAA AAAGACGTGA TTTCCTGAAG GCGTCGGCCC TCGCTTCAAT GTTGGCATCT 
CTCAACTGGA GCGCCTTAGT GAAGGCGGCA GGCGAGACCA TTAGGAGCGG CGGTATAGGA
GTTGTCTGGT TTGAGGCTCA GGACTGCGCC GGCAATACGA CTGCAGTGAT CCAAGCTACT
GACCCGTCCC TTCTAGACGT TTTGCTGGGA ACGACGCCGC TGGTGGGCCC CGGCACCGTC
AGACTTCTAT TCCACCATAC TGTGATGCCG CAGTGGGGTA CGTATCACAT ACAGTCCCCC
TCTGACGTCG CTGAACATAC AAAGCTTGAG CAGTATTTAG CAACCCAGCC TCCGCCTGGC
GACGCCATGA AAATCTTAGA AGAGATCGCA GAGGGGAAGC ACGGCCCCTA CGTCTTGGTC
CTAGAAGGGA GCTTCCCCCA AGAATATGGC ATTTCGGGCT CAAACATTGA AACGAAAGGC
GGCTACTACT GCGTAGTGGG TCACAGAACA TGTACCGAGT GGGCAAAGCT CTTATTTAAA
AACGCGGCCG CGGTGGTGGC CGTGGGCAAC TGCGCTGCCT ATGGTGGGGT AGTTGCGAAC
AAGGTGTTGG AACCTCCGCC GAATTTCAAA TTCCCCACTT GGTCGCCGTC TCCCACTGGC
GCAATAGGCA TGTTCGACGA CCCGATAAGG GGAGTAAAGG GAATGATCCA CCAGCCGTAC
TTCCAGCCAG AAGTGGAGCC GTTCCGCAAG TATATTGACG AGGGAGGAGT CCCTGACTTT
AAGACAATAA AGCCAGCTGT TGCCGTGCCG GGATGTCCGG CTAACGGCAA CGGCATTCTC
AGAACTCTAG CATTACTAGT ACTAGTAGCA GGGGGGGTAT TAAAGCCCGA CGTCCTTGAG
AGAAGGGCGT TTCTCGACGA ATATGCAAGA CCGAGGTTTA TATTTGACCA AACCGTCCAC
GAGCAGTGCC CAAGGGCGGG ATCCTACGCC GCAGGCGATC TCCGCCCCTA CGCTGGCGCC
GGCGATTACA AATGCCTATT CGGGGTGGGC TGTAAGGGGC CTATTTCAAA TTGTCCGTGG
AATAAGGTGG GATGGGTCAG CGGAATAGGG GGTCCGACAA GGACGGGAGG CGTGTGTATA
GGATGTACTA TGCCCGGATT TACCGACGCC TACGAGCCAT TCTGGGCTCC ACTTAACGCG
CCAAGGTTGC CTGCAATACC CACGCTCGTG GCCGCTGTGG GGGGCGCAGC AGTGGCAGGG
TTGGCTGGCG CTTATTTAAT GACCCGCGGG GCTAAGGAAA AGGAGGAAAA GAAGTAG
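
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line, so it needs to be cleaned before any computation. The following is a minimal sketch of how that could be done and how a GC percentage, such as the 56% listed in the record, might be recomputed. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGCTGATAA AAAGACGTGA TTTCCTGAAG GCGTCGGCCC TCGCTTCAAT GTTGGCATCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Passing the full 1317 bp sequence to `clean_sequence` and then to `gc_content` gives the value to compare against the GC content field.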
 
Protein sequence
MLIKRRDFLK ASALASMLAS LNWSALVKAA GETIRSGGIG VVWFEAQDCA GNTTAVIQAT 
DPSLLDVLLG TTPLVGPGTV RLLFHHTVMP QWGTYHIQSP SDVAEHTKLE QYLATQPPPG
DAMKILEEIA EGKHGPYVLV LEGSFPQEYG ISGSNIETKG GYYCVVGHRT CTEWAKLLFK
NAAAVVAVGN CAAYGGVVAN KVLEPPPNFK FPTWSPSPTG AIGMFDDPIR GVKGMIHQPY
FQPEVEPFRK YIDEGGVPDF KTIKPAVAVP GCPANGNGIL RTLALLVLVA GGVLKPDVLE
RRAFLDEYAR PRFIFDQTVH EQCPRAGSYA AGDLRPYAGA GDYKCLFGVG CKGPISNCPW
NKVGWVSGIG GPTRTGGVCI GCTMPGFTDA YEPFWAPLNA PRLPAIPTLV AAVGGAAVAG
LAGAYLMTRG AKEKEEKK
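
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in which start codons are permitted), and it does not model alternative start codons, which is harmless here because the gene begins with ATG. The function name `translate` is illustrative and not tied to any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (first three blocks, spaces removed).
print(translate("ATGCTGATAAAAAGACGTGATTTCCTGAAG"))  # MLIKRRDFLK
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GAAAAGAAGTAG"))  # EKK
```

The two outputs match the first ten residues (MLIKRRDFLK) and the last three residues (EKK) of the protein sequence.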