Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1301 |
Symbol | |
ID | 5056340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1175363 |
End bp | 1176679 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640468847 |
Product | nickel-dependent hydrogenase small subunit |
Protein accession | YP_001153516 |
Protein GI | 145591514 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATAA AAAGACGTGA TTTCCTGAAG GCGTCGGCCC TCGCTTCAAT GTTGGCATCT CTCAACTGGA GCGCCTTAGT GAAGGCGGCA GGCGAGACCA TTAGGAGCGG CGGTATAGGA GTTGTCTGGT TTGAGGCTCA GGACTGCGCC GGCAATACGA CTGCAGTGAT CCAAGCTACT GACCCGTCCC TTCTAGACGT TTTGCTGGGA ACGACGCCGC TGGTGGGCCC CGGCACCGTC AGACTTCTAT TCCACCATAC TGTGATGCCG CAGTGGGGTA CGTATCACAT ACAGTCCCCC TCTGACGTCG CTGAACATAC AAAGCTTGAG CAGTATTTAG CAACCCAGCC TCCGCCTGGC GACGCCATGA AAATCTTAGA AGAGATCGCA GAGGGGAAGC ACGGCCCCTA CGTCTTGGTC CTAGAAGGGA GCTTCCCCCA AGAATATGGC ATTTCGGGCT CAAACATTGA AACGAAAGGC GGCTACTACT GCGTAGTGGG TCACAGAACA TGTACCGAGT GGGCAAAGCT CTTATTTAAA AACGCGGCCG CGGTGGTGGC CGTGGGCAAC TGCGCTGCCT ATGGTGGGGT AGTTGCGAAC AAGGTGTTGG AACCTCCGCC GAATTTCAAA TTCCCCACTT GGTCGCCGTC TCCCACTGGC GCAATAGGCA TGTTCGACGA CCCGATAAGG GGAGTAAAGG GAATGATCCA CCAGCCGTAC TTCCAGCCAG AAGTGGAGCC GTTCCGCAAG TATATTGACG AGGGAGGAGT CCCTGACTTT AAGACAATAA AGCCAGCTGT TGCCGTGCCG GGATGTCCGG CTAACGGCAA CGGCATTCTC AGAACTCTAG CATTACTAGT ACTAGTAGCA GGGGGGGTAT TAAAGCCCGA CGTCCTTGAG AGAAGGGCGT TTCTCGACGA ATATGCAAGA CCGAGGTTTA TATTTGACCA AACCGTCCAC GAGCAGTGCC CAAGGGCGGG ATCCTACGCC GCAGGCGATC TCCGCCCCTA CGCTGGCGCC GGCGATTACA AATGCCTATT CGGGGTGGGC TGTAAGGGGC CTATTTCAAA TTGTCCGTGG AATAAGGTGG GATGGGTCAG CGGAATAGGG GGTCCGACAA GGACGGGAGG CGTGTGTATA GGATGTACTA TGCCCGGATT TACCGACGCC TACGAGCCAT TCTGGGCTCC ACTTAACGCG CCAAGGTTGC CTGCAATACC CACGCTCGTG GCCGCTGTGG GGGGCGCAGC AGTGGCAGGG TTGGCTGGCG CTTATTTAAT GACCCGCGGG GCTAAGGAAA AGGAGGAAAA GAAGTAG
|
Protein sequence | MLIKRRDFLK ASALASMLAS LNWSALVKAA GETIRSGGIG VVWFEAQDCA GNTTAVIQAT DPSLLDVLLG TTPLVGPGTV RLLFHHTVMP QWGTYHIQSP SDVAEHTKLE QYLATQPPPG DAMKILEEIA EGKHGPYVLV LEGSFPQEYG ISGSNIETKG GYYCVVGHRT CTEWAKLLFK NAAAVVAVGN CAAYGGVVAN KVLEPPPNFK FPTWSPSPTG AIGMFDDPIR GVKGMIHQPY FQPEVEPFRK YIDEGGVPDF KTIKPAVAVP GCPANGNGIL RTLALLVLVA GGVLKPDVLE RRAFLDEYAR PRFIFDQTVH EQCPRAGSYA AGDLRPYAGA GDYKCLFGVG CKGPISNCPW NKVGWVSGIG GPTRTGGVCI GCTMPGFTDA YEPFWAPLNA PRLPAIPTLV AAVGGAAVAG LAGAYLMTRG AKEKEEKK
|
| |