Gene Information |
Locus tag | Pars_1298 |
Symbol | |
ID | 5055119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1171312 |
End bp | 1173228 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468844 |
Product | nickel-dependent hydrogenase, large subunit |
Protein accession | YP_001153513 |
Protein GI | 145591511 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
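The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(1171312, 1173228)
print(length)                           # 1917
print(expected_protein_length(length))  # 638
```

Both results agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields listed in the record.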
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAA TAAAGCTCTG GATAGATCCC ATTACGCGCA TAGAGGGTCA CCTAGGGCTT TATGCCGAGG TGGATGCCGC TACCCGCGCC GTCTCCGTTG CGAAGACTAC CGTCATGATG TTCCGGGGCT TTGAGGTCTT TTTAAGGGGG AGGCCCCCAG AGGACGCCAT TGCCATAACC TCGCGCAGTT GCGGCGTCTG CGGGGCGGCC CACGCCAACG CCTCAACGAG GGCTTGCGAT GCCGCGGCTG GCATGACCCC CCTCCCCATG GGCAACGTGT TGAGGAATTT GGCCTACGCA ATGACAGATT ACACCTACGA CCACCCACTC ATCCTCAACA TGTTGGAAGG CCCCGACTAC AGTGAACTGA TTGTGAGTAA GCTGACGCCT TCTGTGTGGC AGACTGCCCA GCAGACGCCA GCAAAATACT CCAGCATACA CGGCTACCGC ACTATAGCTG ACATAATGCG CGACCTCAAC CCCATCCAAG GGCGTATTTG GCAACTGACG GTGAAATACC AGCGCATAGC CAGAGAGGCC GGTGTGTTGA TATATGGCCG CCACGCCCAC CCAGCGACGT TAATACCCGG CGGCATATCG ACAGACATAA CAAACCTGGC ATCGTTGCTC CAGGAGTACT ACGCGCGCCT ATCCCTCTTG ACCGCTTGGG TTAAGTTCGT CTGGGCCATA TGGCAAGACC TCTACGAGTT CTTCAGAGAC CACGTATCGA CGCCGGACGG ACAGCCTTAC GCCCTAACGC AAGGCAAGAC CCACGACCCG CCCGTGATGC TCGCGGGCGG ATGGTCCGAT GACCCCGAGG TCTACAGTAA TATATACGAC GAGGCTGGCG GTGATTGGGT GAAGATGTAC TCCCTCCTGG ATAAGGCCTA CAACGCCAGA TGGGAAAAGC CCGGCTTTGC GATAGGCCAC GAGATCTACA GCCCCAACCC CACCGAGATT CAGCTGGGCT ATCTCGAATT CGCCGACTCC TCCTTCTACG AGGACTGGGT CAAGGCCAAC GTGGCTCCGC CCTACGGCTG GCTCAAAACA GATCCGTTGG GCAGAGAGCT GGCATACGGC ACAGACCTCT ACAAATACCA TATGTGGAAC CGTACCACGA TCCCGAAGCC CGGCGCCATA AACTTCGCCG AGAAGTACAC GTGGGCCGCC GAGCCTAGGC TCCTGCTAAA AGACGGCAAG ATTGCCCCGA TAGAGACTGG GCCTATATCT CAGCTCTGGC TCAACACGTT GCACGCGACT AAGGTCGAAG TTGATAACCA CAAGGCTTGG GAGAGCAACG GGAGCCAGCT CAAGGTTTAT CTGCCCGGAG GCACCGTAAA TCCTGACCTC CCGCCAGGCA CCGCCGAGGA GTTGGTGATA ACGTGGAACT TGCCCAAATA CTCTACGACT ATGGAGAGGT TGTTAGCAAG AGCTGTGCAC TTGGCCCTCG TGGTGTCCCT CGCTTGGGCT AACCTCCTCT ACGCCTTTAA GCTTATAAAC GCCGGGAAGA TCCAGACGTC GAGGCCATGG AGCTACGGAA AATGGCCAAG CTTCTCTTAC AGCTTCGGCT GGTGGCAGGT GCCGAGGGGC AACTGCATGC ACTGGCTGGT TCAGCAGAAT GGTCGCCTGG CCAACTACCA GTACGAGGCG CCCACCACCC CCAACGTGAG CCCGACTAAT AACAGATGTA CCGACCCTTG GAAGGGCCAG TGCGCCGGCC CGTTCGAGAT GTCGGTACGC AACAGCAAGG TGACAGAGGA GGTGCCGCCG GATCAGTGGA CCGGCCTAGA CCTCGTGCGC 
GCCATTCGTA GCTTCGATCC CTGTCTAGCG TGCGCGGTGC ACTTTGAGGC TAAGGGCGAG GGTGGCCGCG TGTACAACGT GATAGAGAAA GTTATATGGA ACGCTTGCTC GTTGTAG
|
Protein sequence | MSTIKLWIDP ITRIEGHLGL YAEVDAATRA VSVAKTTVMM FRGFEVFLRG RPPEDAIAIT SRSCGVCGAA HANASTRACD AAAGMTPLPM GNVLRNLAYA MTDYTYDHPL ILNMLEGPDY SELIVSKLTP SVWQTAQQTP AKYSSIHGYR TIADIMRDLN PIQGRIWQLT VKYQRIAREA GVLIYGRHAH PATLIPGGIS TDITNLASLL QEYYARLSLL TAWVKFVWAI WQDLYEFFRD HVSTPDGQPY ALTQGKTHDP PVMLAGGWSD DPEVYSNIYD EAGGDWVKMY SLLDKAYNAR WEKPGFAIGH EIYSPNPTEI QLGYLEFADS SFYEDWVKAN VAPPYGWLKT DPLGRELAYG TDLYKYHMWN RTTIPKPGAI NFAEKYTWAA EPRLLLKDGK IAPIETGPIS QLWLNTLHAT KVEVDNHKAW ESNGSQLKVY LPGGTVNPDL PPGTAEELVI TWNLPKYSTT MERLLARAVH LALVVSLAWA NLLYAFKLIN AGKIQTSRPW SYGKWPSFSY SFGWWQVPRG NCMHWLVQQN GRLANYQYEA PTTPNVSPTN NRCTDPWKGQ CAGPFEMSVR NSKVTEEVPP DQWTGLDLVR AIRSFDPCLA CAVHFEAKGE GGRVYNVIEK VIWNACSL
|
| |
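The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch, assuming the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in the record.
print(translate("ATGTCTACAA TAAAGCTCTG GATAGATCCC"))  # MSTIKLWIDP
print(translate("GTTATATGGA ACGCTTGCTC GTTGTAG"))     # VIWNACSL
```

The two fragments reproduce the first ten and last eight residues of the listed protein sequence. Pasting the full gene sequence into `translate` would allow the complete 638-residue product to be compared in the same way.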