Gene Pars_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1298 
Symbol 
ID5055119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1171312 
End bp1173228 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content58% 
IMG OID640468844 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_001153513 
Protein GI145591511 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAA TAAAGCTCTG GATAGATCCC ATTACGCGCA TAGAGGGTCA CCTAGGGCTT 
TATGCCGAGG TGGATGCCGC TACCCGCGCC GTCTCCGTTG CGAAGACTAC CGTCATGATG
TTCCGGGGCT TTGAGGTCTT TTTAAGGGGG AGGCCCCCAG AGGACGCCAT TGCCATAACC
TCGCGCAGTT GCGGCGTCTG CGGGGCGGCC CACGCCAACG CCTCAACGAG GGCTTGCGAT
GCCGCGGCTG GCATGACCCC CCTCCCCATG GGCAACGTGT TGAGGAATTT GGCCTACGCA
ATGACAGATT ACACCTACGA CCACCCACTC ATCCTCAACA TGTTGGAAGG CCCCGACTAC
AGTGAACTGA TTGTGAGTAA GCTGACGCCT TCTGTGTGGC AGACTGCCCA GCAGACGCCA
GCAAAATACT CCAGCATACA CGGCTACCGC ACTATAGCTG ACATAATGCG CGACCTCAAC
CCCATCCAAG GGCGTATTTG GCAACTGACG GTGAAATACC AGCGCATAGC CAGAGAGGCC
GGTGTGTTGA TATATGGCCG CCACGCCCAC CCAGCGACGT TAATACCCGG CGGCATATCG
ACAGACATAA CAAACCTGGC ATCGTTGCTC CAGGAGTACT ACGCGCGCCT ATCCCTCTTG
ACCGCTTGGG TTAAGTTCGT CTGGGCCATA TGGCAAGACC TCTACGAGTT CTTCAGAGAC
CACGTATCGA CGCCGGACGG ACAGCCTTAC GCCCTAACGC AAGGCAAGAC CCACGACCCG
CCCGTGATGC TCGCGGGCGG ATGGTCCGAT GACCCCGAGG TCTACAGTAA TATATACGAC
GAGGCTGGCG GTGATTGGGT GAAGATGTAC TCCCTCCTGG ATAAGGCCTA CAACGCCAGA
TGGGAAAAGC CCGGCTTTGC GATAGGCCAC GAGATCTACA GCCCCAACCC CACCGAGATT
CAGCTGGGCT ATCTCGAATT CGCCGACTCC TCCTTCTACG AGGACTGGGT CAAGGCCAAC
GTGGCTCCGC CCTACGGCTG GCTCAAAACA GATCCGTTGG GCAGAGAGCT GGCATACGGC
ACAGACCTCT ACAAATACCA TATGTGGAAC CGTACCACGA TCCCGAAGCC CGGCGCCATA
AACTTCGCCG AGAAGTACAC GTGGGCCGCC GAGCCTAGGC TCCTGCTAAA AGACGGCAAG
ATTGCCCCGA TAGAGACTGG GCCTATATCT CAGCTCTGGC TCAACACGTT GCACGCGACT
AAGGTCGAAG TTGATAACCA CAAGGCTTGG GAGAGCAACG GGAGCCAGCT CAAGGTTTAT
CTGCCCGGAG GCACCGTAAA TCCTGACCTC CCGCCAGGCA CCGCCGAGGA GTTGGTGATA
ACGTGGAACT TGCCCAAATA CTCTACGACT ATGGAGAGGT TGTTAGCAAG AGCTGTGCAC
TTGGCCCTCG TGGTGTCCCT CGCTTGGGCT AACCTCCTCT ACGCCTTTAA GCTTATAAAC
GCCGGGAAGA TCCAGACGTC GAGGCCATGG AGCTACGGAA AATGGCCAAG CTTCTCTTAC
AGCTTCGGCT GGTGGCAGGT GCCGAGGGGC AACTGCATGC ACTGGCTGGT TCAGCAGAAT
GGTCGCCTGG CCAACTACCA GTACGAGGCG CCCACCACCC CCAACGTGAG CCCGACTAAT
AACAGATGTA CCGACCCTTG GAAGGGCCAG TGCGCCGGCC CGTTCGAGAT GTCGGTACGC
AACAGCAAGG TGACAGAGGA GGTGCCGCCG GATCAGTGGA CCGGCCTAGA CCTCGTGCGC
GCCATTCGTA GCTTCGATCC CTGTCTAGCG TGCGCGGTGC ACTTTGAGGC TAAGGGCGAG
GGTGGCCGCG TGTACAACGT GATAGAGAAA GTTATATGGA ACGCTTGCTC GTTGTAG
 
Protein sequence
MSTIKLWIDP ITRIEGHLGL YAEVDAATRA VSVAKTTVMM FRGFEVFLRG RPPEDAIAIT 
SRSCGVCGAA HANASTRACD AAAGMTPLPM GNVLRNLAYA MTDYTYDHPL ILNMLEGPDY
SELIVSKLTP SVWQTAQQTP AKYSSIHGYR TIADIMRDLN PIQGRIWQLT VKYQRIAREA
GVLIYGRHAH PATLIPGGIS TDITNLASLL QEYYARLSLL TAWVKFVWAI WQDLYEFFRD
HVSTPDGQPY ALTQGKTHDP PVMLAGGWSD DPEVYSNIYD EAGGDWVKMY SLLDKAYNAR
WEKPGFAIGH EIYSPNPTEI QLGYLEFADS SFYEDWVKAN VAPPYGWLKT DPLGRELAYG
TDLYKYHMWN RTTIPKPGAI NFAEKYTWAA EPRLLLKDGK IAPIETGPIS QLWLNTLHAT
KVEVDNHKAW ESNGSQLKVY LPGGTVNPDL PPGTAEELVI TWNLPKYSTT MERLLARAVH
LALVVSLAWA NLLYAFKLIN AGKIQTSRPW SYGKWPSFSY SFGWWQVPRG NCMHWLVQQN
GRLANYQYEA PTTPNVSPTN NRCTDPWKGQ CAGPFEMSVR NSKVTEEVPP DQWTGLDLVR
AIRSFDPCLA CAVHFEAKGE GGRVYNVIEK VIWNACSL