Gene Pars_1287 details


Gene Information

Locus tag: Pars_1287
Symbol:
ID: 5056078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1164413
End bp: 1165624
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 61%
IMG OID: 640468833
Product: hydrogenase formation HypD protein
Protein accession: YP_001153502
Protein GI: 145591500
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0409] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00075] hydrogenase expression/formation protein HypD
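
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1164413, 1165624)
print(length, protein_length_aa(length))  # 1212 403
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.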


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.791611
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCG AGATTGCCGC GATAGAGCAG GCGTTTAGGC GCAAGACGGC AGTGACCTCG 
ACCCTTATCC GGAGCATCAA GAAGTACTCC GAGGAGCTCA AGGCAAGGGA TCCGGGCTAT
GTCTATAAGA TCATGGATTT CTGCGGCACG CACGAGTGGA CCATAGTCCA CTTCGGCTTG
AGGAGCCTTT TGAAGAAGGC CGGGGTCGAC AACGTGGAGC TCGTCGCGGG GCCGGGCTGC
CCCGTGTGCG TCACGCCCTC CTACTACATA GAGCAAGCCA TCAAGCTGGC CCTTGAAGGC
GTCGTGATCT ACACCTATGG GGACGTGTAC AAGTTGCCGG CTCTGCGCCC CGTCAAGGGC
GCGCGCTCGC TCGCCGAGGC GAGGGCGCTG GGCGGCGACG TAAGGATAGT GCATTCGTTC
CTGCACGCGA TCTTGGACGC GCGGAAACAC GCCAAGCCCT CGGCCTTCGT CGGGATAGGG
TTCGAGACGG TTGCGCCGGG CTATTCCGAG GCCATCTTGA AGGGGCTGGT GCCCAACCAC
CTCAAGCTCA TGTCCTTGGT CAAGCTCACC CCGCCCGCCA TGTTCTACAC GCTCGAGGTC
GTGAGGGAGA AGCCTACCGA CTTCCCCATA TCGGGCGTCA TAGCGCCGGG CCACGTGTCG
ACCATAGTGG GCGGCAAGGC GTGGCGGCCC GTGGCCGAAC AGTTCGAGAT ACCCGTGGTC
GTGGCGGGCT TCGAGCCCAA CGACGTCTTG ACCGCCGTTG CGGAGATACT GAGGCAGTTA
GCCAAAGGCG AGCACAAGGT GGTGATAGAA TACACGAGAG CTGTAACGTG GGAGGGGGAC
TTAAAAGCCC AGTCGTCCAT AAGGACTGTG TTCGAGACCG TGGACTCGGC CTGGCGGGGC
ATAGGCTATA TCCCAAAGAG CGGGCTTGCG CTGAGGGATG AATTCAAGAA ACATGACGCG
TTGGAGCATT TCGGCATACC GGACCTAACG CCGGATACTT GGCGCTACGA CCTCCCGGCC
AACTGTAAAT GCGCCGAGGT CAACTTGGGC AAGGCGAAGC CCACCGACTG CCCGCTCTTC
ATGAAGGCCT GCACGCCGGA TAGGCCGATA GGCCCTTGCA TGGTGTCCGT CGAGGGGACT
TGCGCCATAT GGGCTAGATT CGGCGGAGGA GGGCTGGCCG AGGAAATAGC TAAGGAAATC
GGTGTGTTCT AG
 
Protein sequence
MKREIAAIEQ AFRRKTAVTS TLIRSIKKYS EELKARDPGY VYKIMDFCGT HEWTIVHFGL 
RSLLKKAGVD NVELVAGPGC PVCVTPSYYI EQAIKLALEG VVIYTYGDVY KLPALRPVKG
ARSLAEARAL GGDVRIVHSF LHAILDARKH AKPSAFVGIG FETVAPGYSE AILKGLVPNH
LKLMSLVKLT PPAMFYTLEV VREKPTDFPI SGVIAPGHVS TIVGGKAWRP VAEQFEIPVV
VAGFEPNDVL TAVAEILRQL AKGEHKVVIE YTRAVTWEGD LKAQSSIRTV FETVDSAWRG
IGYIPKSGLA LRDEFKKHDA LEHFGIPDLT PDTWRYDLPA NCKCAEVNLG KAKPTDCPLF
MKACTPDRPI GPCMVSVEGT CAIWARFGGG GLAEEIAKEI GVF
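
The gene and protein sequences are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the CDS translated. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons); the function names are hypothetical, and the sketch does not handle ambiguous bases.

```python
from itertools import product

# Codon table in NCBI order: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, truncated to its first three blocks.
print(translate("ATGAAACGCG AGATTGCCGC GATAGAGCAG"))  # MKREIAAIEQ
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two blocks correspond.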