Gene Information |
Locus tag | Pars_1309 |
Symbol | |
ID | 5054444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1181788 |
End bp | 1182828 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468855 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_001153524 |
Protein GI | 145591522 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
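The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based and inclusive (which is consistent with the 1041 bp reported) and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative only and not part of any database API:

```python
# Values copied from the record above.
START_BP = 1181788
END_BP = 1182828


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(START_BP, END_BP)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Both results agree with the Gene Length and Protein Length rows of the record.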
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.19082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAGG TCGTCAAGTT GGCCCACGGG GCAGGCAGTG TGGAGACGTC GCAAATCCTC GAGTCATTGA TTTTCTCCAA GATCGAGGAG AGGCTTAAAA AAGTGGAGGG TGGCTTGGGT ATAGACTTCC CCGACGATGC GGCGGCAATA CCCATGGGCG ATGGGCGCTT TTTGGTCGTG ACGGTAGACT CCTACACGGT TAACCCGCCA TTTTTCCCCG GAGGCGATAT AGGCGTCTTA GCGGCCTCAG GCTCTATCAA CGATGTCTTA ATGTTAGGCG GAAAGCCCAT TGCCCTCATG GACGCCATCA TAGTAGAGGA GGGCTTCCCC CTGGAAGATC TGAGGAGAAT CGTGGATTCA ATGTTGAGGG TGTTGCGCGA GGAGGGCGTC GCGCTGATAG GCGGCGACTT CAAGGTGATG CCGAAGGGCC AGATAGACAA GATAGCGATA GCCACAGTGG GCATTGGGAT AGCCGATAGG CTGATAGTGG ACAGGCCCCA GCCTGGCGAT AAAATAGTCG TGAGCGGATA TCTCGGAGAT CACGGGGCTG TGATCTTGGC GAGGCAGATC GGCATAATAG ACGAAGGCTC GGGAGGTGGG CTCGTAAGCG ACGTAAAGCC CTTGACCAGG CTCATGTTAC CTCTAGTCGA GAAGTACGGC CCCCACATCC ACGCAGCACG CGACCCGACT AGAGGCGGGT TAGCCATGGC GCTCAACGAC TGGGCCAAGG CCTCCGGCAC TGTCATCATC GTGGAAGAAT CTGCGATACC CATTAGGCCC CAGGTGGCGT ACTACGCCAA CATGTTGGGC ATAGACCCCC TGGCGCTGGC CAGCGAAGGC GCGGCCGTGC TATCTGTAAG CCCCGACGTA GCCGAAGAGG TCGTGGAGTT TATGAAGAAG CTCGGCTTCG ACAATGCTGC AATCATAGGC GAGGTTAGAA AAGCCGAGAG GTACAGAGGG TACGTCCTGC TCAAGACCGT GGTAGGGGGG CTGAGAATAC TTGAGGCTCC CCGTGGGGAC CTCGTCCCGA GGATATGCTA A
|
Protein sequence | MGEVVKLAHG AGSVETSQIL ESLIFSKIEE RLKKVEGGLG IDFPDDAAAI PMGDGRFLVV TVDSYTVNPP FFPGGDIGVL AASGSINDVL MLGGKPIALM DAIIVEEGFP LEDLRRIVDS MLRVLREEGV ALIGGDFKVM PKGQIDKIAI ATVGIGIADR LIVDRPQPGD KIVVSGYLGD HGAVILARQI GIIDEGSGGG LVSDVKPLTR LMLPLVEKYG PHIHAARDPT RGGLAMALND WAKASGTVII VEESAIPIRP QVAYYANMLG IDPLALASEG AAVLSVSPDV AEEVVEFMKK LGFDNAAIIG EVRKAERYRG YVLLKTVVGG LRILEAPRGD LVPRIC
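The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate codons so the gene and protein rows can be cross-checked. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (the two tables differ only in permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three display blocks of the gene sequence above.
print(translate("ATGGGCGAGG TCGTCAAGTT GGCCCACGGG"))  # MGEVVKLAHG
# Last three codons of the gene sequence above.
print(translate("ATATGCTA A"))                        # IC*
```

The first call reproduces the first ten residues of the protein row, and the second reproduces its last two residues followed by the stop codon. Passing the full gene sequence to `gc_content` gives the value to compare against the GC content row.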