Gene Information |
Locus tag | Tpen_0591 |
Symbol | |
ID | 4600633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 544852 |
End bp | 546123 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773366 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_919999 |
Protein GI | 119719504 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA); [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCATTGA AAGGCATTAG CATCTCACGC CGAGATTTCT TAAAGATGGG CGGAGTGGCA GCCCTGCTTG CATCACTGAA CTGGGACGAG CTTGTCAGAC TGGCGGTGGC ACAGGCGCAG AGCGGTAGCA TAAACATCGT CTGGTTCGAG GCCCAGGACT GCGCAGGAAA CACTACGGCT TTAATACAGG CCACGGAGCC CGACCTCATA GATGTGCTTG GAGGCTCCTT CCACCTCGTG GGACCTGGAA ACGTGAAGCT AGCTTTCCAC GAGACAGTAA TGCCTGAGTG GGGTGAGGGG GCCTTAAGCA TAGCGCGCGC AGCTGCTCAG GGAAAGCTTG ACCCCTTTGT TCTCGTACTC GAGGGAAGTT TTCCGCAGGA CGAGAAGGCA GGAGGCCCGC CCGGAAGCGA CTCCTACTGC TACATAGGGG ACGAGAACGG GAAGGTAATA TCCTGCGTAG AATGGATGCG GAGGCTCCTT CCAAGGGCTG TTGCTGTGGT TACCATGGGT AACTGCGCTT CCTACGGAGG GCTCGTCGCG AACAAGGTCT TGGAGAGGGT ACCGGGTCTA CCTTCTTCCA GCTGGTCTGA TAGCCCGACG GGCGCTGTAG GATTCTTCGA CGACCCCATT AGGGGTATCA AGGGCCTTGT AAGCACGTTG CCGGAAGCCG AACCTTTCAG AAGGTTTATC ACAGGACAGT GCACGCTGAA GCCCGGGGAG ATAAGGCCCG ACTGTAGGCC GGCGATGGCC GTCCCAGGTT GCCCGGCTAA CGGCAACGGA CAGCTACGTG CACTAGCTGC TCTTGTTTTG TGGGCTAAGG GGTTGCTACC ATTACCGGAG CTGGACAAGT ACTGGCGTCC CAAGTTCATA TTTGGGAACA CGGTTCACGA GCAGTGTCCG AGAGCTGGTA GCTACGCTGC AGGAGACTTC AGAGCCTATC CAGGCGAGAA CTCCTACCGG TGCCTCTACG CGGTAGGGTG CAAGGGTCCG ATTTCTAACT GTCCTTGGAA CAAGCTAGGG TGGGTTAACG GAGTCGGAGG ACCCACGAGG ACGGGCGGCG TGTGTATAGG GTGTACTATG CCGGGCTTTA CGGACGCTTA CGAGCCGTTC TACAAGCCGT TGCAGGCGCC GCGCGTTCCC AGCGGGTTAG AAGGTGTAGC GGCTATCGCA GCGTTGCTAG CAGGGGTTGG CCTTGCTTAC GGCGCGTCGA AGAGCATCGA GAAGAGGAAG GAGGAGGTTG CTAAGGAGAT TAAGGAGGTG AAGGCCGAAT GA
Protein sequence | MSLKGISISR RDFLKMGGVA ALLASLNWDE LVRLAVAQAQ SGSINIVWFE AQDCAGNTTA LIQATEPDLI DVLGGSFHLV GPGNVKLAFH ETVMPEWGEG ALSIARAAAQ GKLDPFVLVL EGSFPQDEKA GGPPGSDSYC YIGDENGKVI SCVEWMRRLL PRAVAVVTMG NCASYGGLVA NKVLERVPGL PSSSWSDSPT GAVGFFDDPI RGIKGLVSTL PEAEPFRRFI TGQCTLKPGE IRPDCRPAMA VPGCPANGNG QLRALAALVL WAKGLLPLPE LDKYWRPKFI FGNTVHEQCP RAGSYAAGDF RAYPGENSYR CLYAVGCKGP ISNCPWNKLG WVNGVGGPTR TGGVCIGCTM PGFTDAYEPF YKPLQAPRVP SGLEGVAAIA ALLAGVGLAY GASKSIEKRK EEVAKEIKEV KAE
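The coordinate and length fields in this record are related by simple arithmetic, which the sketch below illustrates. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence ends with a stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information fields of this record.
length = gene_length(544852, 546123)
print(length, protein_length(length))  # prints: 1272 423
```

Passing the full Gene sequence string to `gc_percent` would give a figure to compare against the reported GC content; the 10-base blocks separated by spaces can be passed as they appear, since whitespace is stripped first.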