Gene Information |
Locus tag | Tneu_0404 |
Symbol | |
ID | 6166171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 365049 |
End bp | 366365 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641667562 |
Product | nickel-dependent hydrogenase small subunit |
Protein accession | YP_001793798 |
Protein GI | 171184879 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA); [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
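The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(365049, 366365)
print(length_bp)                  # 1317
print(protein_length(length_bp))  # 438
```

Both values agree with the Gene Length (1317 bp) and Protein Length (438 aa) fields of this record.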
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.266321 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA CAAGACGCGA TGTACTTAGG GCGGGTTCCC TAGCGGCGGC CCTCTCCGCT TTAAACTGGC CAGCGCTTGT AAAAGCGGCG GGGGAGGCTG TGAAGGACGG CCTTGTAAAC ATCGTTTGGT TCGAGGCGCA GGACTGCGCC GGCAACACAA CCGCGGTTAT CCAGGCCACA GACCCGTCCC TCCTAGACGT CTTGCTCGGC ACGACGCCTC TCGTGGGCCC CGGCACAGTG CGGCTGATAT TCCACGAGAC CGTGATGCCC CAGTGGGGCA CCTACCACGT GAAGGAGGCC ACAGACGTGG CCGACCACAA GATCCTAGAG AACTACCTCC AGACCCAGCC GCCGCCCGGC GATGCGATGA AAATACTTGA GGAGATAGCG GAGGGCAAAT ACGGGCCGTA CGTGTTGGTT CTGGAGGGGA GCTTCCCGCA GGAGTACGGA ATATCCGGCA CAAACATCGA GCAGAAGGGC GGCTACTACT GCCTAGTGGG CCACAGGACG TGTACAGATT GGGCGAAGCT CCTCTTCAAG AACGCACTCG CCGTAGTGGC AGTAGGCAAC TGCGCCGCCT ACGGCGGCCT CGTGGCGAAC AAGGTGCTGG AGCCCCCGCC CGGCTTCAAG TTCCCCACGT GGTCCCCATC GCCGACCGGC GCCGTCGGCA TGTTCGACGA CCCGGTGAGG GGTATAAAGG GCATGATACA CATCGACTAC TTCCAGCCTG AGGTGGAGCC GTTTAGGAAG TACATAGACG AAGGCGGCGT GCCCGACTTC AAGACTATGA AGCCCGCCGT GGCGGTGCCC GGCTGCCCCG CAAACGGCAA CGGCATACTG AGGACCCTCG CGCTTCTGAC GCTTGTCGCC GCCGGGTTGC TTAAGCCGGA CGTCCTGGAG AGAAAGGCCT TCCTAGACCA GTACGCCAGG CCGCGCTTCA TATTTGAAAA CACAGTTCAT GAGCAGTGTC CACGCGCCGC ATCCTACGCA GCTGGCGACC TAAGGCCCTA CCCGGGCGCC GGCGACTACA AGTGCCTATT CGGCGTCGGA TGCAAGGGGC CGATATCCAA CTGCCCGTGG AACAAGGTGG GTTGGGTCAG CGGCATAGGC GGACCTACGA GGACGGGAGG CGTCTGCATT GGCTGCACCA TGCCGGGCTT CACCGACGCC TTCGAGCCCT TCTACGCGCC GCTCAACGCG CCTAGGTTGC CGACGACGGA GACGCTGGGG GTTGCGCTGG GCGGCGCCGC TCTACTTGGC GTCGCCGGAG CATACCTAGC CTCAAAGGCG GCTAAGCCCA AGGAGGAGAA GAAATGA
|
Protein sequence | MKITRRDVLR AGSLAAALSA LNWPALVKAA GEAVKDGLVN IVWFEAQDCA GNTTAVIQAT DPSLLDVLLG TTPLVGPGTV RLIFHETVMP QWGTYHVKEA TDVADHKILE NYLQTQPPPG DAMKILEEIA EGKYGPYVLV LEGSFPQEYG ISGTNIEQKG GYYCLVGHRT CTDWAKLLFK NALAVVAVGN CAAYGGLVAN KVLEPPPGFK FPTWSPSPTG AVGMFDDPVR GIKGMIHIDY FQPEVEPFRK YIDEGGVPDF KTMKPAVAVP GCPANGNGIL RTLALLTLVA AGLLKPDVLE RKAFLDQYAR PRFIFENTVH EQCPRAASYA AGDLRPYPGA GDYKCLFGVG CKGPISNCPW NKVGWVSGIG GPTRTGGVCI GCTMPGFTDA FEPFYAPLNA PRLPTTETLG VALGGAALLG VAGAYLASKA AKPKEEKK
|
| |
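The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a definitive implementation: it strips the grouping, computes a GC fraction, and translates DNA with the standard codon assignments. Translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs in the set of permitted start codons; because this gene starts with ATG, that difference is ignored here. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Standard codon assignments, written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGAAGATAA CAAGACGCGA TGTACTTAGG")
print(len(prefix))        # 30
print(translate(prefix))  # MKITRRDVLR
```

The translated prefix matches the first block of the protein sequence (MKITRRDVLR). Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of this record.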