Gene Tneu_0404 details


Gene Information

Locus tag: Tneu_0404
Symbol:
ID: 6166171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 365049
End bp: 366365
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 62%
IMG OID: 641667562
Product: nickel-dependent hydrogenase small subunit
Protein accession: YP_001793798
Protein GI: 171184879
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
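
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 365049
end_bp = 366365
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive,
# so the covered span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon,
# so the protein has one residue per codon minus the stop.
expected_protein_aa = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed here.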


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.266321
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATAA CAAGACGCGA TGTACTTAGG GCGGGTTCCC TAGCGGCGGC CCTCTCCGCT 
TTAAACTGGC CAGCGCTTGT AAAAGCGGCG GGGGAGGCTG TGAAGGACGG CCTTGTAAAC
ATCGTTTGGT TCGAGGCGCA GGACTGCGCC GGCAACACAA CCGCGGTTAT CCAGGCCACA
GACCCGTCCC TCCTAGACGT CTTGCTCGGC ACGACGCCTC TCGTGGGCCC CGGCACAGTG
CGGCTGATAT TCCACGAGAC CGTGATGCCC CAGTGGGGCA CCTACCACGT GAAGGAGGCC
ACAGACGTGG CCGACCACAA GATCCTAGAG AACTACCTCC AGACCCAGCC GCCGCCCGGC
GATGCGATGA AAATACTTGA GGAGATAGCG GAGGGCAAAT ACGGGCCGTA CGTGTTGGTT
CTGGAGGGGA GCTTCCCGCA GGAGTACGGA ATATCCGGCA CAAACATCGA GCAGAAGGGC
GGCTACTACT GCCTAGTGGG CCACAGGACG TGTACAGATT GGGCGAAGCT CCTCTTCAAG
AACGCACTCG CCGTAGTGGC AGTAGGCAAC TGCGCCGCCT ACGGCGGCCT CGTGGCGAAC
AAGGTGCTGG AGCCCCCGCC CGGCTTCAAG TTCCCCACGT GGTCCCCATC GCCGACCGGC
GCCGTCGGCA TGTTCGACGA CCCGGTGAGG GGTATAAAGG GCATGATACA CATCGACTAC
TTCCAGCCTG AGGTGGAGCC GTTTAGGAAG TACATAGACG AAGGCGGCGT GCCCGACTTC
AAGACTATGA AGCCCGCCGT GGCGGTGCCC GGCTGCCCCG CAAACGGCAA CGGCATACTG
AGGACCCTCG CGCTTCTGAC GCTTGTCGCC GCCGGGTTGC TTAAGCCGGA CGTCCTGGAG
AGAAAGGCCT TCCTAGACCA GTACGCCAGG CCGCGCTTCA TATTTGAAAA CACAGTTCAT
GAGCAGTGTC CACGCGCCGC ATCCTACGCA GCTGGCGACC TAAGGCCCTA CCCGGGCGCC
GGCGACTACA AGTGCCTATT CGGCGTCGGA TGCAAGGGGC CGATATCCAA CTGCCCGTGG
AACAAGGTGG GTTGGGTCAG CGGCATAGGC GGACCTACGA GGACGGGAGG CGTCTGCATT
GGCTGCACCA TGCCGGGCTT CACCGACGCC TTCGAGCCCT TCTACGCGCC GCTCAACGCG
CCTAGGTTGC CGACGACGGA GACGCTGGGG GTTGCGCTGG GCGGCGCCGC TCTACTTGGC
GTCGCCGGAG CATACCTAGC CTCAAAGGCG GCTAAGCCCA AGGAGGAGAA GAAATGA
 
Protein sequence
MKITRRDVLR AGSLAAALSA LNWPALVKAA GEAVKDGLVN IVWFEAQDCA GNTTAVIQAT 
DPSLLDVLLG TTPLVGPGTV RLIFHETVMP QWGTYHVKEA TDVADHKILE NYLQTQPPPG
DAMKILEEIA EGKYGPYVLV LEGSFPQEYG ISGTNIEQKG GYYCLVGHRT CTDWAKLLFK
NALAVVAVGN CAAYGGLVAN KVLEPPPGFK FPTWSPSPTG AVGMFDDPVR GIKGMIHIDY
FQPEVEPFRK YIDEGGVPDF KTMKPAVAVP GCPANGNGIL RTLALLTLVA AGLLKPDVLE
RKAFLDQYAR PRFIFENTVH EQCPRAASYA AGDLRPYPGA GDYKCLFGVG CKGPISNCPW
NKVGWVSGIG GPTRTGGVCI GCTMPGFTDA FEPFYAPLNA PRLPTTETLG VALGGAALLG
VAGAYLASKA AKPKEEKK
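
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, translated, and checked for GC content follows. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, whose amino acid assignments are shared by translation table 11 (the two tables differ only in permitted start codons). Only the first and last lines of the gene sequence are used here to keep the example short.

```python
# Standard codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAGATAA CAAGACGCGA TGTACTTAGG GCGGGTTCCC TAGCGGCGGC CCTCTCCGCT"
last_line = "GTCGCCGGAG CATACCTAGC CTCAAAGGCG GCTAAGCCCA AGGAGGAGAA GAAATGA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field, and passing it to `translate` would give a string to compare with the full protein sequence once its spacing is removed with `clean`.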