Gene Tneu_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0401 
Symbol 
ID6165115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp362884 
End bp363984 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content62% 
IMG OID641667559 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001793795 
Protein GI171184876 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.496763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCACT TCGAACGGGC ATATATCCTT TATCCTTCTA CGTTTTTAAA TAGGGGGTTG 
TGGCCCCTTG TGATTAAGCT GTCTCACGGC TCCGGCGGGG TGGAGACGTC GGAGATCGTC
GAAAAGCTGT TCCTCAGACG TCTGCCCGAG GGGCTTAAGA AGGTGGCGGG GGGGCTTGGG
CTCGACTTCC CCGACGACGC CGCCGCCATA CCGATGGCCG ACGGGCGGTA CCTCGTGGTG
ACTGTGGACG CATATACTGT CAACCCGCCG TTTTTCCCCG GGGGGGACAT CGGGATGTTG
GCGGCCTCAG GCTCTATAAA CGACGTGTTG ATGCTGGGGG GTAGGCCCCT GGCGATGCTG
GACTCCATAA TCGCCGAGGA GGGGCTCCCC TACGAGACGC TTGACAGAGT GGTCAAGTCC
TTCCTATCCG TACTCGAGGC GGAGGGCGTG GCCCTCATAG GCGGGGACTT CAAGGTCATG
CCCAAGGGCC AGCTTGACAA GATCGTCATT ACGACGGTGG GGATAGGGGT GGCGGATAGG
ATCATCGTGG ATAAGCCGAG GCACGGCGAC AAGATCGTGG TGAGCGACTT CGTGGGGGAC
CACGGCGCCG TCATCCTCAT GCTTCAGATG GGGGACGTGG ACAAGCCGGA GCAGCTACAG
CTTAAGAGCG ACGTGAAGCC GCTTACTAAG CTCATGGTGC CGCTGGTGGA GAAATACGGC
GAGTACATCC ACGCGGCTAG GGACCCCACT AGGGGCGGCC TCGCGATGGT GTTGAACGAC
TGGGCTAAGG CCGGCGGCGG CGTGATTGTG GTGGAGGAGG AGAGCCTCCC GGTGAGGCCG
GAGGTGGCGT CGTACGCCGG GATGCTCGGC ATAGACCCGC TCTACCTGGC GAGCGAGGGG
GCGGCCGTCC TGGCGGTGGA CCCCGCCGTC GCCGAGGAGG TGGTGAAGTT CATGAGGGGG
CTGGGGTTCC AAAACGCGCG GGTGGTGGGC GAGTTTAGAG AGGCGAGGCA ACACAGGGGA
TACGTCCTGC TGAAGACGGT GGCGGGCGGG CTTAGGATTC TGGAGCCTCC CAGGGGCGAC
ATCGTGCCGA GGATATGCTG A
 
Protein sequence
MYHFERAYIL YPSTFLNRGL WPLVIKLSHG SGGVETSEIV EKLFLRRLPE GLKKVAGGLG 
LDFPDDAAAI PMADGRYLVV TVDAYTVNPP FFPGGDIGML AASGSINDVL MLGGRPLAML
DSIIAEEGLP YETLDRVVKS FLSVLEAEGV ALIGGDFKVM PKGQLDKIVI TTVGIGVADR
IIVDKPRHGD KIVVSDFVGD HGAVILMLQM GDVDKPEQLQ LKSDVKPLTK LMVPLVEKYG
EYIHAARDPT RGGLAMVLND WAKAGGGVIV VEEESLPVRP EVASYAGMLG IDPLYLASEG
AAVLAVDPAV AEEVVKFMRG LGFQNARVVG EFREARQHRG YVLLKTVAGG LRILEPPRGD
IVPRIC