Gene Tpen_1006 details


Gene Information

Locus tag: Tpen_1006
Symbol:
ID: 4600679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 952106
End bp: 953239
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 65%
IMG OID: 639773784
Product: iron-containing alcohol dehydrogenase
Protein accession: YP_920409
Protein GI: 119719914
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
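
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    span = end_bp - start_bp + 1           # inclusive span on the replicon
    codons, remainder = divmod(span, 3)    # a CDS should be a whole number of codons
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # The stop codon does not encode an amino acid.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record for Tpen_1006.
result = check_cds_lengths(952106, 953239, 1134, 377)
print(result)
```

With the values from this record, all three checks pass: 953239 - 952106 + 1 = 1134 bp, and 1134 / 3 = 378 codons, one of which is the stop codon.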


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCGCAG GGTTAAGGTT CACCCATAAG CACCCGACGA CCGTTGTTTT CGAGAGAGGC 
TCCCTCAACA GGCTGGGCGA GCTCGTGAAG GGGCTCGGCG GGAAAGCCCT CGTAGTCACG
GGCCGGCAGT TCGCCAAGAA GTACGGCTAC GACTCCCTCA TAAAGAGCCA GCTCGAGTCC
GCGGGCGTCG AGGTCTACTT CTTCTCGGAG GTGGAGCCGA ACCCCACCAT CGAGACTGCA
GACAGGTGCG CCGAGGCGGC GCGGAAGGCG GGCGTAGACT TCTTCGTAGC GTTCGGCGGG
GGAAGCGTGA TAGACGTCGC TAAGGCGGCT AACGTTGTGT ACAGCCTCGG GGGCAGCGCT
AAGGACTACC TGTGGCCCCG CAACGTCGAG GAGAAACTCA GGCCGCTCGT AGCCGTGCCC
ACAACCCACG GAACCGGGAG CGAGGTGACG AAGTACTCCG TCCTCGTCGA CAGGGAGACG
GGCATGAAGG TCGCCGTCTC CGGGAGCGGG CTCCTACCGA CGCTCGCCGT CCTCGACCCC
CTCGTGCTGA AGCACCTACC GAGCGACCAG TCGGCCAGCA CGGGGCTGGA CGCGCTCTCC
CACGCCATCG AGGCGTTCTT CAGCTCCAGG GCGACACCCT TCACCGACAT GTTCGCGCTG
GAGGCTTCCA GGATAGCGTT CCGCAAGCTA CCCTGCGCCG TGGAGGGCTT CCTGGACTGC
AGGGAGTGGA TGCTGTACGC GAGCATGCTG GCAGGCTACG CTATCAACTA CACCGGTACC
AACATAGGGC ACGGGCTCGG ATACCCCCTC ACAACGAGGC TAAACCTACC CCACGGCTTC
GCGAACACCG TGCCCCTCCT GGGGGCCCTT GAGTACTACG AGAAGTACGC GCCGGAGAGA
GCCAGGCTGT TCGCCGAGCA CGTGGGGGCC ACGGGGGTCG GAGGGCTGAG GGGGCTCTTC
AAGGAGCTAT GCAGGGAGGT GGGAGCCCCG ACGTCGCTGA GCGGGCTCGG GGTTCGGAGG
GAGGAGCTGG ACGACTACGT CCGGGAGGGG TTGAAGTACA AGAGGAACCT CTCGAACGCC
CCCTTCGAGG TGACGGAGGA GATAGTGCGG GACATCTACG AGAGGGTGTT CTAG
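
The GC content reported in the gene information can be recomputed from the gene sequence as the fraction of G and C bases. The following is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks used in this record. The function name `gc_content` is hypothetical, and the short example uses only the first ten bases of the gene.

```python
def gc_content(sequence):
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the block spacing and line breaks used in the record layout.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten bases of the gene sequence: 4 G + 3 C out of 10.
first_block = gc_content("GTGTCCGCAG")
print(f"{first_block:.0%}")
```

Passing the full 1134 bp sequence to the same function gives the value to compare against the rounded percentage listed in the record.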
 
Protein sequence
MSAGLRFTHK HPTTVVFERG SLNRLGELVK GLGGKALVVT GRQFAKKYGY DSLIKSQLES 
AGVEVYFFSE VEPNPTIETA DRCAEAARKA GVDFFVAFGG GSVIDVAKAA NVVYSLGGSA
KDYLWPRNVE EKLRPLVAVP TTHGTGSEVT KYSVLVDRET GMKVAVSGSG LLPTLAVLDP
LVLKHLPSDQ SASTGLDALS HAIEAFFSSR ATPFTDMFAL EASRIAFRKL PCAVEGFLDC
REWMLYASML AGYAINYTGT NIGHGLGYPL TTRLNLPHGF ANTVPLLGAL EYYEKYAPER
ARLFAEHVGA TGVGGLRGLF KELCREVGAP TSLSGLGVRR EELDDYVREG LKYKRNLSNA
PFEVTEEIVR DIYERVF
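
The protein sequence can be reproduced from the gene sequence using translation table 11 (bacterial, archaeal and plant plastid code). The following is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand. Table 11 uses the same codon-to-amino-acid assignments as the standard code but permits alternative start codons such as GTG, which is read as methionine when it initiates translation. The names `translate_table11`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop layout whitespace
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
n_terminus = translate_table11("GTGTCCGCAG GGTTAAGGTT CACCCATAAG")
print(n_terminus)
```

For the first 30 bases this yields MSAGLRFTHK, matching the start of the protein sequence above. Note that the initial GTG is translated as M, while an internal GTG codon is translated as V.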