Gene Tpen_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0094 
Symbol 
ID4601386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp74694 
End bp76538 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content59% 
IMG OID639772848 
Productaldehyde ferredoxin oxidoreductase 
Protein accessionYP_919507 
Protein GI119719012 
COG category[C] Energy production and conversion 
COG ID[COG2414] Aldehyde:ferredoxin oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCGA AGGGCTGGTG GGGAAGGGTA CTGTGGGTTG ACCTCTCCCG GAAGTCTACA 
AAGGTGCAGG AGCTGGACGG CGGAATACTG TTAAGTCACG TAGGGGGCAG GGGTCTCGCT
GTGCGCCTAC TGTGGGATTA CACAAGCCCG GGGGTGGACC CCCTCTCCCC GGAGAACCTG
CTGGTTTTCT CCGCGGGGCC GATAACAGCC CTACCTGGGC CAAGCACAGG TAAGCTCGTA
GTCGCGTCGA AGAGCCCGTT AACGCACGGC TACGGCGACG GAAACTTGGG TACGAGAGCG
GCCGTCATGC TTAGATGGGC CGGATACGAC GCCGTTGTGT TCAAGGGCAA GTCCCCGAAG
CCCGTCTACG TTTACGTAGA GAACGAGAAG GTAGAGTTCC TCGACGCCGA CGATCTATGG
GGCCTTGACA CCTTCTCGGC GGAAAAGAGG CTCCTGGAGC GCCACGGGAA AGACGCGGGC
GTCCTGCTCA TCGGGCCCTC CGGAGAGAGA ATGGTCAAGA TGGCTACCGT GGTCTCCCAG
AGCGGTAGGA GCGGGGGTAG ACCTGGTATA GGAGCTGTCA TGGGTAGCAA GAACTTGAAG
GCTGTCGTGT TTCGCGGCGA CAAGATGCCC GAGGTCGCTG AGCAGTCGCT GTTAAGGAAG
ACCGCGGCGG AAGCTTACGC GTCCGCGAAG AGCAAGCCTC CCTACTCCTT CTGGATGAGG
CAGGGGACGA TGGCAACTAT CCAGTGGTCT CAGGAAAACA GTGTTCTCCC CACGTTTAAC
TTTAGTGAAG GAGTATTCGA CGAGAGTAGC GGGATTGACG GCTTCGCTAT GGAGAGGCTG
AAGGTTTCCC AGCGTGGATG CCCGAACTGT AACTCTATCT GCGGCAACGT TATCCTCGAC
GACGAAGGAG CGGAGTCGGA GCTGGACTAC GAGAATGTAG CCATGCTGGG CTCGAACATA
GGGCTGGGCG ACCTGCGTAA AGTGGCGCGT CTAAATAGGC TAGCGGATAT GTGGGGCATC
GATACGATCG GGCTTGGCTC AGCGCTAGGC TTCGCGATAG AGGCTTCTCA AAGGGGCTTG
CTGAAGGACA GGATAGAGTG GGGAGATTTC GACAAAATAC TGGAGCTTTC CCGGGAAATA
TCCCTCGGAG AGGGCCCCGT AGGAAGCGTG CTATCAGAGG GCGTCGAGCA TGCATCCAAG
GTTCTGGGAT GCGAGGAGTG CGCCGTACAC GTCAAAGGGC TAAGCGTAAG CGCTTACGAC
TGCCACGCCG CCCCAGGAAT GGCTCTATCG TACGGTGTGA GCAGTGTCGG CGCCCACCAC
AAGGACGCGT GGGTAATATC CTGGGAAGTT GCACATGGCA GGTTCGAGTA CTCGAAGGCG
AAGGCCAAGA GGGTGTACGA GCTACAGAGG ATACGCGGAG GCTTCTTCGA AAACCTAGTG
GCATGCCGCC TACCGTGGGT GGAGCTAGGG CTCGAGCTAG ACTGGTACGT GAAGCTGTTC
AACTACGCTA CTGGGCTCTC CTGGACTCTC GACGACCACC TAAAGGTGGC GGACCGCACT
ATAACGCTTA TAAGGAGCTA CTGGGTGCGG GAGTACCTCG CGGAAGGCAG GCGTTGGGGT
AGGCAACTGG ACTACCCGCC CTTAAAGTGG TTTACGAAGC CGTATACCCG CGGACCGCTG
AAAGGCGCTA GGCTTGATCC TCAGAAGTAC GACGAGCTCC TCGGAAACTA CTACGAGCTC
GTAGGATGGG ATCACCGTGG CGTTCCGCGC GCGTCGACGC TGGAGAGGCT CGGACTTTCC
TACGTGAGGA CAGAGCTACA GAGAATGACT GAGCTCACGA ACTAA
 
Protein sequence
MAAKGWWGRV LWVDLSRKST KVQELDGGIL LSHVGGRGLA VRLLWDYTSP GVDPLSPENL 
LVFSAGPITA LPGPSTGKLV VASKSPLTHG YGDGNLGTRA AVMLRWAGYD AVVFKGKSPK
PVYVYVENEK VEFLDADDLW GLDTFSAEKR LLERHGKDAG VLLIGPSGER MVKMATVVSQ
SGRSGGRPGI GAVMGSKNLK AVVFRGDKMP EVAEQSLLRK TAAEAYASAK SKPPYSFWMR
QGTMATIQWS QENSVLPTFN FSEGVFDESS GIDGFAMERL KVSQRGCPNC NSICGNVILD
DEGAESELDY ENVAMLGSNI GLGDLRKVAR LNRLADMWGI DTIGLGSALG FAIEASQRGL
LKDRIEWGDF DKILELSREI SLGEGPVGSV LSEGVEHASK VLGCEECAVH VKGLSVSAYD
CHAAPGMALS YGVSSVGAHH KDAWVISWEV AHGRFEYSKA KAKRVYELQR IRGGFFENLV
ACRLPWVELG LELDWYVKLF NYATGLSWTL DDHLKVADRT ITLIRSYWVR EYLAEGRRWG
RQLDYPPLKW FTKPYTRGPL KGARLDPQKY DELLGNYYEL VGWDHRGVPR ASTLERLGLS
YVRTELQRMT ELTN