Gene Tpen_1073 details


Gene Information

Locus tag: Tpen_1073
Symbol:
ID: 4601651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1011424
End bp: 1012392
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 61%
IMG OID: 639773850
Product: respiratory-chain NADH dehydrogenase, subunit 1
Protein accession: YP_920475
Protein GI: 119719980
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
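
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon. Both are assumptions rather than something the record states, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1011424
END_BP = 1012392


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 969
print(expected_protein_length(length_bp))  # 322
```

Under those assumptions the results agree with the listed Gene Length (969 bp) and Protein Length (322 aa).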


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.781085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTCGACA TTGTCCTCGG CGTCCTGGTC TTCCCCGGGC TCCTGTTCAC CGTGGCTATG 
GGCTTCTGGT TCGAGTACCT CGAGAGAAAA GTTACCGCGA GGATCCAGAG AAGGGTGGGC
CCCTTGTACG CGGGCCCCCA CGGGCTTCTC CAACCCGTCT ACGACTTCTT CAAGCTTCTC
TTGAAGGAGG AGATAGTCCC TGGGTGGACC GACGTCTTTA CGTTCAGGGT CGCGCCGATC
CTAGCCGTAA CCATACCTGT CTTCGGTATG TGCGTCCTCC CGGTGGCGAG CACCAAGGGC
CTGCTGTCGT TCGAAGGTGA CTTCGCGCTA GTTTTCCTTT TGCTGGCGCT CGGCGTGCTG
ACCCTGTCCC TCACCGGCTA CTCCGTCCTA AGCCCCTACA CGGCGATAGG TGTTGGAAGG
CTCCTCGTGC AGTACTCGAT GTACGAAGGG GTGTTCCTCT TAAGCCTTGC GTCGGCCGCC
CTGCAGGCGA AGACAATGAG CTTCGAAGGC ATACTCGCGT ACCAGGAGTC CCACGGCTTC
CTCGGTCTCT ACCAGCCGGT CTCGCTCGCC GCCGCGCTCG TCGCGCTACT AGCTAAGCTC
GAGAAGCGCC CCTTCGACCT CCCCCACGCC AAGCAGGAGG TCGTAGCGGG CTGGATGACC
GAGCTCAGCG GGAGGGGTCT AGCCTTCATG AGGCTCTACG AGGACTTGAG CATGGTTTGG
GGGATAGCGC TCATAGTCGT AGTTTTCCTC GGGGGACCGC TGGGCCCCGG CTACAAGGAG
CTGGGCGCGC TGGCCGGTTT CGCGTGGTTC GCGCTGAAAT CGCTAATCGT CGCACTTGCA
GTCATCCTCG TTAGCGCGAC TACGAGTAGA GTCAGGGTCT ATGGGCTCGC GGAGGTCTTC
TGGAAGAGGG TTTACCCGCT AGTCCTGCTC CAGCTAGTCG TGGCGTTCCT TCTGGGGTGG
TGGGCGTGA
 
Protein sequence
MLDIVLGVLV FPGLLFTVAM GFWFEYLERK VTARIQRRVG PLYAGPHGLL QPVYDFFKLL 
LKEEIVPGWT DVFTFRVAPI LAVTIPVFGM CVLPVASTKG LLSFEGDFAL VFLLLALGVL
TLSLTGYSVL SPYTAIGVGR LLVQYSMYEG VFLLSLASAA LQAKTMSFEG ILAYQESHGF
LGLYQPVSLA AALVALLAKL EKRPFDLPHA KQEVVAGWMT ELSGRGLAFM RLYEDLSMVW
GIALIVVVFL GGPLGPGYKE LGALAGFAWF ALKSLIVALA VILVSATTSR VRVYGLAEVF
WKRVYPLVLL QLVVAFLLGW WA
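
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 predicts: it uses the standard codon assignments but accepts alternative start codons that are read as methionine at the first position. The sketch below illustrates this on the first 60 nucleotides of the gene sequence. The helper names are hypothetical, and the start codon set is a simplification limited to ATG, GTG and TTG. The `gc_percent` helper shows how a GC content figure can be computed; the 61% in the record refers to the full gene, not to this prefix.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# shares these assignments and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the three most common start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a CDS, reading a start codon as M and stopping at a stop codon."""
    seq = clean(cds)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# First line of the gene sequence shown above.
prefix = "GTGCTCGACA TTGTCCTCGG CGTCCTGGTC TTCCCCGGGC TCCTGTTCAC CGTGGCTATG"
print(translate(prefix))  # MLDIVLGVLVFPGLLFTVAM
```

The printed residues match the first 20 residues of the protein sequence above. For routine work, an established library such as Biopython would be a more robust choice than a hand-built codon table.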