Gene Tpen_0226 details


Gene Information

Locus tag: Tpen_0226
Symbol:
ID: 4601216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 203494
End bp: 204669
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 53%
IMG OID: 639772980
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_919639
Protein GI: 119719144
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
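
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a final stop codon that is not counted in the protein length; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return abs(end_bp - start_bp) + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(203494, 204669)
print(length, protein_length_aa(length))  # prints: 1176 391
```

Under these assumptions the result agrees with the listed gene length of 1176 bp and protein length of 391 aa.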


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGGTA AAACTATAGT CGTCGTAGGC GGAGGCTTCG GAGGCTTCTA CGCCCTCAAG 
ACCCTTTCAG GGCTAGGCCT GACGGAGAGA AACAGAGTAG TGCTGGTGGA TAAGAGCAGA
CGCTTCGTCT ACTTGCCGTC TCTACCTTAC CTTATTTCCG GGAAGAAAAC AGTGGAGGAT
CTCACGGAGC CCCTTGAGGA GATAGCGAAA AGGCTGGGCG CAGAGTTCTT GTTAGGAGAA
GTCACAAGGG TTAGCCTCCA GGAGAAAACG ATCGAGCTTA GCAACGGGGA AAAATTGAGG
TACGACTACT TGGTGCTAGC GGCAGGTGCT ACTACAGAGT ACTACGGCAT ACCCGGTGCC
CACCAGGCTC TTCCGGCGTG GAGGCTCGAG GACTACGAGA GAATAGTGAG GGAGCTCGGC
AAGTGCGGTA GCGGTTGCAG AGTATGCATC GCCGGGGGCG GGCTGACCGG CGTAGAAGTT
GCGGGCGAAG TTGCCGAGAA GTACGAAGGG AAGAGTGAGG TGGTCGTTGT AGAGAAGATG
CAGATGCTAA TGCCTACCCT AAACTCTCAA CGGGCATCAA AGATAATAGA GGAATTCCTC
TCGAGCAAAG GGGTCAGGAT AATCAAGGGT AACGGTGTCT CAAGTGTTTC GGAGAAAAAC
CTGAACCTTG AGGACGGTAC AAGCATTCAG TGCGACATCG TAGTCTGGAC TGTGGGTGTG
AGACCCCCAG ACATAAGGTT CGACGTAGAT GTGCCCGTGA AGGGTAGAGG GTGGATCTGC
GTCAAGCCAA CGTTGCAAGT GATGAGCAGG GACGACGTCT ACGCTATCGG CGATATAAAC
CATTTCGCCG TGGATTCCGA CTACGCTATG AAGATGGCCG AAGAAGCTAT CCTTCAAGGC
AAAACAGCCG CGAAGAACAT AGCGCTACAG ATCAGCGGCG AGACCCCACG CTACACGCAT
AAGCCCATAT TCCTTGCTTC AAAGCCGAAA AGCCTCGTGT CCGTCGGTTA CGGAAGGGCG
TTGATGATCT GGGAAAACAA GATCCTCTTC GGAAGAGCAC CCTATATAAG CAAGATGCTC
ATAGAAAGCA TCGTTATGAG AGACGTGAAG GGGAAAGTCG GCGGTGGGAT AGCAACGTCG
CTCGAAAGCA AGATTCTTAG AACAATCTCC GGGTAA
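
The gene sequence above is printed in blocks of ten bases, so it has to be stripped of whitespace before its length or GC content can be compared with the values in the record. A minimal sketch, with hypothetical helper names, might look like this; the demo string is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Short demo on the first two blocks only; paste the full gene sequence
# here to compare against the GC content reported in the record.
demo = clean_sequence("GTGAGTGGTA AAACTATAGT")
print(len(demo), round(100 * gc_fraction(demo)))
```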
 
Protein sequence
MSGKTIVVVG GGFGGFYALK TLSGLGLTER NRVVLVDKSR RFVYLPSLPY LISGKKTVED 
LTEPLEEIAK RLGAEFLLGE VTRVSLQEKT IELSNGEKLR YDYLVLAAGA TTEYYGIPGA
HQALPAWRLE DYERIVRELG KCGSGCRVCI AGGGLTGVEV AGEVAEKYEG KSEVVVVEKM
QMLMPTLNSQ RASKIIEEFL SSKGVRIIKG NGVSSVSEKN LNLEDGTSIQ CDIVVWTVGV
RPPDIRFDVD VPVKGRGWIC VKPTLQVMSR DDVYAIGDIN HFAVDSDYAM KMAEEAILQG
KTAAKNIALQ ISGETPRYTH KPIFLASKPK SLVSVGYGRA LMIWENKILF GRAPYISKML
IESIVMRDVK GKVGGGIATS LESKILRTIS G
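
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own method: it uses the amino acid assignments of the standard genetic code (which translation table 11 shares, differing only in its permitted start codons), reads the first codon as M regardless of its sequence, and stops at the first stop codon. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 assigns the same amino acids as the standard code;
# it differs in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(blocked: str) -> str:
    """Translate a CDS: first codon read as M (simplification), stop codon dropped."""
    seq = "".join(blocked.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        residue = "M" if index == 0 else CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Demo: the first 30 bases of the gene sequence, followed by a TAA stop
# codon appended for illustration (it does not occur at this position).
print(translate_cds("GTGAGTGGTA AAACTATAGT CGTCGTAGGC TAA"))  # prints: MSGKTIVVVG
```

The demo output matches the first ten residues of the protein sequence above; the gene begins with GTG, which serves as a start codon under table 11 and is therefore read as M.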