Gene Tpen_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0821 
Symbol 
ID4601992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp774143 
End bp775339 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content60% 
IMG OID639773598 
Productalcohol dehydrogenase 
Protein accessionYP_920225 
Protein GI119719730 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.973197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGCAG CCTTAGTGAC CGCGGACTTT GCGCCTAGAC CCGGCTACAA GATCACGCAG 
GACGAGATAA GGACCCACAA GGTGAGAGAG GGGGCGAAGG TCTGGCGGAA CCCGAAGCTC
GTGCTGAGAA CAGACTACCC TGTACCGGAG CCGAAGCCCG ACGAGATACT CATACGCGTC
AAGGCGGTAG GCATATGCGG GTCCGATATA CACTTCCTCG AGACGGATAG CGAAGGCTAT
ATCCTTTACC CCGGCCTCAC GAGGTTCCCC GTGGTTATCG GGCACGAGTT CAGCGGCGTG
GTCGAGAAGG TAGGAACCAA CGTCAAGACG TTCAAGCCGG GAGACATGGT GACGAGCGAG
GAGATGTTCT GGTGCGGGGA ATGCGATGCG TGTAGAAGCG TGGACTTCAA CCACTGCCTG
CGGCTAAACG ACCCGGCGGA CCTCGAGTTC GGCGAGCTCG GCTTCACGCA CGACGGCGCG
ATGGCAGACT ACGTAGTCGT CAAGGCCAAG TATGCCTGGA AGATAGACTC GCTCCTCGAC
AGGTACGGTA GCGAGGACAA AGCGTTCGAG GCGGGGAGCC TCGTAGAGCC TACCAGCGTA
GCCTACCACG CCATGTTCAC GAGGGCAGGC GGCTTCAAAC CGGGAGCCTA CGTCGCGGTC
TGGGGCGCAG GCCCCATAGG GCTTGCAGCC ATAGCTCTCG CGAAGGCCGC GGGCGCGGGC
AAGGTCATCG CGTTCGAGGT AAGCCCGACG AGAAGAGAGC TCGCAAAGAA GGTGGGCGCA
GACTACGTGT TCAACCCGGT CGAGCTCTCG AAGAACGGCG TTGAGCCTTG GGAGAAGATA
ATGGAGGTGA CCGGGGGCCA GGGCGCGGAC TTCCACGTTG AAGCTGCCGG CGCGCCGAGA
CACACCATAC CGCAGATGCA GAAAAGCCTC GCCATTAACG GCAAGATAGT GCAGATAGGG
CGGGCGGCTG AGGACGTCCC GATATACCTT GAAGTGTTCC AGGTTAGGAG AGGCCAGATA
TTCGGGAGCC AGGGGCACTC CGGCTTCGGC AACTTCGGCA ACGTCATACG GCTGATGGCC
GCCGGCAAGA TAGACATGAC CCAGATCATA ACCGCCAGGT TCTCTCTCGA CGAGGTCTAC
AAGGCTTTCG AGAGGGCCCA CCAGAGGATA GACGGCAAGA TAACTCTCAA ACCGTAA
 
Protein sequence
MRAALVTADF APRPGYKITQ DEIRTHKVRE GAKVWRNPKL VLRTDYPVPE PKPDEILIRV 
KAVGICGSDI HFLETDSEGY ILYPGLTRFP VVIGHEFSGV VEKVGTNVKT FKPGDMVTSE
EMFWCGECDA CRSVDFNHCL RLNDPADLEF GELGFTHDGA MADYVVVKAK YAWKIDSLLD
RYGSEDKAFE AGSLVEPTSV AYHAMFTRAG GFKPGAYVAV WGAGPIGLAA IALAKAAGAG
KVIAFEVSPT RRELAKKVGA DYVFNPVELS KNGVEPWEKI MEVTGGQGAD FHVEAAGAPR
HTIPQMQKSL AINGKIVQIG RAAEDVPIYL EVFQVRRGQI FGSQGHSGFG NFGNVIRLMA
AGKIDMTQII TARFSLDEVY KAFERAHQRI DGKITLKP