Gene Tpen_0938 details

Gene Information

Locus tag: Tpen_0938
Symbol:
ID: 4601067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 889347
End bp: 890756
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 66%
IMG OID: 639773716
Product: hypothetical protein
Protein accession: YP_920341
Protein GI: 119719846
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [R] General function prediction only
COG ID: [COG0492] Thioredoxin reductase
        [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID:
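
The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues: codon count minus the terminal stop codon.

    Assumes an unspliced CDS (the record lists 'Is gene spliced: No').
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(889347, 890756)
print(length, protein_length_aa(length))  # prints "1410 469"
```

The result matches the listed Gene Length (1410 bp) and Protein Length (469 aa).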


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.903873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGAATAA CGGAGCATCC CATACTGGAG TTCAGGCGGG GCTCCCCCGT AAAGTTCCGC 
TTCGACGGCG AAGAGGTAGA GGCTTTCGAG GGGGAAAGCA TAGCGGCGGC TCTCTGGGCT
TCGGGGATCA GGGATTTCCG GCGCGGGGAG CAAGGGCCCC AGGGCCCTTT CTGCATGATA
GGCTACTGCT CGGGTTGCAT GGTGCGCGTA GACGGCAGAA GCAGGGTTAG GGCTTGCCTC
GAGCCCGTGA GGGACGGCGC TGTCGTCGAG AGGGAGGATA AGCCGCTACC GTCTGGGGTT
GTCGGGGAGG CGGGCGAGGC GGGCGAGCTG GACGTCGACG TGATGGTAAT TGGTTCCGGG
CCTGCCGGTC TCTCGGCCGC CCTCGCCAGC GCTTCCGCCG GCCTGGAGGT ACACGTGTTC
GAGCGGCATT TCCGACCCGG CGGCCAGCTC GTGAAGCAGA CGCACAAGTT CTTCGGGAGC
GGCGAGCTCT TCGGGGGGTT GAGGGGCTTC CAGATAGCGG AGAGGCTCGT ATCCGAGGCG
GAGAGGGCTG GCGTGAAGAT CCACACGAGG TCCCCCGTCC TGGGCTGGTT CGGGGAGGGG
GTCTTCGCGG TGAACGAGGG CGGGAGGTTG CTCAGGGTTA GGGCTAAGGC AGTCGTCGTG
GGCACGGGGG CTGTCGAGAG GTTCCTGCCC TTTCCGGGGA ACACGCTTCC AGGCGTGATG
GGCGCCGGCG CTGCCCAGAC GCTGATGAAC GAGTACGGGG TAAAGCCCGG CGAGAAGGCG
GTGGTCGTCG GCGCGGGGAA CGTCGGGCTG ATAGTCTCCT ACCAGCTCCT CCAGGCCGGC
GTAAGCGTAG AGGCCGTCGT GGAGGTTAGG CCGGAGATAG GAGGCTGGTT CGTCCACGCG
GCGAAGCTGA GGAGGCTGGG CGTCCCCATA CTCACGGAGC ACACTGTCGT GAGGGCTGAG
GGCAGGGGGA GAGTCGAGAG GGTGGTCATC TCGAGGGTCG GGAAGGACTT CCAGCCGCTG
AAGGAGTACG AGAGGAGCGT AGAGGCGGAC CTCCTGCTCC TAGCCGTGGG GCTGACCCCT
GAGTCCAGGT TGCTCGCGGA GATGGGTGCC AGGATGACGT GGTCGACCGA GCTTGGAGGC
TACGTGCCGT ACCGGGACAG GTACATGGAG ACCAGTATCC CCGGGGTGTA CGTGGCGGGG
GACGCCTCGG GGATCGAGGA GGCTACGACC GCCCTGCTGA CGGGCAGGGT GGCGGGGCTC
TCGGCCGCGA TAAGGATCCT CGGGGAGAGG GGTGAGCTCG TGGAGGAGAG GGAGAAGGCT
CTGAGGATGC TCGACGAGAC CAGGAGGACT CCCTTCTCCG CGCGCGTCGT GGAGGGTATA
CGCAGGGTGA GCGTCGGTGT TCAGGCGTAG
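
The gene sequence is listed in space-separated groups of 10 bases, so it has to be joined before any computation. The sketch below strips the grouping and computes a GC fraction, which can then be compared against the GC content field of the record. The helper names are hypothetical, and only the first line of the listing is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied as displayed.
first_line = "GTGAGAATAA CGGAGCATCC CATACTGGAG TTCAGGCGGG GCTCCCCCGT AAAGTTCCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing (all lines joined) to `clean_sequence` should yield a 1410-base string if the listing is complete.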
 
Protein sequence
MRITEHPILE FRRGSPVKFR FDGEEVEAFE GESIAAALWA SGIRDFRRGE QGPQGPFCMI 
GYCSGCMVRV DGRSRVRACL EPVRDGAVVE REDKPLPSGV VGEAGEAGEL DVDVMVIGSG
PAGLSAALAS ASAGLEVHVF ERHFRPGGQL VKQTHKFFGS GELFGGLRGF QIAERLVSEA
ERAGVKIHTR SPVLGWFGEG VFAVNEGGRL LRVRAKAVVV GTGAVERFLP FPGNTLPGVM
GAGAAQTLMN EYGVKPGEKA VVVGAGNVGL IVSYQLLQAG VSVEAVVEVR PEIGGWFVHA
AKLRRLGVPI LTEHTVVRAE GRGRVERVVI SRVGKDFQPL KEYERSVEAD LLLLAVGLTP
ESRLLAEMGA RMTWSTELGG YVPYRDRYME TSIPGVYVAG DASGIEEATT ALLTGRVAGL
SAAIRILGER GELVEEREKA LRMLDETRRT PFSARVVEGI RRVSVGVQA