Gene Tpen_1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1654 
Symbol 
ID4601731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1601324 
End bp1602565 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content64% 
IMG OID639774427 
Producthypothetical protein 
Protein accessionYP_921052 
Protein GI119720557 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2407] L-fucose isomerase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGTAT ACGCGGTAGC CTTCGCCTCG AGGATCCACG GGGAGGGCTA CTACAGGCAG 
GCGTACAGCT ACGTTTCGAG CGTTCTCCGC GTACCCGTCT ACCCCGAGGT GGTCGCGGAG
CGCGACACCT TGAAGAAAGC CGTCGAGGAG CTTAGGGGCT CCCTCCCCTT AGCCGTAGTC
TTAACCGGCG GGACCAGCGG GTTGATACAG GAGTTCGCCT CGGAAGGAGG CTTCAGGGCT
GTGGCGCTAT TAGCGCAGGG CGAGCACAAC AGCCTAGCCT CGGCGATCTC TGCGAGAGCG
GCTCTCGAAT CCAGGGGGGT CGGTGTAGCT CTCTTCCACT GCGGCTCCTT CTCGGACGGC
AACTGCGCCG CCGCGGCGAG CGCGGCTGTC AGAGTTGCCC GAGGAGCAGG CCGGGTTCTG
GGGGCGAGGG TGGGCGTGGT GGGCTCTAAG CCTCGCTACG CGGATGTCTT CTCGTCGAGG
CTCGGCTGGA CTATCGAAGT GGTGCCCGCC GAGGAGCTTT TCTCCGCCGC AGAGTCCGCT
CCCAGAGAGG CTGTGGAGTC CTTCCTTTCC AGGGTGTCGG GGGTCCCGGG CTTCGAGTTG
TACCGCTCAA GCCTCGAGCA CGTCGGCGGG GTGTACTACG CGTTGAGGAG GCTCTCCGAG
GAGAAAAGGC TCGACGCGGT CGCCGTTGAC TGCTTCCCCT ACCTCGTAGA GCACCGCGTA
TCCCCCTGCG TTGCGCTGGC GCTCCTGAAC GCGGACGGCT TCGCGGCGGC CTGCGAGGCT
GACCTCTACT CGGCGCTCTT AATGCTCGTC TCAAGGGAGC TTACAGGGTC CTCGGGGTGG
ATAGCCAACG CTACGCACTT CGAGGGCAGG GTCGGGGTCT TCTCTCACTG CACGATAGCG
TTCGACATCG CCAGGGCTCC CAGCCTGGTA GACCACTTCG AGAGCGGCTA CCCGGTAGCA
GTGGCGTCCC AGCTTCAGCC CGGTGAGGTA ACGGTGGCCT CGCTTTCACG GGACCTCTCG
GAGGTCTACG TAGCTAGGGG CAGGGTGGTG CGCTCTGGCT TTATCAGCCG AGCGATGTGC
AGGACGCAGG CACACGTGGA GTTCGACTTC GACGCGGAGG TAATCCCGCT GGTGGCACCC
GCGAACCACC ACCTCGTAAT GCCCGGCGAC GTCGTAAGGG AGGTTAAAAG CGTCTCGAAG
CTCCTCGGGC TACGCGTCAA GGAGTACTCA AAGGAGGCTT GA
 
Protein sequence
MNVYAVAFAS RIHGEGYYRQ AYSYVSSVLR VPVYPEVVAE RDTLKKAVEE LRGSLPLAVV 
LTGGTSGLIQ EFASEGGFRA VALLAQGEHN SLASAISARA ALESRGVGVA LFHCGSFSDG
NCAAAASAAV RVARGAGRVL GARVGVVGSK PRYADVFSSR LGWTIEVVPA EELFSAAESA
PREAVESFLS RVSGVPGFEL YRSSLEHVGG VYYALRRLSE EKRLDAVAVD CFPYLVEHRV
SPCVALALLN ADGFAAACEA DLYSALLMLV SRELTGSSGW IANATHFEGR VGVFSHCTIA
FDIARAPSLV DHFESGYPVA VASQLQPGEV TVASLSRDLS EVYVARGRVV RSGFISRAMC
RTQAHVEFDF DAEVIPLVAP ANHHLVMPGD VVREVKSVSK LLGLRVKEYS KEA