Gene Tpen_1153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1153 
Symbol 
ID4600959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1092105 
End bp1093166 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content62% 
IMG OID639773929 
Productglycosidase, PH1107-related 
Protein accessionYP_920554 
Protein GI119720059 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0153392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGCGA AAATACTGTC AGCTGGGGAT CCTCTCTCGA AGCTCCTCGC GGAGAAGGTA 
GGGCGCAGAG CAGAGCTTAG GGGGCTGAGG CGCGGCTTGG AGCAGGACGT ATTCGAAAGG
CTGGCGTACA TCACTCCGAT CCGCGTAGAG GTAACGAACT ACGCGAGAAG ACCGGTAGCC
GTCTTCAACC CGGGTGCCGC GCTGGAGGGC TCGAATGTCG CCATTTTCCC AAGGATGGTC
TTCGACTACT ACTGGTACGT TTCATCGGTG GGGAGAATAC GCGTAGGCGT GGACGACCTT
CTCTCGGGGA GTATACCCGG AACCCTGAAG GCAGACCTGG TGATATACCC GAGCGAGGAA
TGGGAGCTGA GAGGGTGCGA GGACCCGAGG GTCTACGGCG CCGATGGAAA CTACCTCGTG
CTCTACACCG GCGTCCTGCC CTTGCAGAAC GGCGTGCTAC CTCTACAGGC CGTTGCCAGG
TACTCGGAGG GCCGTGTCGA GAAGCTCGGC TACCTGGCTT TCGAGTATGG AGGCGAAAGG
TACGTGGCGC CCTGGAAGGA CAGCGCCATT CTCTCGGAGG GTGGCGGCGA GGCTCTGGCG
CTCGTGCGCC CCTCCGTGCC GGTGCCGGGG GGTTTCCTGG AGGCCGGCTG GTTTACGCGC
TTCGACCTGG CAGGGCTTAC AGTGGACCCG GGCGAAGCGG TTCCCTTGCT CGTTGCAGAG
AGCTTTGAAT ACAAGGTTGG GTGGTCTACG AACGCCTTGA AGCTGTCGAG CGGAGAGTAC
CTGGTGGGGT GGCACGGGGT AGGGGTGGAC AACGTCTACA GGAATGGGCT CGCGGTGGTA
AGCGAGGAGG GGGAGCTCTT GGAGCTTTCC GAGTACCTCC TGGTCCCGCG GAAAAGCTTA
GAGGAGTTCT ACGGCGATAG GCCTGGCGTG GTCTTCGGGT GCGGGTTGCT GAGAATCAAG
GAGAAGCTCG TCTGGGTAGG CGGCGTCTCG GACTACGCGG TAGGCGTATT CGCCGTGGAC
ATGGACAAGG CGCTGGAGCA CTTGAAGAGA GTCTCGCGCT GA
 
Protein sequence
MKAKILSAGD PLSKLLAEKV GRRAELRGLR RGLEQDVFER LAYITPIRVE VTNYARRPVA 
VFNPGAALEG SNVAIFPRMV FDYYWYVSSV GRIRVGVDDL LSGSIPGTLK ADLVIYPSEE
WELRGCEDPR VYGADGNYLV LYTGVLPLQN GVLPLQAVAR YSEGRVEKLG YLAFEYGGER
YVAPWKDSAI LSEGGGEALA LVRPSVPVPG GFLEAGWFTR FDLAGLTVDP GEAVPLLVAE
SFEYKVGWST NALKLSSGEY LVGWHGVGVD NVYRNGLAVV SEEGELLELS EYLLVPRKSL
EEFYGDRPGV VFGCGLLRIK EKLVWVGGVS DYAVGVFAVD MDKALEHLKR VSR