Gene Tpen_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0158 
Symbol 
ID4601267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp134798 
End bp135958 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content55% 
IMG OID639772912 
Productglycosyl transferase family protein 
Protein accessionYP_919571 
Protein GI119719076 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAGCC GCGCTATCGC CGAGGTAGTC TTGGAGGGTA AGGCTTCGCA GGGCCGAAAG 
GTGTCGGTCA TCATACCGTC GTACAGGGGG TCTGAGAGGT TAGTCAGGCT TGTGAAGAGG
GTGGCGAGCC TCCCCTACGA GGACAAGGAA GTAGTAGTCG TTGTGGATGA GCCTCTGAGG
GAGGTGGCCG AGGAACTAAG GAGGATCGGC GGGGTGAAGC TCATCCTAAG GCCTAAGAGG
GGCGGCAAGG TTAGCGCGCT GAACGAGGCG CTTAGAGAGT CTAGCGGCGA GGTCGTAATA
TTCCTGGACG ACGACGTATA CGTGGAGGAC GACCGCTTCA TCGAGAAGGT TTTGAAGGCT
ATGGAGGGCT ACGACATAGC CGACATCAAG AAAGTTATAG TCGACACGGG GGGTATTCTC
TCGAAGCTCG TCTACATCGA GTACGCATCC TACAACTTTG CAAGTAAGCT GATGGCGAGG
GCCGCTAGGA GAACAGTCGC CGTCAACGGG GCGGCGTTCG CCGTCAGGAG AAAAGCGCTG
GACGAGATAG GGTACTTCCG CCCATCGATA TCCGAGGACT TCGACATAGC CCTGAGGTCG
TTTAAAGCTA AGCATAACTT TACGTACATC GAGAACACCT ACGTACTCAA CTATCCTCCG
AGCGATTTTA GGAAGTGGTT TAAGCAACGC AAAAGGTGGG CAATAGGCCT CGCCGCCTGG
CTGGAAGAGA ACTTCGCGGA CGCGCTTAAA ACGCTTCTCA GAATGCCGCA CGCCGTGATC
CCCGGGCTCC TGCTGGCTCT ACCGTCGCTT TCGAGCGCTT TGATAACGTT CGTCCTCAGC
AACCACGTCT ACGAGAAGAC GGCTTACCTC TTCATGCTCA CGTTGTCGTC CCTAGTAGCC
CAGGCGCTTC CATTCGCCTC GATCCTGCTT CTGAACATCC AGCTCATATA CCTCGTAAAG
GCGGGAGCAA TCCTCACAGC GTTCTTCGTG TTCCTATTCT GGCAGTTCGC GGCATCACGC
GCCGTGAAGA TGAAGTCATA CCTGTACCTA TACCCTGTCT ACTTCTTCGT TTACCAGCCA
CTCTGGCTTA CGATACTCTT AGCCGGCTTC ATTCGAGTTA TAGTCCTTAG AAGGAAGAGC
GTCGAAGACT GGGTTGTCTA A
 
Protein sequence
MSSRAIAEVV LEGKASQGRK VSVIIPSYRG SERLVRLVKR VASLPYEDKE VVVVVDEPLR 
EVAEELRRIG GVKLILRPKR GGKVSALNEA LRESSGEVVI FLDDDVYVED DRFIEKVLKA
MEGYDIADIK KVIVDTGGIL SKLVYIEYAS YNFASKLMAR AARRTVAVNG AAFAVRRKAL
DEIGYFRPSI SEDFDIALRS FKAKHNFTYI ENTYVLNYPP SDFRKWFKQR KRWAIGLAAW
LEENFADALK TLLRMPHAVI PGLLLALPSL SSALITFVLS NHVYEKTAYL FMLTLSSLVA
QALPFASILL LNIQLIYLVK AGAILTAFFV FLFWQFAASR AVKMKSYLYL YPVYFFVYQP
LWLTILLAGF IRVIVLRRKS VEDWVV