Gene Tpen_0087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0087 
Symbol 
ID4601398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp68664 
End bp69788 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content53% 
IMG OID639772841 
Productglycosyl transferase family protein 
Protein accessionYP_919500 
Protein GI119719005 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCCT CGGAGAGGGC TCGAGTAAGT GTAATAGTGA CGGTCTACCG TCACGCGAGG 
CGCCTGCGGA GTCTCCTCGA AGCGCTGTGC AACCAGGAAA CCTCTTTCCC CGTGGAAGTG
ATTGTAGTCG CGGATGAGCC GGACGAGGAA GTCTTAGGCA TCCTGAGGGA AAAGGCTTGC
GTTAAGAGCT TGGTCTCCGA GAATAGGAGG GGGAAGGTTA GGGCTCTTAA CGAGGCGATC
TCGCTGAGCC AGGGCGACGT CCTGATTTTT CTCGACAACG ACGTGACCGT ACAGGACAGG
AAGTTCGTTG AGAAGATCTA CAAGTGGCTA CAGGATTTCG ACGTAGCAGA GATCAAGAAG
ATCGCCCGGG TAGACACATT CATCGGTAAA CTCGTCTACT ACGACTACAT GTCATTTGGC
GTTGCGAGCT ACATCTTCGA GAAGAGAGTG AAGAGGTGCG CTGGCTTAAA CGGTGCGGCG
ATGGCTTTCA CGAGGAAAGC TCTCAAGGAG CTTGGCGGTT ACAGAAACGT GGTTCTAGAA
GACATGGATA TAGGCTTTAG GAGCTTCTTC CACGGGTTCA GGTACAAGTA CATCTGGGAT
ACCGAGGTCG TGGTAGACCC TCCCTCATCT CTCAGAGAGT GGCTTAACCA GAGGCTAAGG
TGGTCTGTGG GTGCTTGGAC CTGGATAGAT GACTACGTGT TCCACTTCTC GAAAATAGCG
AGCATCGATT ACGCGCTGGA GAGCTTCGCT GCGCTTTTCG CAATGTTTCC CGGAGGAATC
GTCTACGCGT CTATACTACT CGCAGAGGGG CTACCGCTGT TGAAGCTTGG ACTGCTCGCA
GGATCCACAG TCGGCGGACT CTTCGCGCCG CTCGTACCGG TTCTCGCCAT CTACGAAACT
GTATCGATGT TCCTCCCGCC TCTACCATTA TCCCTTGTAG CCATTGCGGT TGCTTATTCC
GCCTTGGTAG TACCCATCGC GTACAGGATA GGCTACAAAG TCAAGCCGCA ACACTTCGCT
ACGTATATCC TATTCTACTC CACCCTCTGG TTCACGGTTA TGCTGGCAGG GTTCATAAGA
GTGTTCGTAT TCCGAAAGAG GGACGTCACC GGGTGGCAAC TCTAG
 
Protein sequence
MSSSERARVS VIVTVYRHAR RLRSLLEALC NQETSFPVEV IVVADEPDEE VLGILREKAC 
VKSLVSENRR GKVRALNEAI SLSQGDVLIF LDNDVTVQDR KFVEKIYKWL QDFDVAEIKK
IARVDTFIGK LVYYDYMSFG VASYIFEKRV KRCAGLNGAA MAFTRKALKE LGGYRNVVLE
DMDIGFRSFF HGFRYKYIWD TEVVVDPPSS LREWLNQRLR WSVGAWTWID DYVFHFSKIA
SIDYALESFA ALFAMFPGGI VYASILLAEG LPLLKLGLLA GSTVGGLFAP LVPVLAIYET
VSMFLPPLPL SLVAIAVAYS ALVVPIAYRI GYKVKPQHFA TYILFYSTLW FTVMLAGFIR
VFVFRKRDVT GWQL