Gene Tpen_1717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1717 
Symbol 
ID4601742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1656100 
End bp1657380 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content46% 
IMG OID639774490 
Productglycosyl transferase, group 1 
Protein accessionYP_921115 
Protein GI119720620 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.21193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTAA GAGGATGGTT GTATACCAGA GCAACGAGGC GAAGAAGGAT GATGGAGGCA 
GGAAGTAGTG ACATATCGGT TGCTATAGTT TCGTCTGTTG TAGGTAAGTC GCCTAGAGAG
GTTACTTACT CGTTTGTTTT CGACGAAGCT TATAGACTGG TGCAAAGAGG AGTGAACGTC
CATGTAGTGC GGGCGGCAGT AGAGGAAAGC TCTTCCTCTT ATGGTATCAA TTTCCATGGA
ATAAGAAGAC GCGTTGATGC AGAAGCATTT GTAGAACTTC TGAAGAACCT TCCGCTGTAC
TCTCCGTTTG CTCTTCTAAG GAATCCCTTA ATGCTTTACT GGGAGAATCT CTACGCTTTA
AACATATCCA ACGTTGTTGA GAACCTTCGT ATCGATCTCA TTCATGCTCA CTTTGCCTAT
CCAGAGGGTT TCTCAGGGTT GATAGCTAAA CGCAGGACTA AGAAGCCTCT AGTGGTTACT
CTGCATGGAT ACGACATTCT TGTGGAACCA TCTGTAAAAT ACGGTATTAG ACTTAGCAAG
CGGTACGACG CTTTGGTACG CGAAGTCCTT GTGAACGCAG ACGCGGTAAT CGTAGCTAGT
AGAGCTGTTT TTGAGGAGGC TGTGAAGTTG CGTGGACGGA AAAGCGGTAC GTACTTGGTA
CATAACGGTG TTGACATCAA AAGGTTTAAC CCGAACCTTA ACGGCTCTCT TATTCGTAAA
AGGCTTGGTA TAGAAAACAA GTTTGTGGTG TTTAGTGCTC GTCATCATAG ACCTGTGTAC
GGCCTTGAAT ACCTGATAAA AGCAGCGGCC CTGGTGGTGA AACTTAGAAG CGATGTTGTC
TTTGTAATTG GTGGTGAAGG TCCTTTAAGA ACATACCACG AAAAGCTTGT TGAAATGTTA
AACTTGGAAA ACAATGTAAT TTTTACTGGC AGGATTCCGC GGGACGAGAT GCCTCACTAC
TATGCAGCAA GCGACGCTGT GGTGGTTCCT TCGTTGCAGG AGGCATGGAG CCTTGTCGTG
ACCGAGGCTA TGGCATCGGG TAAGCCCGTT GTGGGTACGA GAGTTGGGGG GATAGTGGAT
CAGATAATTG ACGGCTATAA CGGATTCCTA GTTCCGCCTA GGGATCCAAA GGCTATAGCC
GAGAAGATTC TCTGGCTCAT CGACAACCCT GACGAGGCTA AAAGGATGGG TATGAACGGC
AGAAGATTAG CTGAGGAGAA GTTCGATATT GAAAAGAGAA TCGAAAAGAT AATTGGTATA
TATAAAGAGC TTGTAGGGTA G
 
Protein sequence
MRLRGWLYTR ATRRRRMMEA GSSDISVAIV SSVVGKSPRE VTYSFVFDEA YRLVQRGVNV 
HVVRAAVEES SSSYGINFHG IRRRVDAEAF VELLKNLPLY SPFALLRNPL MLYWENLYAL
NISNVVENLR IDLIHAHFAY PEGFSGLIAK RRTKKPLVVT LHGYDILVEP SVKYGIRLSK
RYDALVREVL VNADAVIVAS RAVFEEAVKL RGRKSGTYLV HNGVDIKRFN PNLNGSLIRK
RLGIENKFVV FSARHHRPVY GLEYLIKAAA LVVKLRSDVV FVIGGEGPLR TYHEKLVEML
NLENNVIFTG RIPRDEMPHY YAASDAVVVP SLQEAWSLVV TEAMASGKPV VGTRVGGIVD
QIIDGYNGFL VPPRDPKAIA EKILWLIDNP DEAKRMGMNG RRLAEEKFDI EKRIEKIIGI
YKELVG