Gene Information |
Locus tag | Tpen_1717 |
Symbol | |
ID | 4601742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1656100 |
End bp | 1657380 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639774490 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921115 |
Protein GI | 119720620 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
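The coordinate and length fields in this record are linked by simple arithmetic, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The record states neither convention, but both are consistent with the values shown. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1656100, 1657380)
print(length, protein_length(length))  # 1281 426
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.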
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.21193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGATTAA GAGGATGGTT GTATACCAGA GCAACGAGGC GAAGAAGGAT GATGGAGGCA GGAAGTAGTG ACATATCGGT TGCTATAGTT TCGTCTGTTG TAGGTAAGTC GCCTAGAGAG GTTACTTACT CGTTTGTTTT CGACGAAGCT TATAGACTGG TGCAAAGAGG AGTGAACGTC CATGTAGTGC GGGCGGCAGT AGAGGAAAGC TCTTCCTCTT ATGGTATCAA TTTCCATGGA ATAAGAAGAC GCGTTGATGC AGAAGCATTT GTAGAACTTC TGAAGAACCT TCCGCTGTAC TCTCCGTTTG CTCTTCTAAG GAATCCCTTA ATGCTTTACT GGGAGAATCT CTACGCTTTA AACATATCCA ACGTTGTTGA GAACCTTCGT ATCGATCTCA TTCATGCTCA CTTTGCCTAT CCAGAGGGTT TCTCAGGGTT GATAGCTAAA CGCAGGACTA AGAAGCCTCT AGTGGTTACT CTGCATGGAT ACGACATTCT TGTGGAACCA TCTGTAAAAT ACGGTATTAG ACTTAGCAAG CGGTACGACG CTTTGGTACG CGAAGTCCTT GTGAACGCAG ACGCGGTAAT CGTAGCTAGT AGAGCTGTTT TTGAGGAGGC TGTGAAGTTG CGTGGACGGA AAAGCGGTAC GTACTTGGTA CATAACGGTG TTGACATCAA AAGGTTTAAC CCGAACCTTA ACGGCTCTCT TATTCGTAAA AGGCTTGGTA TAGAAAACAA GTTTGTGGTG TTTAGTGCTC GTCATCATAG ACCTGTGTAC GGCCTTGAAT ACCTGATAAA AGCAGCGGCC CTGGTGGTGA AACTTAGAAG CGATGTTGTC TTTGTAATTG GTGGTGAAGG TCCTTTAAGA ACATACCACG AAAAGCTTGT TGAAATGTTA AACTTGGAAA ACAATGTAAT TTTTACTGGC AGGATTCCGC GGGACGAGAT GCCTCACTAC TATGCAGCAA GCGACGCTGT GGTGGTTCCT TCGTTGCAGG AGGCATGGAG CCTTGTCGTG ACCGAGGCTA TGGCATCGGG TAAGCCCGTT GTGGGTACGA GAGTTGGGGG GATAGTGGAT CAGATAATTG ACGGCTATAA CGGATTCCTA GTTCCGCCTA GGGATCCAAA GGCTATAGCC GAGAAGATTC TCTGGCTCAT CGACAACCCT GACGAGGCTA AAAGGATGGG TATGAACGGC AGAAGATTAG CTGAGGAGAA GTTCGATATT GAAAAGAGAA TCGAAAAGAT AATTGGTATA TATAAAGAGC TTGTAGGGTA G
|
Protein sequence | MRLRGWLYTR ATRRRRMMEA GSSDISVAIV SSVVGKSPRE VTYSFVFDEA YRLVQRGVNV HVVRAAVEES SSSYGINFHG IRRRVDAEAF VELLKNLPLY SPFALLRNPL MLYWENLYAL NISNVVENLR IDLIHAHFAY PEGFSGLIAK RRTKKPLVVT LHGYDILVEP SVKYGIRLSK RYDALVREVL VNADAVIVAS RAVFEEAVKL RGRKSGTYLV HNGVDIKRFN PNLNGSLIRK RLGIENKFVV FSARHHRPVY GLEYLIKAAA LVVKLRSDVV FVIGGEGPLR TYHEKLVEML NLENNVIFTG RIPRDEMPHY YAASDAVVVP SLQEAWSLVV TEAMASGKPV VGTRVGGIVD QIIDGYNGFL VPPRDPKAIA EKILWLIDNP DEAKRMGMNG RRLAEEKFDI EKRIEKIIGI YKELVG
|
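The gene and protein sequences can be cross-checked by translating the nucleotide sequence, and the GC content field can be recomputed the same way. The sketch below assumes the displayed gene sequence is already the coding strand: the gene is annotated on the minus strand, yet the sequence begins with ATG and ends with TAG. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares this mapping with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence in this record
print(translate("ATGCGATTAA GAGGATGGTT"))  # MRLRGW
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the GC content field to be compared in the same way.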