Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0087 |
Symbol | |
ID | 4601398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 68664 |
End bp | 69788 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639772841 |
Product | glycosyl transferase family protein |
Protein accession | YP_919500 |
Protein GI | 119719005 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCCT CGGAGAGGGC TCGAGTAAGT GTAATAGTGA CGGTCTACCG TCACGCGAGG CGCCTGCGGA GTCTCCTCGA AGCGCTGTGC AACCAGGAAA CCTCTTTCCC CGTGGAAGTG ATTGTAGTCG CGGATGAGCC GGACGAGGAA GTCTTAGGCA TCCTGAGGGA AAAGGCTTGC GTTAAGAGCT TGGTCTCCGA GAATAGGAGG GGGAAGGTTA GGGCTCTTAA CGAGGCGATC TCGCTGAGCC AGGGCGACGT CCTGATTTTT CTCGACAACG ACGTGACCGT ACAGGACAGG AAGTTCGTTG AGAAGATCTA CAAGTGGCTA CAGGATTTCG ACGTAGCAGA GATCAAGAAG ATCGCCCGGG TAGACACATT CATCGGTAAA CTCGTCTACT ACGACTACAT GTCATTTGGC GTTGCGAGCT ACATCTTCGA GAAGAGAGTG AAGAGGTGCG CTGGCTTAAA CGGTGCGGCG ATGGCTTTCA CGAGGAAAGC TCTCAAGGAG CTTGGCGGTT ACAGAAACGT GGTTCTAGAA GACATGGATA TAGGCTTTAG GAGCTTCTTC CACGGGTTCA GGTACAAGTA CATCTGGGAT ACCGAGGTCG TGGTAGACCC TCCCTCATCT CTCAGAGAGT GGCTTAACCA GAGGCTAAGG TGGTCTGTGG GTGCTTGGAC CTGGATAGAT GACTACGTGT TCCACTTCTC GAAAATAGCG AGCATCGATT ACGCGCTGGA GAGCTTCGCT GCGCTTTTCG CAATGTTTCC CGGAGGAATC GTCTACGCGT CTATACTACT CGCAGAGGGG CTACCGCTGT TGAAGCTTGG ACTGCTCGCA GGATCCACAG TCGGCGGACT CTTCGCGCCG CTCGTACCGG TTCTCGCCAT CTACGAAACT GTATCGATGT TCCTCCCGCC TCTACCATTA TCCCTTGTAG CCATTGCGGT TGCTTATTCC GCCTTGGTAG TACCCATCGC GTACAGGATA GGCTACAAAG TCAAGCCGCA ACACTTCGCT ACGTATATCC TATTCTACTC CACCCTCTGG TTCACGGTTA TGCTGGCAGG GTTCATAAGA GTGTTCGTAT TCCGAAAGAG GGACGTCACC GGGTGGCAAC TCTAG
|
Protein sequence | MSSSERARVS VIVTVYRHAR RLRSLLEALC NQETSFPVEV IVVADEPDEE VLGILREKAC VKSLVSENRR GKVRALNEAI SLSQGDVLIF LDNDVTVQDR KFVEKIYKWL QDFDVAEIKK IARVDTFIGK LVYYDYMSFG VASYIFEKRV KRCAGLNGAA MAFTRKALKE LGGYRNVVLE DMDIGFRSFF HGFRYKYIWD TEVVVDPPSS LREWLNQRLR WSVGAWTWID DYVFHFSKIA SIDYALESFA ALFAMFPGGI VYASILLAEG LPLLKLGLLA GSTVGGLFAP LVPVLAIYET VSMFLPPLPL SLVAIAVAYS ALVVPIAYRI GYKVKPQHFA TYILFYSTLW FTVMLAGFIR VFVFRKRDVT GWQL
|
| |