Gene Information |
Locus tag | Tpen_1727 |
Symbol | |
ID | 4601752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1669535 |
End bp | 1670524 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774500 |
Product | glycosyl transferase family protein |
Protein accession | YP_921125 |
Protein GI | 119720630 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.350304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAAGGCTA GGGCGACGAG CACTCCCGCG AACCCCTGGG CCTCCGAGGT AACGATAGTC ATCCCAACGC TGAAAGAAGA GAGGGGGGTA GGGCTCGTGA TAGACGAGCT TAAAGGCGAG GGATGGACGA ACATACTCGT CGTCGACGGG GGAAGCACGG ACAGGACGAG GGAAGTCGCC GCGGAGAAGG GGGCCAGGGT CGTGCTCCAG GAGGGCCGGG GCAAGGCGGA CGCCGTTAGA ACCGCGCTCA GGTACGTCGA AACGCCCTAC ATAGTCGTGA TGGACGGCGA CTACACGTAC CCAGCGCGGC ACGTCGCAGA GCTTCTGAGG ACGGCTAGGG AGAGGGGGCT AGACGAAGTC ATAGGGGCAA GAACTAGGGG GCGCGAGAAC ATACCACTCC TCAACAGGTT CGGGAACTGG GTAATCACTG AGACGTTCAA CGTTCTCTTC GGGACAAACC TCTCCGACGT GTGTAGTGGG ATGTACCTCG TCAGGACGGA GGTAGCGAGG GAAGTCCAGT TCGAGTCCAA AGGCTTCAGC GTGGAGGTAG AGATAGCCGC TCACGTAGCG TCCACGACGA GGAGAATCGG GGAGATACCG ATAGAGTACA GGCCGAGGGT CGGCGAGCCC AAGCTCAGGA AGAGGCACGG GCTGAGAATA GTCCTCGACG CGTTCAGGCT CGCGCTCCGC TACAACCCCG TATTCCTATT CTTCTCCGCA GCCTCAATCG TACTCATACC ATCACTCGTG CTGGCAGCCT GGGTCGGCTA CCGCTGGCTA GTACAGGGAG TCAAGCACCA AGTATGGGGC ATAATAGCAA TCGTGGGGAC AGGCGTGGGC CTCGTAGCCC TGCTCAACGC GATAATGTCC CTCTACCTCA AAAGGCTGGA GCTCAGAATA ACAGAGCGAC TAACAAGGCT CGAAGCCGAG CTAAAAGCCA CCACAAAACA ACCGAGCAAG AAAGACGCCG ACTCTAGGCG ACTTCCATAA
Protein sequence | MKARATSTPA NPWASEVTIV IPTLKEERGV GLVIDELKGE GWTNILVVDG GSTDRTREVA AEKGARVVLQ EGRGKADAVR TALRYVETPY IVVMDGDYTY PARHVAELLR TARERGLDEV IGARTRGREN IPLLNRFGNW VITETFNVLF GTNLSDVCSG MYLVRTEVAR EVQFESKGFS VEVEIAAHVA STTRRIGEIP IEYRPRVGEP KLRKRHGLRI VLDAFRLALR YNPVFLFFSA ASIVLIPSLV LAAWVGYRWL VQGVKHQVWG IIAIVGTGVG LVALLNAIMS LYLKRLELRI TERLTRLEAE LKATTKQPSK KDADSRRLP