Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1746 |
Symbol | |
ID | 4601771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1688577 |
End bp | 1689617 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774519 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921144 |
Protein GI | 119720649 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.418679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACGT TTAGGTTGCT CGTTGTTTCG CCGAGGGTTA GCGGGCTGGG CGGCGTGGCG CAACACGTCG GGAAGCTCGT CGAGTTGCTT CGGCGCGATG GGCACGAGGT CGAGGTTGTC TCCGCCGAGA ATACTCCTAT TCTGCCTGTG AAGGGCTTGA TGAACCCGAG CTTCGCGGCT ACGTCTGCTT TGAAGGTGGC ACTTGGCAGG CTGAAGGGGC GTAGGTACGA CGTTGTACAC GCGCATAACG TGCCGTCGGC GCCCGCGATG CGCGCGGCGA GGGGAGGGAG GGTTTTAACG CTCCACGGGG TTTTCTCGGA GCAGGTCGGC TACCTTCACG GCGGCTTGCT GGGCAGGCTG AGCGGGGTCG CCGAGAGGGT TGCGCTGGGA TGGGCGGACC GCGTGACGTC TGTTTCCAGG GCGACCGCCG AGCACTACTC TAGGATCGGC GTAAACGTCG TCCATGTTCC GAACGCCGTC GACCCGTCGG ACCTGCCGGG TGAAGGGGAG AGGATGTACG AAAGGCAGGT TGTCTACAGC GGTAGGCTGT CCAGGGAGAA GGGCGTCGAC CTCCTGGTGA AGGCTTTCAG GGCTCTCGAC GTTGATGCGC ACCTCGTCGT GGTCGGCGGT GGCCCCCTGG AGGAGGAGCT GAGGAGCCTC GCGGGGGGCG ACCCGAGGAT CCACTTCCTC GGCCCCATGC CGAGGGAGCG TGCGCTCAGA GTGGTGAAGG GGTCCGATGT CTTCGTGTTG CCGTCCCGCT ACGAGGGGCT TAGCACCGCG CTCTTGGAGG CGATGGCTAT GGGAGTCCCC GTCGTCGCCA CGAAGGTTGG AGGGAACACC GAGCTGGTAG AGGATGGGAA GACGGGGCTA CTCGTCGAGC CATCCCCGGA GGAGGTGGCG CGCGCCGTCA GGCTGCTCCT GGAGGACAGC GACCTCGCGG CCCGCCTGGC CTCGGCCGCG AAGAGAGTCG TGGCGGAGAA GTACAGCTGG GACAAGGTGT ACGCGCAGTA CCTCGATGTT TACAGGGAGG TTGCCCGGTA G
|
Protein sequence | METFRLLVVS PRVSGLGGVA QHVGKLVELL RRDGHEVEVV SAENTPILPV KGLMNPSFAA TSALKVALGR LKGRRYDVVH AHNVPSAPAM RAARGGRVLT LHGVFSEQVG YLHGGLLGRL SGVAERVALG WADRVTSVSR ATAEHYSRIG VNVVHVPNAV DPSDLPGEGE RMYERQVVYS GRLSREKGVD LLVKAFRALD VDAHLVVVGG GPLEEELRSL AGGDPRIHFL GPMPRERALR VVKGSDVFVL PSRYEGLSTA LLEAMAMGVP VVATKVGGNT ELVEDGKTGL LVEPSPEEVA RAVRLLLEDS DLAARLASAA KRVVAEKYSW DKVYAQYLDV YREVAR
|
| |