Gene Information |
Locus tag | Tpen_1744 |
Symbol | |
ID | 4601769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1686040 |
End bp | 1687071 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774517 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921142 |
Protein GI | 119720647 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
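The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Tpen_1744.
length_bp = gene_length(1686040, 1687071)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields of the record.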
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTGC TACTTACACC CCACGTGGAG CACTACACCG TGGGGCTTGC CGGCGAGCTA TCGAGAAGAA TCAAGGTCGA GCTCCTGGCG TTCGCACGAT ACCCCGTCTC CGCGAGGCAG ATGGTAGTCC CCCGCCTCCC GGTACCGGGG CACCGGGACC TACTCTACAA GTACGCCCTT AAGGCGCTGG CCCGGCGCTA CGACGTAGTA CACGTGAACA CCGCCAGCCA CGGAGCCCTC TTAGGCCCCC GCGACAACCT GCTACTAACG GAGCACGGCT GGCCTGAGCC AGAGCTCGTG GAAAGAGAGC AGCGCCGCTT CTACGAGAAG GAGCGCGAGA GCCTGCTTCA GCTATACGAG GCAGGGGTCC CCGTGGTAAC TATAAGTAAC TTCTCGGCTA GGATGCTCCG CGAGAGGCTG GGAGTAAAGG CTACGGCTGT CGTGTACCAC GGGGTCCTGG ACACCTTCCG AAGGAATAAG CCGCTGGAAC CGCCCATACA CCACGTCGTG CTCTGGAACT CGCGTCTCGT AGGATTCAAG GAGCCGTTCA CCTTCCTAGA GGCGATCAGG CTAGTGAAGG GGAAGGCGAG CTTCGAAGCG GTCATAAGGG GGGATGGGCC TCTAAAACGG GAAGTTGAAG GCTACCTAAG GAGGCATGGG CTCGAAGACA CGGTGCGCTT CGCCGAGAGG ATACCCTTCG AAAAGCTCCC CCAGCTTTAC CGCTCGGCAA CAATATTCAT TCACACATGC TCCAAAGAGC CTTTCGGGCT CGCCGTGCTG GAGGCTATGG CGAGCGGTCT ACCCGTCATC GTACCCGACG CCGGAGGCGC GGCGGAGGTG GCAGGCGACG CCGGGTTGAA GTTTCGGCCC GGCGACAGCG AGGACCTGGC GGAAAAGCTC CTAGTGTTGC TGACGGACCA ACAGCTTTAC GAGACCTTCT CCGCTAGGAG CATCGAGAGG TCCGCATTCT TCACTTGGGA AAAAGCCGCT TCCACTTACC TGGATCTTTA CAGGAAGATC TCCGGTGCGT GA
|
Protein sequence | MKVLLTPHVE HYTVGLAGEL SRRIKVELLA FARYPVSARQ MVVPRLPVPG HRDLLYKYAL KALARRYDVV HVNTASHGAL LGPRDNLLLT EHGWPEPELV EREQRRFYEK ERESLLQLYE AGVPVVTISN FSARMLRERL GVKATAVVYH GVLDTFRRNK PLEPPIHHVV LWNSRLVGFK EPFTFLEAIR LVKGKASFEA VIRGDGPLKR EVEGYLRRHG LEDTVRFAER IPFEKLPQLY RSATIFIHTC SKEPFGLAVL EAMASGLPVI VPDAGGAAEV AGDAGLKFRP GDSEDLAEKL LVLLTDQQLY ETFSARSIER SAFFTWEKAA STYLDLYRKI SGA
|
| |
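The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch for stripping that layout and recomputing the GC content field follows; the helper names are hypothetical, and the record's reported value (60%) is presumably rounded, so a small difference would not be surprising.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full "Gene sequence" value in place of this short prefix,
# which is only the first three blocks of the record.
prefix = "ATGAAGGTGC TACTTACACC CCACGTGGAG"
print(len(clean_sequence(prefix)), round(gc_percent(prefix), 1))
```

Applying `clean_sequence` to the full Gene sequence and Protein sequence values also lets their lengths be compared against the Gene Length and Protein Length fields.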