Gene Information |
Locus tag | Tpen_1724 |
Symbol | |
ID | 4601749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1666467 |
End bp | 1667618 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639774497 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921122 |
Protein GI | 119720627 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
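The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1666467, 1667618)
print(length, protein_length_aa(length))  # prints: 1152 383
```

Under those assumptions the result agrees with the Gene Length (1152 bp) and Protein Length (383 aa) fields.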
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.4293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGAAA CCAGAATAAT ATTTTTAACG GACTACTTAA CAACTAAAAG TGGCGGTGTA GGTTTTCTCT ACGAAGTAAT GAAAAAAATA GCTGGAAAAT ATCGCATAGA TATAATTGCG GGGCGTGTTG AGGAAAGCCT CCAAAAGGAA GATTATGTAA GAATTTTGAA TCTTAATGTG TACCGGGATG ATCTTCCATC AGCACAACCC GAGAATGCAG TAAAATTTCT GAATTTATCG ACGAAAATGC TCAAAAGAAT AGTTAAGTGC ACCGGTGAGG AAGAGTTAAT CCTACACTTC AACAACCATT TTCCCAACTT AATACCCTGG TTTATTGTAA ATAGTGTGCC AAAAGTATGT TCAATACATC ACCTCGAAGA AACAGCGCAA TTCTCCGGGG TGATACCCAA GCTCGCCAAG GTAGCGGTAC AGGATGTATT CGAGGTCAAC AGCCCCTGCA CCGTCGTACT TACAGTCTCA AAGAGCGTCA GGCAAAAGCT AGCCTCGCTC AGAGCCGTGA GGAAGGGCGG CATCGTTGTG ATCCCTCCGG GCATAGATAC TGGGAAGTAC CTCTCGGTAC GCAGAGATCC AGAGGAAAAC ACCTTCATCA TGGTGGGAAG GCTGGAGAAA AGGAAGCACT ACGACCACGC GATAGTAGCC TTCAAAGCGG TAGCCAAGGC AGAGCCCAAC GCCAAGCTTC TCATAGTGGG CGAGGGGCCT CTACGACCGT ACCTAGCCCA GCTCATAAGA AAGTTTTCAC TCGTTAGGAA CGTTCAGTTG CTGGGATCAG TAAGCGAAGA GGAAAAGCTG AGTCTGCTTT CAAAAGCTCA GGCGCTGATC CACCTCGGGT ACCCCGAGGG ATTCGGCATC GTGCTCATAG AAGCGCTCGC CGCCGGAGTA CCAGTAATAG CCTACGACAT ACCACCGCTC AACGAAGTCG TGGAGCACGG TGCAACAGGC ATACTTGTGC CAAAGGATGA CGTAAGAGTG CTGGCCAGAG CTATAGTCAG GTTCAACAGC TATACCTTCG AGGAGAAAAC ACTGAGAAAG AGAGCCGAGC GCTACGACAT CAACATTATT GCAAGGGAGT TCGCCAGACT CTACGATACG CTAGCCTGCT GTAGGAGAAA TAATGCTGGA AGTATTCAAT GA
|
Protein sequence | MRETRIIFLT DYLTTKSGGV GFLYEVMKKI AGKYRIDIIA GRVEESLQKE DYVRILNLNV YRDDLPSAQP ENAVKFLNLS TKMLKRIVKC TGEEELILHF NNHFPNLIPW FIVNSVPKVC SIHHLEETAQ FSGVIPKLAK VAVQDVFEVN SPCTVVLTVS KSVRQKLASL RAVRKGGIVV IPPGIDTGKY LSVRRDPEEN TFIMVGRLEK RKHYDHAIVA FKAVAKAEPN AKLLIVGEGP LRPYLAQLIR KFSLVRNVQL LGSVSEEEKL SLLSKAQALI HLGYPEGFGI VLIEALAAGV PVIAYDIPPL NEVVEHGATG ILVPKDDVRV LARAIVRFNS YTFEEKTLRK RAERYDINII AREFARLYDT LACCRRNNAG SIQ
|
| |
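The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that grouping, compute GC content, and translate the coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene is on the minus strand) and relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
head = clean_sequence("ATGCGTGAAA CCAGAATAAT ATTTTTAACG")
print(translate(head))  # prints: MRETRIIFLT
```

Applying the same functions to the full gene sequence gives values that can be compared against the Protein sequence and GC content fields.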