Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1722 |
Symbol | |
ID | 4601747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1664147 |
End bp | 1665331 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 639774495 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921120 |
Protein GI | 119720625 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.664722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACAGG TACAGGAATT CTCGCATAAA GGAAAATACT GCAAAAAGCT TAGAATAGCA TTCGTACACA ACTACTACAT CCACTATAGG GTTCCCCTAT TTAAAGCACT AAGCAGAGTT TTTCATGTAA AATTCTTCTT CGATGATGTC TACGAATACG TGAAGAAACC GGAAAAAGAA CTAGACTTCG TGATAAATAA AGGACCTAGG ATAAAAGGGA TAAGGCTTCC AGTCACACTT TTATTCCACT TGTTAAAGCA GAGACCACAT CTAGTAATTG CCGGCGACTC AACGTACCCC AGCACGCTAA TAGCCTTTCT TACCTCCAAG ATGCTAAGAG CGAAGTTTAT CCTGTGGGAA GAGAGGTGGT TCTGGCACAG CAGCCTCCTT TCAAATCTTC TATGGCCTTT CTCACGTACG GTGGCACTAA AAGCCGATGC GCTCATAGTA CCTGGCACGC TATCCAAAGA GTTTTACAAG AACATAGGAG TCGAGAAGGG AAGGATTTTC GTAGCACCGA ATGCAAGCTA CGTCGACATA AACGAGGAAA TTAAAAACAG GGCTAGAGCT CTTAGAAGAA AACTAGGCTT GGACAATAAG ATCGTAGTAC TTTACATTGG GAGAGTAATT CCTTTAAAAG GTGTCCACCT AATTCTCAAA GCTTTAACAA AAATAAATGA ATATAACCTG CACCTGTTGA TTCCCGGCAC GTTCGTCGAT CCTTGGTACA AGAGACTTCT TGACGATATT GTAAAGGCAA GCAAACTTGA AAGCAAGGTA ACAATGCTTA GTCTTAAGTT CGTCAGGGTA GAGGACAGGG GAATCTACTA CGAGCTAGCA GACATAGTTT GTTACCCTTC GTACTACGAG GCCTGGGGTA TGGTGGTTAA CGAGGCGGCA TATGCCGGGA AACCGGTAAT ATCCACGAGA ACGTGCGCCG CGGCATACGA CATACTCTTC GGACATCCCG AACTCGTAAT ACCTCCAGGA AACGTTGAAG AACTAGCCAA AAGCCTAAAA CTTTTAGCAA TGGATGCCAA TAAAAGAAAG GCTATCGGAA TGGAATTGAA ACGCTTAATA AGCGAGAAGT ACTCCTACGA GGAAATGCTG AAAGGCTTCC TCAAAGCCAT AAAATACACC TTGGTAAATC AGCTTACCGA GCAGTCACAA AACAAGCTAG ATTAG
|
Protein sequence | MKQVQEFSHK GKYCKKLRIA FVHNYYIHYR VPLFKALSRV FHVKFFFDDV YEYVKKPEKE LDFVINKGPR IKGIRLPVTL LFHLLKQRPH LVIAGDSTYP STLIAFLTSK MLRAKFILWE ERWFWHSSLL SNLLWPFSRT VALKADALIV PGTLSKEFYK NIGVEKGRIF VAPNASYVDI NEEIKNRARA LRRKLGLDNK IVVLYIGRVI PLKGVHLILK ALTKINEYNL HLLIPGTFVD PWYKRLLDDI VKASKLESKV TMLSLKFVRV EDRGIYYELA DIVCYPSYYE AWGMVVNEAA YAGKPVISTR TCAAAYDILF GHPELVIPPG NVEELAKSLK LLAMDANKRK AIGMELKRLI SEKYSYEEML KGFLKAIKYT LVNQLTEQSQ NKLD
|
| |