Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0205 |
Symbol | |
ID | 4602216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 183990 |
End bp | 185009 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639772959 |
Product | glycosyl transferase family protein |
Protein accession | YP_919618 |
Protein GI | 119719123 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.234288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTACCCAA AGGTAAGCAT AGTCATAGTG AACTTTAAGG GAACAGAAGC ACTGAAAAAG TGTCTGAGAA GCGTTTTCGA AACCGAGTAC CCCGACTACG AGGTCATAGT TGTAGACTCC CTGACCGATA ACGTCGAGAA AGCTTTGAGG GACGAGTTCG GATCTAAGGA AAACTTGAAG ATCATTCACT TCGACTCGAA CATCGGCGCC TCGGGATCCC ACAACGTAGG GGCGATGGCG AGCGACCCTA ACAGTAAGTA CCTGGTGTTC CTCGACAACG ACGTTGAAGT AGAAAAGGAC TGGTTGAAGA GGCTGGTCGA GACCGCCGAG GAGAGCCCGA GGATCGGATG CGTTCAGGCG AAAGTGATCT CGAAGAGCAA TGAGGGTAGG ATGGATCACG CAGGGCTAGC GTTAGACTTG ACGGCGACGT GGCTTTCTAC GTACGGGTTT AGAGAAGAGA TATTCCAGCG CCCGATAGAG TTGTTCGTCG CCAGTTCGGC TGCGCTTTTA ACCCCACGCG AGCTCTACTT TAAGGTAGGC GGGTTCGACA GCTCCTACTT CATATACGAC GACGACACGG ATTACACGTG GCGCGTTAGG CTTCAGGGCT ACATCTCCCT CCTGGAGACA AGGGCTCGCG TATACCACGA GGACAAGATA AGCTCGAGGC TACGCTTCGA CAAGCTGTAC TTCGGGTATC GGAACAGGCT TCAGAACATC GTGAAAAACA TGGACGCGAA GAACATGGTC GTTAGCCTGC TGGTGACCCT CTACCTTGGA TACCTGGTAA CGGTACTCCT AGCGCTTGCC GGCAGGATCA GGGAGACAGC CGCATACTTC TTGTCGTCTA CGAGCGTCGT GTTCTCGCTA CCAAGGCTTA TGTGGAAGCG TAAGCTAGTC TCGCTGAAGA GAAGGGTTCC CGACAGCTAC TTCGAGAAGA AAGGCTTCCT TAGGAAGGAC CTTCTCGGGA CGATCTACAT GACGAGGGCA TTGCTTATTC GCTCGGTAAG GAAGAAGTAG
|
Protein sequence | MYPKVSIVIV NFKGTEALKK CLRSVFETEY PDYEVIVVDS LTDNVEKALR DEFGSKENLK IIHFDSNIGA SGSHNVGAMA SDPNSKYLVF LDNDVEVEKD WLKRLVETAE ESPRIGCVQA KVISKSNEGR MDHAGLALDL TATWLSTYGF REEIFQRPIE LFVASSAALL TPRELYFKVG GFDSSYFIYD DDTDYTWRVR LQGYISLLET RARVYHEDKI SSRLRFDKLY FGYRNRLQNI VKNMDAKNMV VSLLVTLYLG YLVTVLLALA GRIRETAAYF LSSTSVVFSL PRLMWKRKLV SLKRRVPDSY FEKKGFLRKD LLGTIYMTRA LLIRSVRKK
|
| |