Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1730 |
Symbol | |
ID | 4601755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1674348 |
End bp | 1675610 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774503 |
Product | glycosyl transferase family protein |
Protein accession | YP_921128 |
Protein GI | 119720633 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.461107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCGG AGCCGTCAGC GCTACTCGGA GCCGTAGCCC CTCCGGGATT GCTCCCGGAA GCCTTCTACC TGCTTCTAGA CGCCCTCTCC CTCCTCACCC TCGCGGGCAT ACTCGCGTGG TCCGCCTACC ACGCCCCCAT AATCGTCGCC GGGCTACTAG CCCCCCGGGG CAGCGGGGAC GACCCGGGGA ACGGGCTTCC CAGGGTGACA GTCATAGTGC CGTCGAAGGA CGAGGGGAGG CGCGTCGAGC GTTGCCTCAA CGCTATCCTA TCCTCGGACT ACCCCCTGGA GAAGCTCGAA GTGATAGTCG TCGACGCCAG CTCGGACGGG TACGTCGAGG AGATAGTGCG GAGAGCCGGA GAAAGGTACC CGGGCGCGGT CAGGTTGATA AGGGAGGAGG AGCCCCGCGG GAAGCCCGCC GCGCTGAACA GGGCGCTTAG GGAGGCGACG GGGGAGGTCG TAGCGGTGTT CGACGCGGAC AGCGTCCCCG AGAGGGACGC CATAAGGCGC GCCGTGAAGC ACCTCGAGGA GCCGGGGGTC GCCGCGGTCC AGGGGAAGAC GCTGGTACTC AACGAGCGCG AATCGGTGCT CGCGAGGGTA GCCTCCAAGG AGGAGAAGGC CTGGTTCCAC GCGCTCATAC GCGGGAGGGA GAGGCTCGGG CTCTTCGTAC CGCTCACCGG GAGCTGCCAG TTCGTTAAGA GGAGCGCGCT CGAGGAGGTC GGGGGGTGGA GGGAGGACGC CCTCGCGGAG GACCTGGAGC TATCGATGGA CCTCCTCGCC AGGGGCTACA GGGTGAAGTA CGCGAACGAC GTCGTCTCCT GGCAGGAGGC GCCGACCTCG CTGAGGAGCC TCGCCGTGCA GAGGAACAGG TGGTATAGGG GGTACATGGA GGCATTCGCG AGGCACCTGC GCCTCGCCCT CGCGGGCAGG AGGGGGCTGG ACGCCGCCAT CCTCTCGGCG GGGCCCTACC TGATGGCGCT CAGCCTCCTA GCGGTGGCCG CCTGGCTCGC CTCGACGGCT TTGCCCCACG TAAACCACTT CTCGACACCC GCCGCCCTCG TCGCCGCGCT GAACGCCGTG TCCCTCTTCT CGGTCAGCGT CGCCCTCGCG CTCAGCGAGA GGCCCGTAAG CGCGAAGAAC CTAGCCTGGG TCCCGGTTAT CTACGCGTAC TGGTTCACGC TCTCCGCGGT CGCCCTCCAC GCCCTCGCCG AGATAATCCT GAGAAGGCCG CGCGTCTGGA GAAGGACTCC GAAGCCCATA TAA
|
Protein sequence | MTSEPSALLG AVAPPGLLPE AFYLLLDALS LLTLAGILAW SAYHAPIIVA GLLAPRGSGD DPGNGLPRVT VIVPSKDEGR RVERCLNAIL SSDYPLEKLE VIVVDASSDG YVEEIVRRAG ERYPGAVRLI REEEPRGKPA ALNRALREAT GEVVAVFDAD SVPERDAIRR AVKHLEEPGV AAVQGKTLVL NERESVLARV ASKEEKAWFH ALIRGRERLG LFVPLTGSCQ FVKRSALEEV GGWREDALAE DLELSMDLLA RGYRVKYAND VVSWQEAPTS LRSLAVQRNR WYRGYMEAFA RHLRLALAGR RGLDAAILSA GPYLMALSLL AVAAWLASTA LPHVNHFSTP AALVAALNAV SLFSVSVALA LSERPVSAKN LAWVPVIYAY WFTLSAVALH ALAEIILRRP RVWRRTPKPI
|
| |