Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1592 |
Symbol | |
ID | 4600556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1543135 |
End bp | 1544379 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774365 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_920990 |
Protein GI | 119720495 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0955331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTAACTA TCAAAGCCAC CCCGAGGTAC TCCAGGAGAA TCGAGGACTA CGTGCCCGCC GTGGGTAGGG AGGATATCGA GGAGCTCGTA AGGGTCGCCA AGAGGTTAAA GGGGGTAAGC GTAGTACACG TGAACTCGAC GGCTTACGGC GGAGGAGTCG CCGAGATTCT TCACAGCATG GTCCCGGTGA TGAGTTCTCT CGGGATCGAC GCCAGGTGGG AGGTTCTGGA AGCCGAGGAC GAGTTCTTCC AGGTCACGAA GAAGATTCAC AACGGGCTTC AGGGTAACCC TTCGCTCGTA CTCACGGACG AGGACTGGAG AACCTACCTG AAGTGGAACC AGTATAACGC GGAGATCCTC GACCTCGACG CTGACGTGGT ACTCGTGCAC GACCCCCAGC CCATGGCGCT ACCTATGTTC AAGCGCGGTG CCAGGGGTGT CTGGGTCTGG AGGTGCCACA TAGACATCTC TTCCCCGAAT GCATCCTTCT GGGAAAAGCT CTCCCCCTTC CTCGGGTACT ACAGGGGGGT GATAGTACAC AGCGAGGAGT ACGTTAAAAA GGAGTTCGAG GACAGAGTCC TGGTATCCCC GCCTAGCATA GATCCTCTCA GCGACAAGAA CAGGGAGCTC GGCGAAAAAG AGGTGGAGAG CGTGTTTAAA CGCTTCGGAG TAGACCCCGA AAGACCCGTT ATCACCAAGG TTGCCCGCTT CGACCCTTGG AAGGACGTCT TCAGCGCGGT CGACGTCTTT AGAGAAGTGA AGAAGGAGGT CCCCGGCGCC CAGCTACTCC TCGTGTCCTC CATGGCGAGG GACGACCCAG AGGGCGCGGT TTTCTACGAG AAGGTACTCG GCTACGTTAA GGGCGAAGAG GGTGTACACA TACTGACAGA CGCTATCGGT GTCAGGGACT TGGAGGTGAA CGCCTTCCAG AGGGGAACCA CCGTGGGACT TCACACCGCT ATCCGCGAGG GCTTCGGTCT TGCGGTCACC GAGATGCTAT GGAAAAAAGT ACCCGTGGTG GCGAGGCCTG TTGGAGGCGT CAAGAAACAA GTGGTAGACG GGGTGACGGG ATTCACGGCT TGGAGCGTCC AGGAGCTGGC AGAGAGAGTT AAGGCCTTGC TGGCGGACAA CCAGCTTAGA GGGAGGTTGG GCGAGGCGGG GCGCGAGCAC GTGAGGAGGA ACTTCGTCAT TACGCAACAC GTGAAGAGGT ATCTATCCTT CTTCGGCGAG CTACTGGGAA AATAG
|
Protein sequence | MVTIKATPRY SRRIEDYVPA VGREDIEELV RVAKRLKGVS VVHVNSTAYG GGVAEILHSM VPVMSSLGID ARWEVLEAED EFFQVTKKIH NGLQGNPSLV LTDEDWRTYL KWNQYNAEIL DLDADVVLVH DPQPMALPMF KRGARGVWVW RCHIDISSPN ASFWEKLSPF LGYYRGVIVH SEEYVKKEFE DRVLVSPPSI DPLSDKNREL GEKEVESVFK RFGVDPERPV ITKVARFDPW KDVFSAVDVF REVKKEVPGA QLLLVSSMAR DDPEGAVFYE KVLGYVKGEE GVHILTDAIG VRDLEVNAFQ RGTTVGLHTA IREGFGLAVT EMLWKKVPVV ARPVGGVKKQ VVDGVTGFTA WSVQELAERV KALLADNQLR GRLGEAGREH VRRNFVITQH VKRYLSFFGE LLGK
|
| |