Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1743 |
Symbol | |
ID | 4601768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1684774 |
End bp | 1686036 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639774516 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921141 |
Protein GI | 119720646 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTCGA GTAAGCCTCA ACTAGCGGTG GTCGCTAACC CCTCGGAGCA CGGAAGCGGC GGCGAGCTTA GGGCTTTAAG GTCTGTAAAG GAGTATGCTA AACATTTCCA CGTCTACCTT TTCACCCCTC TACGCGCGGT AGGCTCTACG TCCTTAAGGA GCCTTGCGCG GCTGAAGAGC TTAGGCGTAG CTCTCTCGGG ATGCATTACG TATCCGAGAA TTGGGAAGCT GGAGCCCGAA TTCAAGCTTT TTTTCCCGCA ATTAACTAGG CTAACGACGA CAAAACTCCC CTTCTTCGAC GGCATCGTAG TGCTACACGA GAATATAGAC TATCTATACG CTGGCTACCT TCTCGGCAAA GCTAGCGGAG CGCCGGCGAT GGTTCTCCTG CAGAACCCCC CTCTTTTCGG GTCAAAGAAA AGGCTCAGTG AGATACTCAA AGCTGTATAC CTGTGGGAGA AGCTCACTTC AGCTTCCACT CTCGAGGAAG CTTATGCTAC CGTGCGCCTT GCAAGGCTAC AAGTGAGAAG ACCAATAGAG GAGGCCAGAA TTCGAATGTT GCTCAACAAG TACTCGCTCG TAGTAGGGGT GAGCAGAGCA ACAGTTCTAG AAATGGGGGA ACCCTGGTTC AGCAAGGCTT TCTACCTAGA TCCGGGTGTA AATCTAAACG AGGAGGAAGT ACATTTATTG AGCAGGATTC GCAAAAGTGT TAAAGAGAAG GAAAACTGCA TACTGTACAA AGGCGGCCTT ACTCCCGTGA AAGGTATCTT AGACGTTCTC CTTGCCTACA GACTCATTAG GAAGGAGAGA GGCGATCTGA AACTCGTCAT CACAGGGAAA CTGGACAGCA AAACTCATAG AAAGCTTCTC GTAGCACTTA AAAAGCTAAA CCTCAAGGAC TACGTTGTCC TTACAGGATT TATATCAAGG GAGCAGCTCT TCGAGCTCAA CGCTAAGGCA AGGCTCCTGC TCCACCCAAG CCATATGGAT TCATTTCCAT ACACTGTCCT GGAGTCTTTA CACGCAGGAA CCCCCGTCGT AGGCTACGAT ATACCCGCGC TCAAAATATA CTACGGCGGG CTTCCCGGAG TCAGGCTTGT ACGGGAAGGC GACGTAGAAG CCTTAGCAAC AGAGGCTATC GACATGCTGG AGAGAAGCCC CAAAGAGGTT AATCCTCCAA GCTTGAAAAG CTGGAACGAG ATAATAGAGG AGGAGCTTAG CTTGGTTAAA AAAAAATTGC TAGGCGAACA TAGACACTCG TAA
|
Protein sequence | MHSSKPQLAV VANPSEHGSG GELRALRSVK EYAKHFHVYL FTPLRAVGST SLRSLARLKS LGVALSGCIT YPRIGKLEPE FKLFFPQLTR LTTTKLPFFD GIVVLHENID YLYAGYLLGK ASGAPAMVLL QNPPLFGSKK RLSEILKAVY LWEKLTSAST LEEAYATVRL ARLQVRRPIE EARIRMLLNK YSLVVGVSRA TVLEMGEPWF SKAFYLDPGV NLNEEEVHLL SRIRKSVKEK ENCILYKGGL TPVKGILDVL LAYRLIRKER GDLKLVITGK LDSKTHRKLL VALKKLNLKD YVVLTGFISR EQLFELNAKA RLLLHPSHMD SFPYTVLESL HAGTPVVGYD IPALKIYYGG LPGVRLVREG DVEALATEAI DMLERSPKEV NPPSLKSWNE IIEEELSLVK KKLLGEHRHS
|
| |