Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5249 |
Symbol | |
ID | 8745797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 147272 |
End bp | 148432 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515606 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003406553 |
Protein GI | 284176276 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCG GTTTCTACCA CGATGCCGCC GGAACCCGCC ACGCCGGCGG GATCGCCGTC TACACGCAGC AGATGGCGGC CGCACTCAGT CGATCGAACG ACGTCTATCT CTATACGCAG CGCGGAGAGC CCGCACCGAT CGTCCGCGAG TCGGACGTTA CCGTCATCGA GACCCCGTCC TTCGACAGCG ACTGGCCGGT CTCGCTCGAG GAGGCGCTCC CGCTCGGCTC CCAGGACTGG ACGAAAGCCC GAATGACGCT GTGGGCCGAG CGAAACGGCG TCATCGACCA CATCGACGAC ACCCTGGACG TGCTGTTTAC CGCCCACTAT CTCGACGATC TCCTCCTGTC GAATCTGGTC GACGTGCCCA CGGTCTACAC GTACCACCGG CTCTCGGATA TCGGGGTCGG TGCGAAACTG CAACACGCGT TCTCCGCGAC GGAGCTGATT CTGGCCAACT CGCCGGAAAC CGCGGACCGA GTGGAATCGG CGTTCGACGT CGCGGTCGAG GAAATCGTCT ATCCGGGCGT CGACACGGAC CGGTTCCGGC CCGACGCCAA GCCCGTCATC TCGAGTTCCG ATCCGATCAT CCTCTTCGTC GGCCGACTGG TCGAATCGAA GGGGATCGAC GAACTGCTCG AGGCGGTCGC CCGACTCGAG GGCGACCAGG AGCTTCATGT GGTCGGGCGC GGCGACCAGG AGCGGATCCG CCGGCGGGCC CGCGACCTCG GAATCGCGGA GTCGGTGGTG CTCCACGGCG AAGTTCCCCA CCCCGAACTG CCGGGCTACC ACGCCGGCGC CGACGTGTTC TGCCTGCCGA GTCACGACGA GAGCTTCGCG ATGGCCAACG TCGAGGCGAT GGCCTGCGGG CTGCCGGTCG TGACGGCCGA TCTCGAGGCG ATCCGGACGT ACCTCGCCAA CGGCGACAAC GGACTCCTGG CTCGAGTCGG GGACTCACAA GACCTAGCTG ACAAACTCAG GCTCGTACTC GAATCGTCGA CGTTGCGGGC GCGGCTCGGC GAGCAGGCTC GTGCGGACGC GCAGGCGTTC GGGTGGCGAA CGCAGGCACG TCGACTCGAG GCGTTCTGTT ACGACGCCCT CGACATCGAG GAGTCGGTCG AAGAGGGTCG GCCCGACCAG CCACACCCGA GCACGGTTTA A
|
Protein sequence | MNIGFYHDAA GTRHAGGIAV YTQQMAAALS RSNDVYLYTQ RGEPAPIVRE SDVTVIETPS FDSDWPVSLE EALPLGSQDW TKARMTLWAE RNGVIDHIDD TLDVLFTAHY LDDLLLSNLV DVPTVYTYHR LSDIGVGAKL QHAFSATELI LANSPETADR VESAFDVAVE EIVYPGVDTD RFRPDAKPVI SSSDPIILFV GRLVESKGID ELLEAVARLE GDQELHVVGR GDQERIRRRA RDLGIAESVV LHGEVPHPEL PGYHAGADVF CLPSHDESFA MANVEAMACG LPVVTADLEA IRTYLANGDN GLLARVGDSQ DLADKLRLVL ESSTLRARLG EQARADAQAF GWRTQARRLE AFCYDALDIE ESVEEGRPDQ PHPSTV
|
| |