Gene Htur_5249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5249 
Symbol 
ID8745797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp147272 
End bp148432 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID646515606 
Productglycosyl transferase group 1 
Protein accessionYP_003406553 
Protein GI284176276 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCG GTTTCTACCA CGATGCCGCC GGAACCCGCC ACGCCGGCGG GATCGCCGTC 
TACACGCAGC AGATGGCGGC CGCACTCAGT CGATCGAACG ACGTCTATCT CTATACGCAG
CGCGGAGAGC CCGCACCGAT CGTCCGCGAG TCGGACGTTA CCGTCATCGA GACCCCGTCC
TTCGACAGCG ACTGGCCGGT CTCGCTCGAG GAGGCGCTCC CGCTCGGCTC CCAGGACTGG
ACGAAAGCCC GAATGACGCT GTGGGCCGAG CGAAACGGCG TCATCGACCA CATCGACGAC
ACCCTGGACG TGCTGTTTAC CGCCCACTAT CTCGACGATC TCCTCCTGTC GAATCTGGTC
GACGTGCCCA CGGTCTACAC GTACCACCGG CTCTCGGATA TCGGGGTCGG TGCGAAACTG
CAACACGCGT TCTCCGCGAC GGAGCTGATT CTGGCCAACT CGCCGGAAAC CGCGGACCGA
GTGGAATCGG CGTTCGACGT CGCGGTCGAG GAAATCGTCT ATCCGGGCGT CGACACGGAC
CGGTTCCGGC CCGACGCCAA GCCCGTCATC TCGAGTTCCG ATCCGATCAT CCTCTTCGTC
GGCCGACTGG TCGAATCGAA GGGGATCGAC GAACTGCTCG AGGCGGTCGC CCGACTCGAG
GGCGACCAGG AGCTTCATGT GGTCGGGCGC GGCGACCAGG AGCGGATCCG CCGGCGGGCC
CGCGACCTCG GAATCGCGGA GTCGGTGGTG CTCCACGGCG AAGTTCCCCA CCCCGAACTG
CCGGGCTACC ACGCCGGCGC CGACGTGTTC TGCCTGCCGA GTCACGACGA GAGCTTCGCG
ATGGCCAACG TCGAGGCGAT GGCCTGCGGG CTGCCGGTCG TGACGGCCGA TCTCGAGGCG
ATCCGGACGT ACCTCGCCAA CGGCGACAAC GGACTCCTGG CTCGAGTCGG GGACTCACAA
GACCTAGCTG ACAAACTCAG GCTCGTACTC GAATCGTCGA CGTTGCGGGC GCGGCTCGGC
GAGCAGGCTC GTGCGGACGC GCAGGCGTTC GGGTGGCGAA CGCAGGCACG TCGACTCGAG
GCGTTCTGTT ACGACGCCCT CGACATCGAG GAGTCGGTCG AAGAGGGTCG GCCCGACCAG
CCACACCCGA GCACGGTTTA A
 
Protein sequence
MNIGFYHDAA GTRHAGGIAV YTQQMAAALS RSNDVYLYTQ RGEPAPIVRE SDVTVIETPS 
FDSDWPVSLE EALPLGSQDW TKARMTLWAE RNGVIDHIDD TLDVLFTAHY LDDLLLSNLV
DVPTVYTYHR LSDIGVGAKL QHAFSATELI LANSPETADR VESAFDVAVE EIVYPGVDTD
RFRPDAKPVI SSSDPIILFV GRLVESKGID ELLEAVARLE GDQELHVVGR GDQERIRRRA
RDLGIAESVV LHGEVPHPEL PGYHAGADVF CLPSHDESFA MANVEAMACG LPVVTADLEA
IRTYLANGDN GLLARVGDSQ DLADKLRLVL ESSTLRARLG EQARADAQAF GWRTQARRLE
AFCYDALDIE ESVEEGRPDQ PHPSTV