Gene Hmuk_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1997 
Symbol 
ID8411526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1901801 
End bp1902880 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content67% 
IMG OID645020329 
Productglycosyl transferase group 1 
Protein accessionYP_003177817 
Protein GI257388044 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.723795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0390531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCACGC TGAACTACCT GGAGGTCGCG GGCTGGCTGG ACCGCAGCGG AATCGGTACG 
TCGGTCGAAC ACCAGCGAGC GGCGCTCGCC GACAGAGACG TCGAGGTCGT CACCTCGCCG
TGGGAGGGCG GTCATCCGGT CGACGCCGTC CGGTCGAAGC TCACCGGCGG GCGCGCGTTC
ACCGACGTGG ACCTCGTCCA CTGCAACATG ATCGGCCCGG GAACGGCCGC GACCATCAAG
CACGCCCAGC GGACCGACAC GCCGGTAATC TGCCACGCAC ACGTCACTCG CGAGGACTTC
CGAGACAGTT TCCGCGGGGC CAACGTCGTC GCCCCGGCCC TGGGGAGGTA CCTCAAGTGG
TTCTACTCGC AGGCCGACCT CGTGCTGTGT CCCAGCGAGT ACACGAGAGG GGTGTTGCAG
TCGTATCCGA TCGACGCGCC GATCCGGCCG ATCACGAACG GGATCGACCT CGACCGGCTG
ACGGGGTACG AGGAGTTCCG CGAGGAGTAC CGCGAGCGCT ACGGCATCGA GGGGATGGGG
ATCTTCGCCG TCGGCAACGT CTTCGAGCGC AAGGGGCTCT CTACCTTCTG TAGGGTCGCC
CGGCGGACCG ACTACGACTT CACCTGGTTT GGCACCTACG AGACCGGACC GAGCGCGTCC
GCGACGGTGC GCAAGTGGAC CGGTGATCCG CCGGACAACG TCACGTTCTC GGGGTGGGTC
GACGACATCC GCGGGGCCTA CGGGGCCGGC GACGTGTTCA TGTTCCCCGC GAAGGTCGAG
AACCAGGGCA TCGTCGTGCT CGAAGCGATG GCCTGCGGGA AAGCCTGTGT GATTTCGGAC
ATCCCCGCCT TCTCGGAGTA CTACGAGGAC GGCCACGACT GCCTGATCTG CTCGTCCGAG
CGGGAGTTCG TCGACGCGCT CGAACGGCTG GAAGCGAATC CCGATCTCCG GGAACGGCTG
GGCGAGAACG CGAAAGCGAC CGCTCGCGAA CACGGACTCG ACCGGGTCGG CGAACAGCTG
ACGGACATCT ACGAACGGGT CCTCGACGGG GACGTGCCAG AGGCTGTCGG CGAGAGATAG
 
Protein sequence
MRTLNYLEVA GWLDRSGIGT SVEHQRAALA DRDVEVVTSP WEGGHPVDAV RSKLTGGRAF 
TDVDLVHCNM IGPGTAATIK HAQRTDTPVI CHAHVTREDF RDSFRGANVV APALGRYLKW
FYSQADLVLC PSEYTRGVLQ SYPIDAPIRP ITNGIDLDRL TGYEEFREEY RERYGIEGMG
IFAVGNVFER KGLSTFCRVA RRTDYDFTWF GTYETGPSAS ATVRKWTGDP PDNVTFSGWV
DDIRGAYGAG DVFMFPAKVE NQGIVVLEAM ACGKACVISD IPAFSEYYED GHDCLICSSE
REFVDALERL EANPDLRERL GENAKATARE HGLDRVGEQL TDIYERVLDG DVPEAVGER