Gene Hmuk_2046 details

Gene Information

Locus tag: Hmuk_2046
Symbol:
ID: 8411577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1947876
End bp: 1948901
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 69%
IMG OID: 645020380
Product: glycosyl transferase family 2
Protein accession: YP_003177866
Protein GI: 257388093
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
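
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1947876
end_bp = 1948901
gene_length_bp = 1026
protein_length_aa = 341

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, expected_protein_aa)
```

Under these assumptions the span and the expected protein length agree with the reported 1026 bp and 341 aa.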


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTGCC GGGGGCAGGC CACGAGCGGG CACAGCAGCG ACACAGCGTC CGATCTTTCC 
CTCTCCGTGG TCGTCGTGAC CTACAACGAG GCCGATCGTA TCGAGGCGTG TCTCGATGCG
ATCTTCGAGG CGTGTCGCCG GTTCGAACGT ACCGAAGTCG TCATGGTCGA TTCGCGCTCG
ACCGACGAGA CCATCGCGCT GGCCGCCAAC TATCCGATCC GTGTCTACCG TCTCCCCGCG
TCCACGGACC GCACGCCCGG TGCCGGTCGA TACGTCGGGA CACAGGTCAC GTCGGCTGAC
CCCGTGCTGT TCGTCGACGG CGACATGATC GTCGAGCCCT CGTGGGTCGC GGCCGCCGCG
GCGCGGCTCC GGTCCGAGCC CGCGGTCGCC GGCGTCGACG GCTGTCTCAA CGACGCCTCC
GGACGGACCG AACGCCGCGT CGACACGCTT CGTGGCGTCG TACTGTACGA CCGGGCGATC
CTGGCGTCGG TCGGCGGCTT CGACCCGCAC CTGCAGGCCC TCGAAGACGT GGAGCTGGGC
TTCCGCCTCA GGAACGCGGG ATACCACCTG GTGCGGCTCC CGATCGTCGC CGCAACCCAC
CCCTTCGGCG ACGGGCTACC AGAGCTGCGT CGCCGGTGGC GCAGCGGCTA CTACTTCGGC
CGCGGGCAGG TCCTGCGCAA GTGGTCTCGA TCCCCGCGGA TGGTCGCGCG CGTGTGTCAC
TACTCTCGAC TCTACGCGGT GATGGGCGGC TGGACGGCGC TCGGAATCTT CGCAACCGGT
TCGCTGGGAC CGGTCGGGCT TCTGGCGTGG TGCTGCGTGA CGGCGGCGCT GGTCGGCGTC
TGTCTCCGAC TCAAGGGGCG GACCTGGGTC GAAAACAAGT CGATATCGCT CGCTCCCGTC
TGGGCGGGCG CACTCGTCGG CTTTCTCGGG CCGCACCCGC CGCCGTCTTC CTATCCGGTC
GGACGGGTCG AGCTGATCGC GACGCCGACC GGGCGGAGTT CCGGAGCGGT CGGAGGGATT
CGATGA
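
A GC percentage such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: the function name `gc_content` is hypothetical, and it assumes the sequence is given as plain text in which the spaces between 10-base blocks and the line breaks should be ignored.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above, used as a small example.
first_blocks = "ATGAGCTGCC GGGGGCAGGC"
print(gc_content(first_blocks))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.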
 
Protein sequence
MSCRGQATSG HSSDTASDLS LSVVVVTYNE ADRIEACLDA IFEACRRFER TEVVMVDSRS 
TDETIALAAN YPIRVYRLPA STDRTPGAGR YVGTQVTSAD PVLFVDGDMI VEPSWVAAAA
ARLRSEPAVA GVDGCLNDAS GRTERRVDTL RGVVLYDRAI LASVGGFDPH LQALEDVELG
FRLRNAGYHL VRLPIVAATH PFGDGLPELR RRWRSGYYFG RGQVLRKWSR SPRMVARVCH
YSRLYAVMGG WTALGIFATG SLGPVGLLAW CCVTAALVGV CLRLKGRTWV ENKSISLAPV
WAGALVGFLG PHPPPSSYPV GRVELIATPT GRSSGAVGGI R
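
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a small hand-built codon table; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it assumes the input is already in frame and read from the first base.

```python
from itertools import product

# Codons enumerated in TCAG order, with the matching one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGCTGCC GGGGGCAGGC CACGAGCGGG"))
```

For the first 30 bases this prints `MSCRGQATSG`, which matches the first block of the protein sequence; the final six bases, `CGATGA`, translate to `R` followed by a stop codon.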