Gene Hmuk_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2022 
Symbol 
ID8411553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1926111 
End bp1927169 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content69% 
IMG OID645020356 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_003177842 
Protein GI257388069 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0549776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.843248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGTCC TCACCGTCGT CGGCGCGCGT CCACAGTTCG TCAAGTCGTT CCCCGTCTCG 
CGTGCGCTCG CACCCGACCA CGAGGAGGTG CTCGTACACA CCGGCCAGCA CTACGACGAA
GAGCTGTCGG CGGTGTTCTT CGACGACCTG GATCTGGACC AGCCCGACTA CAACCTCGGA
GTGGGCTCGC ACACCCACGC GGTCCAGACC GCCCAGATCA TGGAACGACT GGACCCGATC
GTCGCCAGGG AAGCGCCCGA CGTACTGCTG TTGTACGGCG ATACGAACTC GACGGTTGCC
GGAGCGCTCG TCGGGGCCAA CCGAGACGTG ACGGTCGCTC ACGTCGAGGC CGGACTGCGG
AGCGGGACGC GATCGATGCC CGAAGAGACC AACCGCATCG TCACGGATCA CGTCGCCGAC
GTGCTCTGTA CGCCCTGCCG GGCGGCGACC GAGACGCTCG AACGGGAGGG GCTGGGTGAT
CGGGTCCACG AGACCGGCGA CGTGATGTAC GACGCGCTCC GCTGGGCCGA GCGGATCGCG
CGCGACGAGT CGGCCGTCCT CGATCGGCTC GGGCTCGACG AGTCGTTCGT CCTCGCGACG
GTACACCGGC CCCGCAACAC CGACGATCCC GACCGCCTCG CCGCCATCGT GGAGGCGCTG
GTGACCCACC CCGCGCCGGT CGTCTTTCCG GTTCACCCCC GGACTGCGGC CGCGCTCCGA
GATCAGGGGC TGTTCGAGAC GGTCCAGTCC GAACTGCACT GTATCGACCC CGTCGGCTAC
CTGGATTTCG TCCGCCTGCT CGACGCCGCC GATCGGGTGG TCACCGACTC CGGGGGCGTC
CAGAAGGAGG CGTTCTTCCT CGAAACGCCC TGCGTGACGT TGCGCGAGGA GACCGAGTGG
GACGAGACGG TCGCCGAGGG GTGGAACAGG CTCGTCGGCG CACGGACGAC GGCGATCCGC
GACGCGCTGG CGACGCCGGT CGACGCCACC GACAGGGGCC ACCCCTACGG CGACGGCGAC
GCGGCCGAGC GCGTCGTCGA GGTGATCGCC GATGGGTGA
 
Protein sequence
MRVLTVVGAR PQFVKSFPVS RALAPDHEEV LVHTGQHYDE ELSAVFFDDL DLDQPDYNLG 
VGSHTHAVQT AQIMERLDPI VAREAPDVLL LYGDTNSTVA GALVGANRDV TVAHVEAGLR
SGTRSMPEET NRIVTDHVAD VLCTPCRAAT ETLEREGLGD RVHETGDVMY DALRWAERIA
RDESAVLDRL GLDESFVLAT VHRPRNTDDP DRLAAIVEAL VTHPAPVVFP VHPRTAAALR
DQGLFETVQS ELHCIDPVGY LDFVRLLDAA DRVVTDSGGV QKEAFFLETP CVTLREETEW
DETVAEGWNR LVGARTTAIR DALATPVDAT DRGHPYGDGD AAERVVEVIA DG