Gene Hmuk_1939 details

Gene Information

Locus tag: Hmuk_1939
Symbol:
ID: 8411467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1847941
End bp: 1850058
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 71%
IMG OID: 645020270
Product: Glycosyl hydrolase family 32 domain protein
Protein accession: YP_003177759
Protein GI: 257387986
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
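
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the listed values, but the record does not state them. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values taken from the Gene Information fields above.
length = gene_length_bp(1847941, 1850058)
print(length, protein_length_aa(length))  # 2118 705
```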


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.257645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGATA CGGCAGTGCG TATCGCGTCG CTCCACGACG GCGGGCGGTC CGTGGAGCAG 
GCCGCGGCCG ACGAGTGGGC CGAGACGGCG GGGTTGGACC TGACGCCCGT CGCCGTGAGC
GACCTCGTCG ACGGCGACCG CTCGCTGGCG GCGTTCGACA CCTGCTGGTG GCACGCCGAC
GAGCCCCCGA AGCTGGGATC GACTACCGAC CTCGGCGGCG TCCTCCGAGC GTTCGTCCGC
TCGGGCGGCG GGCTCGTCCT CTCGCTGGCG GCGCTGACCG CGGTGAACGA TCTCGGCATC
GACCCCGTCA CCCCCGACGC GGTCGGGCCG GACCCAGACG GCGGCCCGAC CGGTCCGCTG
ATCAAGACGG CCTACGACGA CCACCCTCTG TTCGAGCCAC TGGCCACGCG CCGGCCGCTC
ACCAGAGGCG GCACCGACGC GACGCTCGCG CGCTACGATT CGATCCTCCC GGAACGTGGC
GAGGTACTGG CAAGCACTGT CAGCGCCGAC CACGACGTGC CCGGCCTCGT GACCGTCGTG
GGCTGGTCGC TGGGTGACGG CTCCGTCGTC GGCCTCGGTG ACGGCCTCGT CTTCGCGGCC
GAGGGCGGCG ACCACGACCA GCGCGACAGG CTCGCCCGTG CGGTCCTGAC GAACGCCGCC
GACAGCGCGA TCCCCGAACG ACCACGGACC GGCGGCGAGA TCGCCGCCCT GCGCGATCGC
TTCGAGGACG ATCCACACCG CCCGCAGTTT CACCTCTCGA CGCCCGCCAA CTGGCTCAAC
GACCCCAACG GCGTCGTCCA GTGGAACGGT CGCTACCACG TCTTCTACCA GTACAATCCC
GGCGGCCCGT ACCACGACAC GATCCACTGG GGCCACGCGG TCAGCGACGA CCTCCTCCAC
TGGGAGGACG AACCCGTTGC GCTCGCGCCC TCGCCGGACG GTCCCGACCG CGACGGCTGC
TGGTCGGGCT GTGCGGTCGA CGACGACGGC GCGGCGACGC TCGTGTACAC GGGTGGGCGC
GGCCGCGACC AGCTGCCCTG TCTCGCGACG GCGGACGACC CCTCCCTGCG GCGCTGGGAG
AAAGCGACGG ACAATCCGAT CATCGAAGCG CCCCCGACGG AGCCGGACCT GCTCTCGACC
GAGGAGTGGA ACGGGGAGTT CCGAGACCAC TGCGTCTGGC AGGAGGGCGA GACCTGGTAC
CAGCTGATCG GATCCGGACT GGCCGACGGT GGCGGCGTCG TCCTGTTGTA CGAATCCCCG
GACCTCCGTG AGTGGGAGTA CCGTGGCCCG ATCCTTTCGG GGGACCGCGA CACGCCACAG
GAGACCGTCT GGGAGTGCCC GGAGCTACTC GACCTCGGGG AGAAGCAGCT GTTGCACGTC
TCGAACTACG AGGACGTGGT CTACTTCCTC GGCCAGTTCG ACGGCGCGAC CTTCGAGCCG
GAGCGACGCG GGACGCTGGA CCGCGGCGAC TACTACGCCC CGCAGTCGCT CCGGGCAGAC
GACGGCCGCC TGCTCACCTG GGGATGGGTC CCCGAGGCGC GCGACGTGAG CGCCCAGTGG
GACGCCGGCT GGTCGGGGAC GATGTCGCTC CCGCGGGAGC TGTCCCTCGG AGACGACGGC
CGCCTCCGAC AGCGACCGGC CCGGGAACTG ACCGAACTCC GTGGCGACTG CGAGTCCCGC
GAGAGCGTCA CGCTCGACGA CGGCGAGTCG ATCCGGCTGG ACACCGACGG ACGCTCGTTC
GAACTCGCCA CCCGCCTCTC GCTGGTCGAC GCCGACGCCG TCGAGGTGAC GCTGCTCGGG
ACGGCGACGG AGAACGAGCG CACGACGCTG CGATACACGA GAGACGACGT GCTCGAAGTC
GATCGCAGTC GCTCCAGCAG GGACCCGCGC GCGACGAGCG ACAGTCAGCA CGTGCCCGTC
ACACCGTACG ACGAGCCCCT GGACCTGCGC GTGTTCGTCG ACGGGTCGGT CGTCGAACTG
TTCGCCAACG AGCGCCACTG CCTGACGAGC CGCGTCTATC CGGCCGGCGA CGACGCGACC
GGCGTCGACC TCGCGACCGT CGGCGGCCGC GCGCGACTGC CGACAGTCGA AGCCTGGGAG
CTGGCGTCGA TCTGGTAG
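
The gene sequence is listed in space-separated blocks of ten bases, so it must be joined before any computation. The sketch below shows one way to strip the grouping, compute a GC fraction and check the terminal codon. The helper names are hypothetical, and the stop-codon set is the standard TAA/TAG/TGA set, assumed here to apply under translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form one of the assumed stop codons."""
    return seq[-3:] in STOP_CODONS


# First two blocks and last two blocks of the listing above.
first_block = clean_sequence("ATGACTGATA CGGCAGTGCG")
last_block = clean_sequence("CTGGCGTCGA TCTGGTAG")
print(first_block.startswith("ATG"), ends_with_stop(last_block))  # True True
```

Passing the full listing through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field. ATG is only the canonical start codon; table 11 also permits alternative starts.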
 
Protein sequence
MTDTAVRIAS LHDGGRSVEQ AAADEWAETA GLDLTPVAVS DLVDGDRSLA AFDTCWWHAD 
EPPKLGSTTD LGGVLRAFVR SGGGLVLSLA ALTAVNDLGI DPVTPDAVGP DPDGGPTGPL
IKTAYDDHPL FEPLATRRPL TRGGTDATLA RYDSILPERG EVLASTVSAD HDVPGLVTVV
GWSLGDGSVV GLGDGLVFAA EGGDHDQRDR LARAVLTNAA DSAIPERPRT GGEIAALRDR
FEDDPHRPQF HLSTPANWLN DPNGVVQWNG RYHVFYQYNP GGPYHDTIHW GHAVSDDLLH
WEDEPVALAP SPDGPDRDGC WSGCAVDDDG AATLVYTGGR GRDQLPCLAT ADDPSLRRWE
KATDNPIIEA PPTEPDLLST EEWNGEFRDH CVWQEGETWY QLIGSGLADG GGVVLLYESP
DLREWEYRGP ILSGDRDTPQ ETVWECPELL DLGEKQLLHV SNYEDVVYFL GQFDGATFEP
ERRGTLDRGD YYAPQSLRAD DGRLLTWGWV PEARDVSAQW DAGWSGTMSL PRELSLGDDG
RLRQRPAREL TELRGDCESR ESVTLDDGES IRLDTDGRSF ELATRLSLVD ADAVEVTLLG
TATENERTTL RYTRDDVLEV DRSRSSRDPR ATSDSQHVPV TPYDEPLDLR VFVDGSVVEL
FANERHCLTS RVYPAGDDAT GVDLATVGGR ARLPTVEAWE LASIW