Gene Information |
Locus tag | Hmuk_1939 |
Symbol | |
ID | 8411467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1847941 |
End bp | 1850058 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645020270 |
Product | Glycosyl hydrolase family 32 domain protein |
Protein accession | YP_003177759 |
Protein GI | 257387986 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.257645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATA CGGCAGTGCG TATCGCGTCG CTCCACGACG GCGGGCGGTC CGTGGAGCAG GCCGCGGCCG ACGAGTGGGC CGAGACGGCG GGGTTGGACC TGACGCCCGT CGCCGTGAGC GACCTCGTCG ACGGCGACCG CTCGCTGGCG GCGTTCGACA CCTGCTGGTG GCACGCCGAC GAGCCCCCGA AGCTGGGATC GACTACCGAC CTCGGCGGCG TCCTCCGAGC GTTCGTCCGC TCGGGCGGCG GGCTCGTCCT CTCGCTGGCG GCGCTGACCG CGGTGAACGA TCTCGGCATC GACCCCGTCA CCCCCGACGC GGTCGGGCCG GACCCAGACG GCGGCCCGAC CGGTCCGCTG ATCAAGACGG CCTACGACGA CCACCCTCTG TTCGAGCCAC TGGCCACGCG CCGGCCGCTC ACCAGAGGCG GCACCGACGC GACGCTCGCG CGCTACGATT CGATCCTCCC GGAACGTGGC GAGGTACTGG CAAGCACTGT CAGCGCCGAC CACGACGTGC CCGGCCTCGT GACCGTCGTG GGCTGGTCGC TGGGTGACGG CTCCGTCGTC GGCCTCGGTG ACGGCCTCGT CTTCGCGGCC GAGGGCGGCG ACCACGACCA GCGCGACAGG CTCGCCCGTG CGGTCCTGAC GAACGCCGCC GACAGCGCGA TCCCCGAACG ACCACGGACC GGCGGCGAGA TCGCCGCCCT GCGCGATCGC TTCGAGGACG ATCCACACCG CCCGCAGTTT CACCTCTCGA CGCCCGCCAA CTGGCTCAAC GACCCCAACG GCGTCGTCCA GTGGAACGGT CGCTACCACG TCTTCTACCA GTACAATCCC GGCGGCCCGT ACCACGACAC GATCCACTGG GGCCACGCGG TCAGCGACGA CCTCCTCCAC TGGGAGGACG AACCCGTTGC GCTCGCGCCC TCGCCGGACG GTCCCGACCG CGACGGCTGC TGGTCGGGCT GTGCGGTCGA CGACGACGGC GCGGCGACGC TCGTGTACAC GGGTGGGCGC GGCCGCGACC AGCTGCCCTG TCTCGCGACG GCGGACGACC CCTCCCTGCG GCGCTGGGAG AAAGCGACGG ACAATCCGAT CATCGAAGCG CCCCCGACGG AGCCGGACCT GCTCTCGACC GAGGAGTGGA ACGGGGAGTT CCGAGACCAC TGCGTCTGGC AGGAGGGCGA GACCTGGTAC CAGCTGATCG GATCCGGACT GGCCGACGGT GGCGGCGTCG TCCTGTTGTA CGAATCCCCG GACCTCCGTG AGTGGGAGTA CCGTGGCCCG ATCCTTTCGG GGGACCGCGA CACGCCACAG GAGACCGTCT GGGAGTGCCC GGAGCTACTC GACCTCGGGG AGAAGCAGCT GTTGCACGTC TCGAACTACG AGGACGTGGT CTACTTCCTC GGCCAGTTCG ACGGCGCGAC CTTCGAGCCG GAGCGACGCG GGACGCTGGA CCGCGGCGAC TACTACGCCC CGCAGTCGCT CCGGGCAGAC GACGGCCGCC TGCTCACCTG GGGATGGGTC CCCGAGGCGC GCGACGTGAG CGCCCAGTGG GACGCCGGCT GGTCGGGGAC GATGTCGCTC CCGCGGGAGC TGTCCCTCGG AGACGACGGC CGCCTCCGAC AGCGACCGGC CCGGGAACTG ACCGAACTCC GTGGCGACTG CGAGTCCCGC GAGAGCGTCA CGCTCGACGA CGGCGAGTCG ATCCGGCTGG ACACCGACGG ACGCTCGTTC GAACTCGCCA CCCGCCTCTC GCTGGTCGAC GCCGACGCCG TCGAGGTGAC GCTGCTCGGG 
ACGGCGACGG AGAACGAGCG CACGACGCTG CGATACACGA GAGACGACGT GCTCGAAGTC GATCGCAGTC GCTCCAGCAG GGACCCGCGC GCGACGAGCG ACAGTCAGCA CGTGCCCGTC ACACCGTACG ACGAGCCCCT GGACCTGCGC GTGTTCGTCG ACGGGTCGGT CGTCGAACTG TTCGCCAACG AGCGCCACTG CCTGACGAGC CGCGTCTATC CGGCCGGCGA CGACGCGACC GGCGTCGACC TCGCGACCGT CGGCGGCCGC GCGCGACTGC CGACAGTCGA AGCCTGGGAG CTGGCGTCGA TCTGGTAG
|
Protein sequence | MTDTAVRIAS LHDGGRSVEQ AAADEWAETA GLDLTPVAVS DLVDGDRSLA AFDTCWWHAD EPPKLGSTTD LGGVLRAFVR SGGGLVLSLA ALTAVNDLGI DPVTPDAVGP DPDGGPTGPL IKTAYDDHPL FEPLATRRPL TRGGTDATLA RYDSILPERG EVLASTVSAD HDVPGLVTVV GWSLGDGSVV GLGDGLVFAA EGGDHDQRDR LARAVLTNAA DSAIPERPRT GGEIAALRDR FEDDPHRPQF HLSTPANWLN DPNGVVQWNG RYHVFYQYNP GGPYHDTIHW GHAVSDDLLH WEDEPVALAP SPDGPDRDGC WSGCAVDDDG AATLVYTGGR GRDQLPCLAT ADDPSLRRWE KATDNPIIEA PPTEPDLLST EEWNGEFRDH CVWQEGETWY QLIGSGLADG GGVVLLYESP DLREWEYRGP ILSGDRDTPQ ETVWECPELL DLGEKQLLHV SNYEDVVYFL GQFDGATFEP ERRGTLDRGD YYAPQSLRAD DGRLLTWGWV PEARDVSAQW DAGWSGTMSL PRELSLGDDG RLRQRPAREL TELRGDCESR ESVTLDDGES IRLDTDGRSF ELATRLSLVD ADAVEVTLLG TATENERTTL RYTRDDVLEV DRSRSSRDPR ATSDSQHVPV TPYDEPLDLR VFVDGSVVEL FANERHCLTS RVYPAGDDAT GVDLATVGGR ARLPTVEAWE LASIW
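The Start bp, End bp, Gene Length, and Protein Length fields under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the source record. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence shown does end in TAG). The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1847941
END_BP = 1850058
GENE_LENGTH_BP = 2118
PROTEIN_LENGTH_AA = 705


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Run as a script, this prints `2118 705`, which matches the Gene Length and Protein Length fields listed above.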