Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2021 |
Symbol | |
ID | 8411552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1925180 |
End bp | 1926118 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020355 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003177841 |
Protein GI | 257388068 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0243957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGACG TGCTCATGCT GGTGACCAAC GAAGACGCAT CGTTCTACAG ACAACAGGTC GACGCGCTCG AAGACCTGGG GCTGAACTGT GATACGGTGG CGGTCCCCAA CAGCGAGGAC GGAACCCGGC GGTTGCGCCA CTACGCGCGC TTCTACGGGG CGACGCTGCG CAAGTCGCTG TCGGAGTACG ACGTCGTCCA CGCCAACTAC GGGCTGACAG CCCCGGCCGC CCTCGCCCAG CGGCGTCGTC CGGTCGTCCT CTCGCTGTGG GGGTCGGACC TGCTGGGGTC GTACGGACGG CTGTCGACGT GGTGTGCCAA TCGGTGTGAC GAAGTGATCG TCATGTCCGA CGCGATGGCG ACCGAACTCG ACCGAGACGC CCACGTCATT CCCCACGGCG TCGACATGGA GCGGTTCAGC CCCACCGCAC AGGCGAGCGC CCGCGACGAA CTCGGGTGGG ATCAAAACGC CCGGATCGTC CTGTTTCCCT ACGACCCCGA GCGCCCGATC AAGGACTTCC CACGGGCCCG CGCGGTCGTC GACGACGCGG CGTCGCGCCT CGATGGACAC GTCGCGCTCC GGACGGTGAG CGGCGTCGAC CACGGAGAGA TACCACGATA CATGAACGCA ACAGACGTGA TGCTCCTGAC CTCTCGACGC GAGGGGTCAC CGAACACGGT GAAAGAAGCA CTGGCGTGTA ACTGCCCCGT CGTCTCGACC GACGTGGGAG ACGTGTCCGA CTACGTCGCG GCCCTCGACT ACTCGGGGGT CTGTGCCGAC GACGACGAGC TCGTCGAGCG ACTCGTCGCG ACGCTCCGGG CCGCACCGGT CTCTGAGGGC CGCGAACAGG TCGCGGCGCT CGGGTTGCCG GAGATGGCCC GCAACATCGC GTCGGTGTAT CGACGAGCCG GCGCACGGGG GGTGACCGTC GATGGCTAG
|
Protein sequence | MGDVLMLVTN EDASFYRQQV DALEDLGLNC DTVAVPNSED GTRRLRHYAR FYGATLRKSL SEYDVVHANY GLTAPAALAQ RRRPVVLSLW GSDLLGSYGR LSTWCANRCD EVIVMSDAMA TELDRDAHVI PHGVDMERFS PTAQASARDE LGWDQNARIV LFPYDPERPI KDFPRARAVV DDAASRLDGH VALRTVSGVD HGEIPRYMNA TDVMLLTSRR EGSPNTVKEA LACNCPVVST DVGDVSDYVA ALDYSGVCAD DDELVERLVA TLRAAPVSEG REQVAALGLP EMARNIASVY RRAGARGVTV DG
|
| |