Gene Information |
Locus tag | Hmuk_1938 |
Symbol | |
ID | 8411466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1846461 |
End bp | 1847771 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645020269 |
Product | Levansucrase |
Protein accession | YP_003177758 |
Protein GI | 257387985 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
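The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1846461
end_bp = 1847771
reported_gene_length = 1311      # bp
reported_protein_length = 436    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1311 0 436
```

Under these assumptions the computed values agree with the reported 1311 bp and 436 aa.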
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGG ACAGTCCTGG GACCGCCGTT CCGGGCCACG GCGCGCGCTC GGGGTGGTCG CGCGAACAGG CGAGTCGCAT CGAGCGAACC GACGACACGA CGGCACCGAT CGTCTACCCG CCCGCGACCG ACCAGGCACC CGACGTTCAC GTCTGGGACA CCTGGCTGTT GCGCGAGCGC GACGGCACGG TCGCCACCGT CGACGGCTAC CGTGTCACGT TCTCGCTGAC TGCGCCGGCC GATCTCTTGC CGGGCAAGCG CCACGACGTG GCGACGATCC GGTACTTCTA CTCGGCCGAC GGCCGGACGT GGCAGCCCGG TGGCGTCGTC TTCGAGGAGC CGCTGGGCCA GCGCACCTGG GCCGGCTCCG CGCTGTACGA CGACGGCGAC ATCTACCTCT TCTACACTGC GGCCGGGGAG CGCGGGGCGG ACGAACTCAC CTACACCCAG CGCATCGTCG CCGCGTCCGG GGGGACACCG CGGACCGACG GCGAGTTCGC CATCGAGGGG CCCTGGACCC ACCACGAACT GCTCCGCCCG GATGGCGACC GCTACGAGCG CCAGGACCAG TCTCGCGGCA TGACCTACAC CTTCCGAGAT CCGTGGTTCT TCGAGGACCC TGCGACCGGC GAGACGCACC TCCTGTTCGA GGCCAACACG CCGGTGCCGG CGGCGAGCGA CGCCTGCGGC GGCGACCCGG ATCTACAGTC GTTCAACGGC AGCGTCGGCC TCGCGCACTC GCCGACGGGT GACCCCCTCT CCTGGGAGCT GTGTGACCCG CTGCTGGATT CGGTCTGTGT CAACCAAGAG CTCGAACGTC CCCACGTCGT CCCCCGGGAC GGCCGCTACT ACCTCTTCGT CTCCAGCCAC GACCACACCT TCGCGCCGGG GCTCGACGGC TACGACGCGC TGTATGGCTT CGTCGCCGAC TCCCTGCGTG GCGACTACGT CCCGCTCAAC GACTCCGGAC TGGTCGTGAC CAACCCCGCG AACGCCCCCT TCCAGGCGTA CTCGTGGATG GTGTTCCCGC ACCGGGAGGA AGTGCTGGTC CAGAGCTTCT TCAACTACTA CGACTTCGAG GCCGACTCGA TGGATCGGGT CGCAGACCTG CCCGAGTCCG AGCAGCTGCG ACGCTTCGGC GGAACGCTCG CGCCGACGCT GCGCCTCCGG GTCGAGGGGA CCCACACGGA GATCCTCGGG ACGCTCGACC ACTGGCAGAT CCCGCTGCCC GACGAGGTCC TGCCGCCGAC GGACCGAGAG TACTTCGCGG GCGAGAGCGG CGACGGCGGA TCGTACTATA GTAGCCATTG A
|
Protein sequence | MSKDSPGTAV PGHGARSGWS REQASRIERT DDTTAPIVYP PATDQAPDVH VWDTWLLRER DGTVATVDGY RVTFSLTAPA DLLPGKRHDV ATIRYFYSAD GRTWQPGGVV FEEPLGQRTW AGSALYDDGD IYLFYTAAGE RGADELTYTQ RIVAASGGTP RTDGEFAIEG PWTHHELLRP DGDRYERQDQ SRGMTYTFRD PWFFEDPATG ETHLLFEANT PVPAASDACG GDPDLQSFNG SVGLAHSPTG DPLSWELCDP LLDSVCVNQE LERPHVVPRD GRYYLFVSSH DHTFAPGLDG YDALYGFVAD SLRGDYVPLN DSGLVVTNPA NAPFQAYSWM VFPHREEVLV QSFFNYYDFE ADSMDRVADL PESEQLRRFG GTLAPTLRLR VEGTHTEILG TLDHWQIPLP DEVLPPTDRE YFAGESGDGG SYYSSH
|
| |
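The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a hedged sketch, not the database's own procedure. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in the permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first two and last three 10-base groups of the gene sequence are used as input; pasting the full sequence would allow a comparison against the complete protein sequence and the listed GC content of 69%.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases and last 21 bases of the gene sequence in this record.
gene_start = clean("ATGAGCAAGG ACAGTCCTGG")
gene_end = clean("TCGTACTATA GTAGCCATTG A")

print(translate(gene_start))  # MSKDSP
print(translate(gene_end))    # SYYSSH
```

The two translated fragments match the beginning (MSKDSP...) and end (...SYYSSH) of the listed protein sequence. The last 21 bases are in frame because the 1290 bases that precede them are a multiple of three.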