Gene Hmuk_1938 details


Gene Information

Locus tag: Hmuk_1938
Symbol:
ID: 8411466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1846461
End bp: 1847771
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 645020269
Product: Levansucrase
Protein accession: YP_003177758
Protein GI: 257387985
COG category:
COG ID:
TIGRFAM ID:
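
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record, though it reproduces the listed gene length) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1846461, 1847771)
print(length, protein_length(length))  # prints "1311 436"
```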


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGG ACAGTCCTGG GACCGCCGTT CCGGGCCACG GCGCGCGCTC GGGGTGGTCG 
CGCGAACAGG CGAGTCGCAT CGAGCGAACC GACGACACGA CGGCACCGAT CGTCTACCCG
CCCGCGACCG ACCAGGCACC CGACGTTCAC GTCTGGGACA CCTGGCTGTT GCGCGAGCGC
GACGGCACGG TCGCCACCGT CGACGGCTAC CGTGTCACGT TCTCGCTGAC TGCGCCGGCC
GATCTCTTGC CGGGCAAGCG CCACGACGTG GCGACGATCC GGTACTTCTA CTCGGCCGAC
GGCCGGACGT GGCAGCCCGG TGGCGTCGTC TTCGAGGAGC CGCTGGGCCA GCGCACCTGG
GCCGGCTCCG CGCTGTACGA CGACGGCGAC ATCTACCTCT TCTACACTGC GGCCGGGGAG
CGCGGGGCGG ACGAACTCAC CTACACCCAG CGCATCGTCG CCGCGTCCGG GGGGACACCG
CGGACCGACG GCGAGTTCGC CATCGAGGGG CCCTGGACCC ACCACGAACT GCTCCGCCCG
GATGGCGACC GCTACGAGCG CCAGGACCAG TCTCGCGGCA TGACCTACAC CTTCCGAGAT
CCGTGGTTCT TCGAGGACCC TGCGACCGGC GAGACGCACC TCCTGTTCGA GGCCAACACG
CCGGTGCCGG CGGCGAGCGA CGCCTGCGGC GGCGACCCGG ATCTACAGTC GTTCAACGGC
AGCGTCGGCC TCGCGCACTC GCCGACGGGT GACCCCCTCT CCTGGGAGCT GTGTGACCCG
CTGCTGGATT CGGTCTGTGT CAACCAAGAG CTCGAACGTC CCCACGTCGT CCCCCGGGAC
GGCCGCTACT ACCTCTTCGT CTCCAGCCAC GACCACACCT TCGCGCCGGG GCTCGACGGC
TACGACGCGC TGTATGGCTT CGTCGCCGAC TCCCTGCGTG GCGACTACGT CCCGCTCAAC
GACTCCGGAC TGGTCGTGAC CAACCCCGCG AACGCCCCCT TCCAGGCGTA CTCGTGGATG
GTGTTCCCGC ACCGGGAGGA AGTGCTGGTC CAGAGCTTCT TCAACTACTA CGACTTCGAG
GCCGACTCGA TGGATCGGGT CGCAGACCTG CCCGAGTCCG AGCAGCTGCG ACGCTTCGGC
GGAACGCTCG CGCCGACGCT GCGCCTCCGG GTCGAGGGGA CCCACACGGA GATCCTCGGG
ACGCTCGACC ACTGGCAGAT CCCGCTGCCC GACGAGGTCC TGCCGCCGAC GGACCGAGAG
TACTTCGCGG GCGAGAGCGG CGACGGCGGA TCGTACTATA GTAGCCATTG A
 
Protein sequence
MSKDSPGTAV PGHGARSGWS REQASRIERT DDTTAPIVYP PATDQAPDVH VWDTWLLRER 
DGTVATVDGY RVTFSLTAPA DLLPGKRHDV ATIRYFYSAD GRTWQPGGVV FEEPLGQRTW
AGSALYDDGD IYLFYTAAGE RGADELTYTQ RIVAASGGTP RTDGEFAIEG PWTHHELLRP
DGDRYERQDQ SRGMTYTFRD PWFFEDPATG ETHLLFEANT PVPAASDACG GDPDLQSFNG
SVGLAHSPTG DPLSWELCDP LLDSVCVNQE LERPHVVPRD GRYYLFVSSH DHTFAPGLDG
YDALYGFVAD SLRGDYVPLN DSGLVVTNPA NAPFQAYSWM VFPHREEVLV QSFFNYYDFE
ADSMDRVADL PESEQLRRFG GTLAPTLRLR VEGTHTEILG TLDHWQIPLP DEVLPPTDRE
YFAGESGDGG SYYSSH
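
The gene and protein sequences above are related by codon translation, and the GC content field is a base count over the gene sequence. The sketch below is one minimal way to check both, not the method used to produce this record. It assumes the sequence is read in frame from its first base, uses the standard codon assignments (which translation table 11 shares with the standard code; the two differ only in permitted start codons), and strips the 10-base spacing used in the listing. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (ordered T, C, A, G at every position) to its one-letter residue.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
fragment = clean("ATGAGCAAGG ACAGTCCTGG GACCGCCGTT")
print(translate(fragment))  # prints "MSKDSPGTAV"
```

Passing the full gene sequence through `clean` and then `translate` or `gc_percent` allows a comparison against the protein sequence and the GC content listed in the record.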