Gene Hmuk_2342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2342 
Symbol 
ID8411883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2255030 
End bp2256238 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content70% 
IMG OID645020685 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_003178161 
Protein GI257388388 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00995217 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG AGACGACAGC AGTCACGCGG CGTCGTTCGA TGCTGGCCAC GATCGCTACC 
GGCCTCGGCG GCCTCGTGGC CGGCTGTGAC GGGCGGTCGG TCGGACAGCG ATCACCGACC
GAGACCGACG ACGACGCCGG GGTCGCCGTC GATCGCGTCG CGGAGTCGTT CACGGCTCCG
TGGAGCGTCA CACCGTTGCC CTCGGGCGAT CGGTTGCTCG TGACCGAACG GCCGGGTCGG
CTCTGGCTGG TGTCGCTGCC GGACGGCGAG AAGAGCGAAC TGACCGGCGT GCCGTCGGTC
CACGCTCGGG GGCAGGGCGG CTTGCTCGAC AGCGAACTCC ACCCCGAGTT CGAGGAGACG
CCGTGGGTGT ACCTGACCTA CGCCAGCGCG AACGACGGCG GAGAGTCGAC GACGGCGCTG
GCTCGCGGAC GGCTGAACGT CGACAGCGGC TCGCTCGAAG CGGTCGAGCG GCTCCACGTC
GCCGAGCCCT TCGTCGAGTC GAACGGCCAC TTCGGCTCCC GGGTCGCTTT CGGCCGGGAC
GGCACAGTCT ACCAGACGGT CGGTGATCGA CAGTTCAAGG ACTTCGGCCC CCAGCACGTC
GCCCAGGACC TGACGACGGA ACTCGGTGTG ACGCTGCGCC TCGAACCCGA CGGCTCGGTC
CCGGACGACA ATCCGTTCGT CGACGAGCCG GCGGCCGCCG ACGCCGTCTA CAGCTACGGC
CACCGCAACG CCCAGGGCAT GGCCGTCCAT CCCGACACCG GCGCGATCTG GCAGAGCGAG
TTCGGGGAGC AGGACGGCGA CGAGATCAAC GTCCTCCGGC CCGGGGGCAA CTACGGGTGG
CCCGTCGCCG ACGAAGGCTG TACGTACGGT TCGGGCGATC CCATCGGCGT GGCACACGCG
GACCGCGACG ACGTGGTCGG GCCGGTCTAC TCCTGGCCCT GCGGGAGCGG TGGCTTCCCA
CCCAGCGGAA TGACGTTCTA CACCGGCGCG GCGTTCCCGG AGTGGGACGG CGATCTGTTC
GTCGGCGGGC TGGCCTCGCA GTCTCTCGCT CGCTTCACCG TCGACGGCAC CGACGTGACC
GAGGCCGAGC AGTTGCTGTC GGGACGCGGG TGGCGAATCA GAGACGTGGC ACAGGGAGTC
GAGGACGGAC ACCTCTACGT GGCCGTCGAC GCCGACGACG CGCCGATCGT GCGCCTCTCG
CCTGCGTGA
 
Protein sequence
MTDETTAVTR RRSMLATIAT GLGGLVAGCD GRSVGQRSPT ETDDDAGVAV DRVAESFTAP 
WSVTPLPSGD RLLVTERPGR LWLVSLPDGE KSELTGVPSV HARGQGGLLD SELHPEFEET
PWVYLTYASA NDGGESTTAL ARGRLNVDSG SLEAVERLHV AEPFVESNGH FGSRVAFGRD
GTVYQTVGDR QFKDFGPQHV AQDLTTELGV TLRLEPDGSV PDDNPFVDEP AAADAVYSYG
HRNAQGMAVH PDTGAIWQSE FGEQDGDEIN VLRPGGNYGW PVADEGCTYG SGDPIGVAHA
DRDDVVGPVY SWPCGSGGFP PSGMTFYTGA AFPEWDGDLF VGGLASQSLA RFTVDGTDVT
EAEQLLSGRG WRIRDVAQGV EDGHLYVAVD ADDAPIVRLS PA