Gene Hmuk_2329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2329 
Symbol 
ID8411870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2244963 
End bp2246372 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content69% 
IMG OID645020672 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_003178148 
Protein GI257388375 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.485013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGCA ATCGAGCCAG CAGCAGGGGG ATCAGTCGCC GTCGGTTTCT CCGACGGAGC 
GGTGCGATCG TCGGCGTCGG CCTCGTCGCC GGTTGTACGG GATCGAATCC GAACGACGGC
ACTCGAACGA CGAGCGAGGG CGGTACCGAC ACGCCGGCCG GCCCCGCGGC GCTCGGATAC
GATCTCTCGG TGACCCACGA GCTCACAGAG TGGGACCGCT ACGATCCCGA CTGGGAACCG
CCGAGCGACT CGCCCCGAGA GGAGTACACC GCGGAGGTGC TGGCGACCGG GCTCGAAGTG
CCCTGGGACC TCTCGTTTGC GGGCGAAGAC ACGCTGTTCG TGACAGAGCG GACCGGCCGT
ATCACCGAGT TCGACAGCGG GACGCTGCGG ACGGTCGCCG AGCCGTCCGA CATCATCGAC
GCAGCCGCGA TCGAGGCGGG CTCCGACGAG AGTCGGTGGC GGCTCACCGG CGGAGAAGGC
GGTCTGCTCG GCGTCGCCGC CCACCCGTCG TATCCGGATC CGCCGGTCGT CTACGTCTAC
TACACGGCCG AGACGAGCGA GGGGAAGCGC AATCGGGTGG TCGCCTTCGA CGCCAGCGCA
CCAGCTCCCG ACGAGACTGT CGTCCCGGTC GTCGACGAGA TCCCGGCCGA CACCTACCAC
AACGGCGGTC GGATCGCGTT CGGACCGGCC GACTACCTGT GGATTACGAC CGGCGACGCC
GATCCCGGAC TCGAACACAC GGAACAGACG AGAGACCCCG CCTCCCTGGC CGGGAAAGTC
CTCCGCGTTC GGCCCGACGG GTCGCCACCA CCGGACAACC CCGACAGCAC GTCGGACGCC
GACCCGCGCG TGTTCACCTA CGGCCACCGG AACCCCCAGG GTATCGACTG GCTCCCGGAC
GGGACCCCGA TAATCACCGA GCACGGACCC GGCGCTGGAG ACGAGCTCAA CGTCCTCAGG
CCGGGCGTCG ACTACGGCTG GCCAGTGGTC CGGAACAGCG GCGATCACGA GCGATACCCA
GAGACCGAGT TCCAGTCGCC GGTCGCGGAC GCCTCGTCGT GGGCCCCGGC CGGGGGCGTG
TTCTACACCG GCGAGAGCGT TCCGAGCCTG CGGAACCGGT TCGTGTTCGG TGGCCTGATC
AGTCAGCGAG TCACGGCCGC GACGATCACG CCCGCCGACG GGCCACAGCC CGCAGACGGA
CACGAGCGAC GCCACGATGC CTCGTGGTAC GACGCCGACT ACCGGGCCGG GACCAGCGGG
CTCCTGAGCG AGGAACTCGG CCGTGTCCGC CACGTCGAAC AGGGACCGGA GGGCGATCTC
TACGCGATCA CGTCGAACCG TGACGGCCGC GCGAACGGAC CGTTCCCGCG CGACGACGAC
GATCGACTGG TCCGGATCCG TCCGGCCTGA
 
Protein sequence
MGGNRASSRG ISRRRFLRRS GAIVGVGLVA GCTGSNPNDG TRTTSEGGTD TPAGPAALGY 
DLSVTHELTE WDRYDPDWEP PSDSPREEYT AEVLATGLEV PWDLSFAGED TLFVTERTGR
ITEFDSGTLR TVAEPSDIID AAAIEAGSDE SRWRLTGGEG GLLGVAAHPS YPDPPVVYVY
YTAETSEGKR NRVVAFDASA PAPDETVVPV VDEIPADTYH NGGRIAFGPA DYLWITTGDA
DPGLEHTEQT RDPASLAGKV LRVRPDGSPP PDNPDSTSDA DPRVFTYGHR NPQGIDWLPD
GTPIITEHGP GAGDELNVLR PGVDYGWPVV RNSGDHERYP ETEFQSPVAD ASSWAPAGGV
FYTGESVPSL RNRFVFGGLI SQRVTAATIT PADGPQPADG HERRHDASWY DADYRAGTSG
LLSEELGRVR HVEQGPEGDL YAITSNRDGR ANGPFPRDDD DRLVRIRPA