Gene Hmuk_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3220 
Symbol 
ID8412773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3108099 
End bp3109505 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content68% 
IMG OID645021565 
Productsugar transporter 
Protein accessionYP_003179030 
Protein GI257389257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.681904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACAG CACAACTCCG GCGCGTTCTG GACGGCGACG GGGGGCGGTT CATCTACCTC 
TCGGCCGCCC TCGCCGCGCT CAACGGCCTC CTGTTCGGGT TCGACACTGG AATCATCTCC
GGAGCGTTCC TCTACATCCA AGACACGTTC ACGATGTCCC CGCTCGTCGA GGGGATCGTC
GTCAGCGGTG CGATGGCGGG AGCCGCCTTC GGCGCGGCCG TCGGCGGACG ACTCGCGGAC
CGGATCGGCC GCCGCCGGCT CATCTTGCTC GGTGCCGGAG TGTTCTTCGT CGGCTCGCTC
ACCATGGCCG TCGCCCCGTC GGTGCCGGTC CTCGTCGCCG GACGGCTCAT CGACGGCGTC
GCGATCGGGT TCGCCTCGAT CGTCGGCCCG CTGTACATCT CCGAGATCTC GCCGCCGAAG
ATCCGCGGTG CGTTGACCTC GCTGAACCAG CTGATGGTCA CCGTCGGTAT CCTCGTCTCG
TACTTCGTCA ACTACGCCTT CGCCGACGCG GGGGCGTGGC GGTGGATGCT CGGTGCCGGA
ATGGTGCCGG CGGTCGTCCT CGCGATCGGG ATGGTCAAGA TGCCCGAGAG CCCTCGCTGG
CTCCTCGAAA ACGGCCGCGT GGACGAGGCT CGTGCCGTCC TCGCACGCAC CCGTGAGGAA
GGCGTCGAGG AGGAGCTGGC GGAGATCCGC TCGACCGTCG AAAAGCAGTC CGGCACCGGC
CTGCGCGACC TGCTCCAGCC GTGGATGCGC CCCGCGCTGA TCGTCGGCCT CGGGCTGGCC
GTCTTCCAGC AGATCACCGG GATCAACGCG GTCATCTACT ACGCCCCGAC CATTCTGGAA
TCGACCGGCT TCGGCAGCGT CACGTCGATC CTCGCGACGG TCGGGATCGG CGTCATCAAC
GTCGTCATGA CGGTCGTCGC CATCGCACTG ATCGACCGGG TCGGCCGACG CGTCCTCCTG
TTGGTCGGTG TCGGCGGAAT GGTCGTCACG CTGGGCATCC TCGGTGTCGT CTTCTACCTG
CCCGGCTTCG GCGGCGCGCT GGGCTGGATC GCGACGGGCA GCCTGATGCT GTTCGTCGCC
TTCTTCGCGA TCGGGCTCGG CCCGGTCTTC TGGCTACTCA TCTCCGAGAT CTACCCGCTG
GCGACTCGTG GCAGCGCGAT GGGGCTCGTC ACCGTCGCCA ACTGGGGCGC GAACCTCGCG
GTCTCGCTGG CCTTCCCCGT CCTGACCGCC AGCGTCGGGC AGCCCTCGAC GTTCTGGCTG
TTCGGACTCT GTAGCCTGGT CGCGCTCGTG TTCACCTACC GCCTCGTGCC CGAGACGAAG
GGGCGGTCCC TGGAAGCGAT CGAGGCGGAC CTCCGGAGCA ACGTCTCGTC GACGCCCGCG
GCCGCCGTCG GCGACTCGGG CGAATAG
 
Protein sequence
MSTAQLRRVL DGDGGRFIYL SAALAALNGL LFGFDTGIIS GAFLYIQDTF TMSPLVEGIV 
VSGAMAGAAF GAAVGGRLAD RIGRRRLILL GAGVFFVGSL TMAVAPSVPV LVAGRLIDGV
AIGFASIVGP LYISEISPPK IRGALTSLNQ LMVTVGILVS YFVNYAFADA GAWRWMLGAG
MVPAVVLAIG MVKMPESPRW LLENGRVDEA RAVLARTREE GVEEELAEIR STVEKQSGTG
LRDLLQPWMR PALIVGLGLA VFQQITGINA VIYYAPTILE STGFGSVTSI LATVGIGVIN
VVMTVVAIAL IDRVGRRVLL LVGVGGMVVT LGILGVVFYL PGFGGALGWI ATGSLMLFVA
FFAIGLGPVF WLLISEIYPL ATRGSAMGLV TVANWGANLA VSLAFPVLTA SVGQPSTFWL
FGLCSLVALV FTYRLVPETK GRSLEAIEAD LRSNVSSTPA AAVGDSGE