Gene Hmuk_3161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3161 
Symbol 
ID8412714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3051531 
End bp3052859 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID645021508 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003178973 
Protein GI257389200 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.586334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACTGG ACACGGACGG TCGCGTCCTG ACACTGGCGT TCGCGCGGAT GGCCGACGCG 
CTGGGCAACT CCTTTCTGAT CATCGTCCTG CCGCTGTACA TAGCCAGCGG CCAGATCTCG
CTGTCGGGCA TCGCCGGCAC GGAGATCCTC GGCTTCGTCC TGCGCGAGGA GACGCTGATC
GGGCTCGTGC TCTCCCTGTT CGGTCTGCTG AACAGCTTCG GCCAGCCGTT CACCGGGCGG
CTCTCCGACC GGACCGGCCG GCGGCGCGTG TTCGTCCTGA CGGGACTGGC GATCTTCGCC
GTCGGCAGCG CGACCTACCC GTTCGTCACG AGCTACTGGT CGGTGCTCGG GGCACGTGCG
CTCCAGGGGA TCGGCGCGGC CTTTACCGTG CCGGCCACGG TCGCGCTGGT CAACGACTAC
GCGGCCAGCG ACCGCGAACG GGGCGGCAAC TTCGGCGTGT TCAACACCTT CCGGCTGATC
GGCTTCGGCT TCGGGCCGAT CGTCGCCGGA GTCGTCATCA CGGGCGGGCT GGCCGCCGAG
ACCGTCGTCA GCTACGCGCT CCCGGCCTGG CTCGGCCCCC TGGCCGGCCT CAGGTTCTCC
GGGTTCGTCG CCGCCTTCGC CGTCGCCGTC CTCGGAGCGG TCGTCAGTTT CGTGCTCGTC
GTCGCTCTGA TCGCGGACCC GCCGAAAGTC GTCGGCGGGG CGGGCAAAGA CCTCTCCATC
GCGGTCCGCG ACCGCGACGG AAACGGGCTC GATCCCGTCT TCGTCCTCGG CGTCGGGACC
TTCTTCATGG CCACGACGAT CGCGCTGTTC GCCACCCTGG AGGGGCCGAT CCGCGCGCGA
CTGGACGAGA CGACGTTCCT CTTTTCGGTG CAGTTCGCCG CGGTCGTCAT CGCCAACGTC
GTCTTCCAGA TCCCCATCGG GCGCGCCTCG GACGTGTACG GTCGCCGCCC GTTCATCATC
GCGGGCTTCG TCGTCCTGAT CCCCGCCGTG TTCGCGCAAG GCGTCGTCAC GGGACCGTGG
ACGATGCTCG CGGCCAGACT GCTCCAGGGC GTCGCCGTCG CGCTCGTGTT CGCGCCGTCG
CTCGCGCTGG CTGGCGATCT CGCCGGGGAC CGCGGGTCGG GGACGACGCT GTCGGTGCTG
ACGATGGCGT TCGGACTCGG CGTCGCACTC GGGCCACTCG CTTCCGGCGT GCTGTACAAC
CTCGGCGGTC TCGTCGCGCC GTTTAGCTTC GGTGCCGTCC TGGCCGTGTT CGCGCTCCTC
TTGACCTACT TCGAAGTCGA GGACACGCTG GAGACCGGTC GGGCCAGTGA GCCAGTGCCA
CAGGAGTGA
 
Protein sequence
MVLDTDGRVL TLAFARMADA LGNSFLIIVL PLYIASGQIS LSGIAGTEIL GFVLREETLI 
GLVLSLFGLL NSFGQPFTGR LSDRTGRRRV FVLTGLAIFA VGSATYPFVT SYWSVLGARA
LQGIGAAFTV PATVALVNDY AASDRERGGN FGVFNTFRLI GFGFGPIVAG VVITGGLAAE
TVVSYALPAW LGPLAGLRFS GFVAAFAVAV LGAVVSFVLV VALIADPPKV VGGAGKDLSI
AVRDRDGNGL DPVFVLGVGT FFMATTIALF ATLEGPIRAR LDETTFLFSV QFAAVVIANV
VFQIPIGRAS DVYGRRPFII AGFVVLIPAV FAQGVVTGPW TMLAARLLQG VAVALVFAPS
LALAGDLAGD RGSGTTLSVL TMAFGLGVAL GPLASGVLYN LGGLVAPFSF GAVLAVFALL
LTYFEVEDTL ETGRASEPVP QE