Gene Hlac_0358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0358 
Symbol 
ID7399751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp377639 
End bp378910 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content67% 
IMG OID643707423 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002565032 
Protein GI222478795 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.359815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCTG CCGAGCAACG GATCGTGGCG TTCACCGCCG GATCGCACGG GCTCGTCCAC 
ACCTACGAGC TCTCGATCCC GATTCTGTTG ACCGTGTGGG TCGGGGAGTT CTCGACGACG
GCCGCGGTTC TCGGACTCGT CGTGACTGTC GGCTACGGAC TGTTCGGCGT CGGCGCACTT
CCGGGCGGCA TCCTCGTCGA CCGATTCGGG TCCAAGCCGC TCATTCTCGC CTGTCTCGGC
GGAATGGCGG GATCGTTTCT CCTCGTGAGC CTCGCGCCCA ACCTCCTCAC GCTCGCGCTC
GCGATCGCCG TGTGGGGGGT CACTGCGAGC GTCTATCACC CGGCGGGGCT GTCGCTGCTC
TCGAAGTCGG TCGATCAGCG CGGGACTGCG CTCGGCTACC ACGGGATCGG GGGGAACCTC
GGGATCGCGC TCGGCCCGCT GGCGACCGCG CTCCTCCTCT TAGCCTTCGA CTGGCGGATC
GTCACGGCCG CGCTCACGGT ACCCGCGGCC GTGGTCGCCG CCTACGGTCT CACCGTCGAT
ATCGACGACG CGCTTCCTGA CACTGAGAAC GCGGTTTCCG ATGGCGACAT CGACGGCGAC
GGAGGGAGCG GCGCGAACGG AGCGGCCTCG CTGTCGACGA TCGCCGACGA CACGCGCGTG
CTGCTCGCCG GCGGCTTCCT GATCGTGTTC GTCTTCGTCA CGTTCAGCGG ACTCTACTAC
CGCACGTTTC TCACGTTCCT GCCGGATCTC CTCGGCGACG TACTCGGCGG GCTGATCGAT
ATCCAACTCA TCGACCCTGA AAGCCCGTAC GCCGAGGAGT TCGACGTGGC ACGATATCTG
TACGTCGCCG TTCTCATGGT CGGCGTACTC GGGCAGTACC TCGGCGGGCG GATCGCCGAT
CGCGTGCCGC CGGAGCGCGC GCTGATGGTT CTGATGGGAA TATTGACGGT CCTCGCGCTG
TTGTTCGTCC CCGCCAGCGA GACCATCGTG ACGTTCATCG CGGTCTCACT GCTGCTCGGC
GTCGTGCTGT TCACGGTCCA GCCACTCTCG CAGGCGACGG TCGCGGCCTA CTCTCCGAGC
GAGGCGCGCG GGATCTCGTT CGGCTACACG TACCTCGGGA TCTTCGGTTT CGGCTCGCTG
GGCGCTGCGC TCGCGGGGAC GGTCCTCACC CGAGCGGGAC CACGAGAGCT CTTTTTCGTC
CTCGCGGGTA TCGCGGCCCT CGGCGCCCTC TCCGCGGCCG GAGTCTCTCG GCTCGCGACG
CGGCAGGACT GA
 
Protein sequence
MDSAEQRIVA FTAGSHGLVH TYELSIPILL TVWVGEFSTT AAVLGLVVTV GYGLFGVGAL 
PGGILVDRFG SKPLILACLG GMAGSFLLVS LAPNLLTLAL AIAVWGVTAS VYHPAGLSLL
SKSVDQRGTA LGYHGIGGNL GIALGPLATA LLLLAFDWRI VTAALTVPAA VVAAYGLTVD
IDDALPDTEN AVSDGDIDGD GGSGANGAAS LSTIADDTRV LLAGGFLIVF VFVTFSGLYY
RTFLTFLPDL LGDVLGGLID IQLIDPESPY AEEFDVARYL YVAVLMVGVL GQYLGGRIAD
RVPPERALMV LMGILTVLAL LFVPASETIV TFIAVSLLLG VVLFTVQPLS QATVAAYSPS
EARGISFGYT YLGIFGFGSL GAALAGTVLT RAGPRELFFV LAGIAALGAL SAAGVSRLAT
RQD