Gene Hlac_1984 details

Gene Information

Locus tag: Hlac_1984
Symbol:
ID: 7402003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1979721
End bp: 1981097
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 71%
IMG OID: 643709055
Product: major facilitator superfamily MFS_1
Protein accession: YP_002566632
Protein GI: 222480395
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
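
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1979721, 1981097)
print(length)                     # 1377
print(protein_length_aa(length))  # 458
```

Both results match the Gene Length and Protein Length fields of the record.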


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.179831
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACTGC TCGGGATCGA CCGGCCGCCG ACCGTGTTGC TCGCGGTGAT CGCCAGCACC 
TTTTTCGTCG GCTTTGGCGG CGGTGTCGTC TTCCCAATCC TCCCGAACCT CGGCGCGGTG
CTCGGCATCT CGGCGTTCAT GGTCGGCGTG ATCCTCTCCG CGAACCGGTG GGTGCGCCTC
GTCGCGAACG CGCCCGCCGG CGCCTTAGTC GACCGGTACG GAACGCGTAA ACCGTTCGTC
GCCGGGCTGT TCGTCGAGGG CGTCGCCACC CTCGGATACG TCGTTGCGCT CGCGATGCCG
CCCGCCGAGT CGCTCCGCCC GATCGCGGCG TCGCTACCGA CGTTTGCGGC CGGTCCGCTG
GTCGTCGGCG CGGAGCAGTG GTTCACCCCG ATCGCGATCG TCGTCGCGCC CGAGACGTGG
TTCCTGCTCG CGCGCATTCT CTGGGGGTTC GGCTCCGCGG CGGTGTTCGC GACGGCCTAC
ACCATCGCCG CCGACCTCTC CGACAGCGGC TCGCGGGGGA CGAATATGGG CGTCGTCCGC
GGCGGGATCA CGATGGGGTT CCCAGCGGGG CTCGTGCTTG GCGGCGTCGT CTCCGCGATC
GCGGGCAACA TCGCCGCCTT CTCCGTCGCC GCCGCGTTCG CGCTCACCGC CAGCGTCGTC
GCATACCGCT ACGTCCCGGA GACGCACGTC ACGGGCGATC GCTCCGGGGA TTCGATCAAG
CCGTGGGATA TCGACACCGC CGTCCCCGCC GTGACCGTCG GGCTGGTCAA CTTCGGGCTG
ATGTTCGCGT ACATCGGCGC GCTGTTCTCC ACGCTCGTGT TGTTCCTCGG CGCAAACGAC
ATCTCCCTGT TGGGGCTCGC CCCGCAGGGG ACCTCCGGGC TGTTCATGGC CGGTACGGTC
CTCTCGGCCG CGTTCTTCAT GCTCGTCGGC GGGCGGATCT CGGACACTCG TGACTCCCGG
ACGCCGATAC TGCTGACGTT CCTCGTGGTC TCGTTCGTCG GGTTCCTGCT GCTCGCCCGG
GCCGAATCGG TGGTCTCACT CGGACTCGCC TGCATCTTCA TCGGCGCCGG ACAGGGCGGG
ACGAGCGGCC CGATGATGGC CCTGCTCGCG GACCTGACCC CCGACGAGCG GATGGGTCGG
GCCTCGGGGA CGAACAACGT CCTCGGCGAC GTCGGCGGCG GCCTCGGCCC GATGGTGTCG
CTCCCGCTGA TCGAGTCGGT CGGCTTCGCG CCCATCTACG CCGCCTGCGC GATCCTCCCG
CTCGCCGCGG GCGCAGCGCT CCTCGTTGGC GTCCGCCGAG AGACCGGGAC GTTCCTTCCC
GGACGCACCG CGGGCGAGAC GGACCCGGGC GAGGGGTCGC CCCCCACGGA GCCGTAG
 
Protein sequence
MGLLGIDRPP TVLLAVIAST FFVGFGGGVV FPILPNLGAV LGISAFMVGV ILSANRWVRL 
VANAPAGALV DRYGTRKPFV AGLFVEGVAT LGYVVALAMP PAESLRPIAA SLPTFAAGPL
VVGAEQWFTP IAIVVAPETW FLLARILWGF GSAAVFATAY TIAADLSDSG SRGTNMGVVR
GGITMGFPAG LVLGGVVSAI AGNIAAFSVA AAFALTASVV AYRYVPETHV TGDRSGDSIK
PWDIDTAVPA VTVGLVNFGL MFAYIGALFS TLVLFLGAND ISLLGLAPQG TSGLFMAGTV
LSAAFFMLVG GRISDTRDSR TPILLTFLVV SFVGFLLLAR AESVVSLGLA CIFIGAGQGG
TSGPMMALLA DLTPDERMGR ASGTNNVLGD VGGGLGPMVS LPLIESVGFA PIYAACAILP
LAAGAALLVG VRRETGTFLP GRTAGETDPG EGSPPTEP
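
The sequences above are printed in space-separated groups of ten, so they need to be cleaned before any downstream use. The following is a minimal sketch in Python, assuming the blocks are copied as plain text. The helper names `clean_sequence` and `gc_fraction` are illustrative, and the example runs only on the first line of the gene sequence, so its GC value is not the whole-gene figure reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used here as a short illustration.
first_line = "GTGGGACTGC TCGGGATCGA CCGGCCGCCG ACCGTGTTGC TCGCGGTGAT CGCCAGCACC"
dna = clean_sequence(first_line)
print(len(dna))                   # 60
print(f"{gc_fraction(dna):.0%}")  # GC content of this 60-base excerpt only
```

Applying the same two helpers to the full gene sequence block would give a value to compare against the GC content field.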