Gene Hlac_1768 details

Gene Information

Locus tag: Hlac_1768
Symbol:
ID: 7399640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1785672
End bp: 1786970
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 643708833
Product: major facilitator superfamily MFS_1
Protein accession: YP_002566417
Protein GI: 222480180
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
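
The coordinate and length fields in this record are consistent with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1785672, 1786970)
print(length)                  # 1299
print(protein_length(length))  # 432
```

Both values match the Gene Length and Protein Length fields listed for Hlac_1768.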


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0114747
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAACG CGGACGCAGG CAACGGGGCA AAGCGATCGC AGTTCTGGGC GCTCTATCTC 
ACGCGCTTCG CTGAGGGGTT CGGCTTCATC ACGCTCATCA CCCTGTTGGG GACGTACATC
AACACGCTCG ACCCGCAGGC GACGACGGTC CTCGGCGTGT CTATCTCGGC CGGACTCATC
ATCGGGATGT ACACCACGGG ATTCACCCTC GCGCAGACGG TTGCCGTGGT GCCGCTGGCG
TGGGCCGGCG ACCGGTTCGA CAAGCGAACC GTCCTGCTGG GCGTACTCGC GATCGGTGCC
GGCGTCTACG CGCTGTTCCC GCTCGTCGAC TCCTCCGCCT CGTTCATCCT CGTCCGCGCC
CTGCAGGGAC TCGTGGTCAC CGGTGCGGGG CTGATGACGC TGTCGCTGGT CGGACAGATC
GCGGATGTCG GGACGCGCGC CGACAAGATT GGCAAGGCCA ACGCCGCCTC CTTCGCAGCG
TCCATCGTCG GGTCGCTGTC GGCCGGAGCG ATATACGACG CGGTCGGCTT CGATCCCATC
TTCATGATCA TCGCGTTGCT GATGGTCGCC GCGTGGGTCA TCACGTGGCT CGTCCTCGAC
GACGACGACA CTCGCGTCGA GGGCTTCCCC TTCTCGGATC TCGCTGTGAA CCGGCGGATC
CTCACGATGA CGAGCTTCCG CGCCCAGTAC GCCTTCGCCG TGACGATGGT GCGGACGTGG
GTCCCGATCT ACGTCAGCAC GGAGATGGCC GCGGGCGGCC TCGGCGTCAC CGGCATCGCC
ATCGGGGTCA CCGTCACCGC CGAGAAGGCG ACCAACATGG TCGGCCAGCT GTTCACCGGT
CGTCTCTCGG ACGACTACGG TCGGTCCCTG TTCGTCTTCG CCGGCGGCGG CGCCTACGGG
CTGATCGCGA TGGCGATCCC GTTCTCGGCC GTCATCGGAA CCGCGCTCGG AGCCGGGGTG
ACGCTCCCGA TTCTGGGCGA ACTGCCGGCC GCGTACCTGC CGCTCGTCGC CTTCTCGGGA
CTGCTCGGTA TCGCCGACTC CTTCCGTGAG CCGGCCAGTA TGGCGCTGTT CGCGGACGAG
GGGACCGACG ACGGCGGGGT CGCCTCCAGC TTCGGCATCC GCGAACTCGT CTGGCGGCCG
GGCTCGGTGG CGGGACCGCT CATCGCCGGC TGGCTGATGA TCGAGGTGAA CATGGCGTCC
GTCTTCTACG TCGGCGGCGC GTTCGCGATC ACCGGCGTCC TCGCGTTCCT CGCGATCCTC
GCGCACGACC ACGGCCGCGC GGCGCTGACG GCGTGGTAG
 
Protein sequence
MSNADAGNGA KRSQFWALYL TRFAEGFGFI TLITLLGTYI NTLDPQATTV LGVSISAGLI 
IGMYTTGFTL AQTVAVVPLA WAGDRFDKRT VLLGVLAIGA GVYALFPLVD SSASFILVRA
LQGLVVTGAG LMTLSLVGQI ADVGTRADKI GKANAASFAA SIVGSLSAGA IYDAVGFDPI
FMIIALLMVA AWVITWLVLD DDDTRVEGFP FSDLAVNRRI LTMTSFRAQY AFAVTMVRTW
VPIYVSTEMA AGGLGVTGIA IGVTVTAEKA TNMVGQLFTG RLSDDYGRSL FVFAGGGAYG
LIAMAIPFSA VIGTALGAGV TLPILGELPA AYLPLVAFSG LLGIADSFRE PASMALFADE
GTDDGGVASS FGIRELVWRP GSVAGPLIAG WLMIEVNMAS VFYVGGAFAI TGVLAFLAIL
AHDHGRAALT AW
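
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The sketch below is illustrative only: the helper names are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares), and the first codon is assumed to be a valid start codon and is therefore reported as M, as with the GTG start here. Whitespace is stripped so the 10-base blocks can be pasted in directly.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the sequence begins at a start codon, so the first
    residue is emitted as M regardless of which codon it is.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "GTGAGCAACG CGGACGCAGG CAACGGGGCA"
print(translate(prefix))  # MSNADAGNGA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the complete protein and the overall GC content to be compared with the values in the record.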