Gene Hlac_1455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1455 
Symbol 
ID7400282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1462308 
End bp1463492 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content70% 
IMG OID643708516 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002566113 
Protein GI222479876 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCACGGA GCCGGCTGTT TGGATCACTG TGTGCGATCG TCTTTCTCGT CAACTTCGCG 
CGGGTGGTGT TCGCGCCGCT CATCGGCGAG TTCATCAGCG AGTTCGCGAT CGGCGAGGGG
ACGGCGGGAC TGATCGTCAC GCTCGCGTGG CTCGGCTCGG CCGCGCCGCG GCTCCCGGCG
GGCTGGGCGC TCACGCGCTT CTCCCGGCAG TACGTGGTTC TCGTCTCCGG AGCGATGCTG
ACAGTCGGTG CGCTCGGCGT CGCGCTGGCG CCGGGCGTCC CGACGCTGAT GGCCGCCGCG
TTCGCGATCG GCTTGGCGTC CGGCGTCTAC TTCGTCGCCG CCAACCCGTT CATCGCGGAG
CTGTTCCCGA CTCGGGTCGG GCGCGTGATG GGTGTCCACG GAATGGCGAG CCAGCTCGCC
GCAGTCGCCG CCGCGCCGGT CGTCACGGTC GCGCTCTGGT ACGACTGGCG GCTCGCCTTC
TACGGGCTCG CGCTCGCCTC GGCCGCCTCC ACCGTCGTCT TCGTCGCCTT GGCCCGCCGG
ACGGACCTCC CGGACGCAGG CGCGGGCGAC ACCGATTTCC TCGCCGGCGC GCTCTCGGAG
TGGAAGCTGA TTCTGGCGGG CGTCGTGTTG ATGGGACTCA CGAGCTTCGT TTGGCAAGGA
CTCTTCAACT TCTACGAGCT GTACATGGTC GATAAGGGGC TCCCGCCCGC GGCGGCGCGG
AACCTGCTGA CGGTGATCTT CGCCGCGGGC GTCCCGGCGT TTCTCATCTC CGGCGACCTC
GCCGACCGGC TCCCGCACGT CCCATACCTA CTCGGGATCG TGTCTGTGTT CCTCGTCGGC
GTCGTCCTCG TGGTCGTTTC CTCGGGGCTG GCCGCTGTCG TCGCCGCGAG CGTCGTCGTC
GGCTTCGCCA TCCACATGCT GTTTCCCGCC GGCGACACCT ACCTGCTCGC GTCGCTGCCG
GACGGGTCCC GAGCGTCGGC GTACGCCGTC TTCTCCGCCG GGATGATGAC GACGCAGGCG
GCCGGCTCGT GGGTCGTCGG CGAGGCGATA GAGGCCGGCG CCGGCTACGA CGCGGTTTTT
CTCTCCCTCG CCGGGGGACT CGCCCTCGTC GTCGTCGCCT ACGCGGTGCT TGAGTACGCC
GGGCGCGTTC CGGGCGGCGC CGCGGGCACG GAGCACGCGG CCTGA
 
Protein sequence
MSRSRLFGSL CAIVFLVNFA RVVFAPLIGE FISEFAIGEG TAGLIVTLAW LGSAAPRLPA 
GWALTRFSRQ YVVLVSGAML TVGALGVALA PGVPTLMAAA FAIGLASGVY FVAANPFIAE
LFPTRVGRVM GVHGMASQLA AVAAAPVVTV ALWYDWRLAF YGLALASAAS TVVFVALARR
TDLPDAGAGD TDFLAGALSE WKLILAGVVL MGLTSFVWQG LFNFYELYMV DKGLPPAAAR
NLLTVIFAAG VPAFLISGDL ADRLPHVPYL LGIVSVFLVG VVLVVVSSGL AAVVAASVVV
GFAIHMLFPA GDTYLLASLP DGSRASAYAV FSAGMMTTQA AGSWVVGEAI EAGAGYDAVF
LSLAGGLALV VVAYAVLEYA GRVPGGAAGT EHAA