Gene Hlac_3019 details


Gene Information

Locus tag: Hlac_3019
Symbol:
ID: 7398995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 274219
End bp: 275610
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 57%
IMG OID: 643706828
Product: permease
Protein accession: YP_002564450
Protein GI: 222475929
COG category: [R] General function prediction only
COG ID: [COG0701] Predicted permeases
TIGRFAM ID:
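
The length fields in this record are consistent with each other: with 1-based inclusive coordinates, End bp minus Start bp plus 1 gives the gene length, and an unspliced CDS of that length encodes one residue per codon minus the terminal stop codon. The sketch below checks that arithmetic. The function names are hypothetical, and the inclusive-coordinate convention is an assumption inferred from the values in the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(274219, 275610)
print(length, protein_length_aa(length))  # prints "1392 463"
```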


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGTTAG CACCACTGGG ACCAGCGGTT GCCGAGGCCC TCCGGATCGG GATCGGCTTC 
CTCTGGACAG CCGCCTGGGC GATCATCATG GGATTAGTGA TCACCAGTCT CGTCCAAGTT
TACGTCTCGA AAGAACGAAT GGCTGCCGTG CTTGGCGATG CCGATATGGC CGGTGTCACG
AAGGCCACGC TGTTCGGCGC TGCCAGCAGC GGCTGTAGCT TCGGTGCGGT CGCCATCGGG
AAGGGGCTCT TCAAAAAGGG TGCCCACGTC GTGAACTTCC TCGCGTTCAT GTTCGCGTCG
ACGAACCTCA TCGTCGAACT CGGATTGATG ATCCTGATCC TGTTGGGCTG GGAGTTCCTC
GTTGCCGAGT TACTCGGTGG TCTCATCCTC ATCGCAGTGA TGGCAGTGAT CGTCCAGTTG
ACGCTTCCAG AAACCCTCTT TGAGGAGGTT CGCACCGAAC TGAATCAGCG AGATCACGAC
CACGGCGTGA CCGAGGATCC GACCTGCGGA ATGGAGGGGC GAGACGAGTA TTCGCTCGTC
ACGGATGGTG GCGAGACGCT GAAATTCTGT TCGGAGGGTT GTATGGAGAC CTATCAGCAG
GAGGTGGCCA GCAGTGGGAG CTGGCGAGAC GAACTCCTGT CGTGGGGTGG CTGGTACAAG
GTCGGCAACC AGTATCGCAA GGAGTGGTCG ATGCTCTACA CGGACGTGAT AGCGGGCTTT
CTGATCTCTG GATTCGTCAT CGTGTTCGTC CCACAGTGGG TCTGGAACAC GCTGTTTTTG
CAGGGCGACG GCATCCTTGT CAGCGCTGAG AACGCAGTCA TGGGCGTGGC TATCGCTGTT
ATTAGCTTCG TCGGTAGCAT GGGAAACGTC CCGTTCGCCG TTGCGCTCTG GGGTGGCGGT
GTGAGCTTCG CCGGTGTGAT CGCCTTCGTC TATGCCGACC TCATTACAGT CCCCGTGCTC
AACGTCTACC GGAAGTACTA CGGCTGGAGT GTGATGCTGT ATATCCTCGG CGTGTTCTTC
GTGACGATGG CATTCACGGG GTTTCTCATG GAACAGCTAT TTAGTGTGTT AGGGATCGTC
CCCGATCTCG CTGGTGGCAT GACTGCGAGC GAGCAGACCT ATTTCGAACT GAACTATACG
TTCTACCTCA ATCTGATCGC GTTCGCACTC TCAGGATTCC TTCTCTACGT GTACCGACGC
GGTCTGGGAG CACCCGGTCA GTACCGGGAT CCTGTCTGTG GAATGCGAAC CGGCGAGGAT
GGTCCAACCG TCATCCACGA TGGTGATACC TACCATTTCT GTTCGAAGGC GTGTCGACGA
GCCTTCGAGG AGACACCCGA GGAGTTCGCT ATGATCGATC CAACAGTCTC GGGCGCTCAC
GATCACCATT GA
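
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the GC content field. The helper names are hypothetical, and only the first display line is used for brevity, so its value will differ from the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "ATGATGTTAG CACCACTGGG ACCAGCGGTT GCCGAGGCCC TCCGGATCGG GATCGGCTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1392 bp sequence through the same two helpers would give the figure to compare against the GC content field.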
 
Protein sequence
MMLAPLGPAV AEALRIGIGF LWTAAWAIIM GLVITSLVQV YVSKERMAAV LGDADMAGVT 
KATLFGAASS GCSFGAVAIG KGLFKKGAHV VNFLAFMFAS TNLIVELGLM ILILLGWEFL
VAELLGGLIL IAVMAVIVQL TLPETLFEEV RTELNQRDHD HGVTEDPTCG MEGRDEYSLV
TDGGETLKFC SEGCMETYQQ EVASSGSWRD ELLSWGGWYK VGNQYRKEWS MLYTDVIAGF
LISGFVIVFV PQWVWNTLFL QGDGILVSAE NAVMGVAIAV ISFVGSMGNV PFAVALWGGG
VSFAGVIAFV YADLITVPVL NVYRKYYGWS VMLYILGVFF VTMAFTGFLM EQLFSVLGIV
PDLAGGMTAS EQTYFELNYT FYLNLIAFAL SGFLLYVYRR GLGAPGQYRD PVCGMRTGED
GPTVIHDGDT YHFCSKACRR AFEETPEEFA MIDPTVSGAH DHH
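
The protein sequence can be related back to the gene sequence by translating codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG. The sketch below builds the codon table from the conventional TCAG ordering and checks the first and last codons of this gene. The function and constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop display spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGATGTTAG CACCACTGGG ACCAGCGGTT"))  # prints "MMLAPLGPAV"
print(translate("GATCACCATT GA"))  # prints "DHH"
```

Both results match the start and end of the protein sequence listed above, and the final TGA codon accounts for the one-codon difference between 1392 / 3 and the 463 aa protein length.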