Gene Hlac_3544 details

Gene Information

Locus tag: Hlac_3544
Symbol:
ID: 7402387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 291875
End bp: 293095
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 59%
IMG OID: 643710082
Product: major facilitator superfamily MFS_1
Protein accession: YP_002567648
Protein GI: 222481412
COG category:
COG ID:
TIGRFAM ID:
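
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable and function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 291875
end_bp = 293095
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codons = span // 3
expected_protein_length = codons - 1


def record_is_consistent():
    """Return True if the length fields agree with the coordinates."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, expected_protein_length, record_is_consistent())
```

Under these assumptions the span is 1221 bp, which is 407 codons, giving 406 residues once the stop codon is excluded. This matches the reported gene and protein lengths.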


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGTTC CATTTTGGAA TTCATCTACG CTTCCCCCAA CGAACGTTCT GAAATATTAT 
CTCTACAAAT CGACCAAGGC CGTCGAGTTT TACCGCCCAA TTATGTATCT CTTTTTTCTC
GCACAGGGGC TCACTTTTAC GCAGATCGCT ATTCTCGAGG CGATATACAA TCTGACGACG
CTAGTCGGTG AGATCCCGAC AGGCTACATC GGCGACCGTG TCGGTCGGCG CAACAGTCTC
CTCGTCGGCA CGACCCTCAT CTCGTTCACA CTCGTTGGCA TCGGCCTCTC CAGTTCGTTC
CAAGCGCTCG CGGTGCTGTA CGTCTGCTGG TCAGCAGGGT ACAATTTCCG CTCTGGAAGC
GAAGACGCGT GGCTGTACGA CACCCTCACA GACGGCCGCT CCGAGGACGC ATTCGCGAAC
GTCCGTGGGC GGGGAGAGTC CATCGCACTG GCAATCGGCG CCGCGGCGGC TATCACCGGA
GGGTATCTCG GAAGCATCGA CCTCTCGTAT CCGTGGTTCG TCGCTTCCGC GATGACGGCG
GTCGGCGTGC TCGTCCTCCT GACGGTAGAT GAGTCGGAGA CCTACGAGCG AACCGACACC
GATGATTTGA GCCTCCGACG GACGATCTTG ATCGTCCGAC AGACGCTCTC ACAGCGCAAC
ATTCGGGCGT TCGTGCTGTA TTATTACGTC CTCTACGCGG CAGTGACATA CCTCGTGTTC
GTGTTCCTGC AGCCGATCTT CGAGACGGTC GTGCTCGACC TCGGGGTGTC GCAGTCACGC
GTGAAATCCC TCCTCGGATG GTTCTACGCA ACGTACAGTC TCTTCGGTGC GGGACTGAGC
TACTACACTG GTGCGATTCG GGCTCGTCTC GGGCTTCGAA CGTGGTTTCT GTGGCTCCCC
TTCATCGTCG GCGGCGCGCT GATAGGGATG TATTTCGTTC CGGTGCTCGC GCTTCCGACG
TTCCTACTGA TTCGGGGACT TTCGGACGTG ACGCGGTCGT TCGCCGGACA GTACATCAAC
GACCGAATCG GGACGATGGG GCGCGCGACC GTACTCAGCG CGATGGCGAT GGTGAGTGGT
CTCGCCGTCG TTCCGTTTCA ACTCGGGAGC GGGATCCTCT CCGACGTCGC TTCGCCACTG
TTCGCGCTCG CTGTGGCTGG TGGTGTGCTC GTCGTTGGTG CAACAGGGGT GCTGCTTTGG
GAGGCACCGA TCGAGCGGTG A
 
Protein sequence
MAVPFWNSST LPPTNVLKYY LYKSTKAVEF YRPIMYLFFL AQGLTFTQIA ILEAIYNLTT 
LVGEIPTGYI GDRVGRRNSL LVGTTLISFT LVGIGLSSSF QALAVLYVCW SAGYNFRSGS
EDAWLYDTLT DGRSEDAFAN VRGRGESIAL AIGAAAAITG GYLGSIDLSY PWFVASAMTA
VGVLVLLTVD ESETYERTDT DDLSLRRTIL IVRQTLSQRN IRAFVLYYYV LYAAVTYLVF
VFLQPIFETV VLDLGVSQSR VKSLLGWFYA TYSLFGAGLS YYTGAIRARL GLRTWFLWLP
FIVGGALIGM YFVPVLALPT FLLIRGLSDV TRSFAGQYIN DRIGTMGRAT VLSAMAMVSG
LAVVPFQLGS GILSDVASPL FALAVAGGVL VVGATGVLLW EAPIER
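
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before further use. The following is a minimal sketch of one way to do that and to compute a GC fraction that can be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample holds only the first and last lines of the gene sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First and last lines of the gene sequence, used only as a short sample.
sample = """
ATGGCGGTTC CATTTTGGAA TTCATCTACG CTTCCCCCAA CGAACGTTCT GAAATATTAT
GAGGCACCGA TCGAGCGGTG A
"""

dna = clean_sequence(sample)
# Show the cleaned length, the start codon, and the final codon of the sample.
print(len(dna), dna[:3], dna[-3:])
print(round(gc_fraction(dna), 2))
```

Pasting the full gene sequence into `sample` should give a cleaned length equal to the Gene Length field, assuming the page shows the complete coding sequence.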