Gene Hlac_0216 details


Gene Information

Locus tag: Hlac_0216
Symbol:
ID: 7402145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 232217
End bp: 233482
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 68%
IMG OID: 643707279
Product: sodium:dicarboxylate symporter
Protein accession: YP_002564891
Protein GI: 222478654
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
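
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the gene length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(232217, 233482)
print(length)                              # 1266
print(expected_protein_length_aa(length))  # 421
```

Both results match the Gene Length and Protein Length fields in the record.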


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.013646
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGTT TCATCGGATC GCTGTGGCGT CGGTATCGAT CGGTACCGCT CATCTACCGC 
ATCGCGGTCG CGTTCCTCCT CGGATCGCTC GCTGGCGCGG TCTTCGGTGA GCGGATGACC
GTCGTCAAAC CGTTCGGTGA CCTCTTTTTG CGCCTGCTCA ACATGCTCGC CGTCCCGATC
ATCGTCTTCA CGCTGCTCAC CGGGATCAGA CAGCTCTCGC CGGCGAAGCT CGGGCGCATC
GGCGGGGCGA CGGTCGGGCT CTACGCCGTG ACGACGACGT TCGCCGGGCT GATCGGGCTC
GCGGTCGCGA ACCTGCTGCG CCCGGGTCGC GGCGTGGAGT TCACGGGCGG TGAGGCGCAG
TCGCAGGCGC CGCCGTCGCT GACCGAGGTC GTCCTTGGAA TCGTCCCGAA CAACCCGGTG
GCCGCGATGG CCGAGGGGAA CCTGCTCGCG ACGGTCTTTT TCGTGATCGT GTTCGGTATC
GCGCTCACCT ACGTCCGGGC GCAGAAACCG GAACTCGCGG GCCGCGTCGA CGGCGTGTTC
GGGGCGTTCA AGATCGGAGC CGAGGCGATG TTCGTGGTCG TCCGCGGCGT GTTGGAGTAC
GGGGTCATCG GCGTGTTCGC CCTCATGGCG GTCGGGATCG GCACCGAGGG CGTCGGCGTG
TTCTCCTCGC TCGGCGCGCT CGTGCTGGCG GTCGGCGTCG CGGTCGTCGT CCACATCGCG
TTCACGTACC TGTTCGTACT CATGCGCGTG GTCGCCGGCG TCTCCCCGGT CGCGTTCCTT
AGGGGCGCAA AAGACGCGAT GCTCACCGCC TTCGCGACGC GCTCGTCCAG CGGGACGCTC
CCCGTGACGA TGACGAACGC CGAAGAGGAT CTCCGGATCG AAGAGCGGGT GTACTCGTTC
GCGCTCCCCG TCGGCGCCAC CGCCAACATG GACGGCGCCG CGATTCGACA GGCGATCACC
GTGATGTTCG CGGCCAACGC CGTGGGACAG CCGCTCGCGC TCACCGAGCA GTTCCTCGTG
TTGGTCGTCG CCGTGCTGAT CAGCATCGGG ACCGCCGGCG TCCCGGGCGC CGGACTCGTC
ATGTTGACCG TCGTATTGAG TCAGGTCGGC CTCCCGCTGG CGGTCGTCGG CTTCGTCGCC
GGCGTCGACC CCATCCTCGG GCGCATCGCG ACGATGAACA ACGTCACCGG CGACCTGGCG
GTCGCGACCG TGGTCGGCAA GTGGAACGAC GCCGTCGACT TCGGCGACGG CGTGTGGGCC
AGATAG
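
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be flattened before any computation such as GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (blocks of 10 bases).
first_row = "ATGGCCAGTT TCATCGGATC GCTGTGGCGT CGGTATCGAT CGGTACCGCT CATCTACCGC "
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this row only
```

Passing the full 1266 bp sequence through the same two functions gives a value that can be compared with the GC content field of the record.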
 
Protein sequence
MASFIGSLWR RYRSVPLIYR IAVAFLLGSL AGAVFGERMT VVKPFGDLFL RLLNMLAVPI 
IVFTLLTGIR QLSPAKLGRI GGATVGLYAV TTTFAGLIGL AVANLLRPGR GVEFTGGEAQ
SQAPPSLTEV VLGIVPNNPV AAMAEGNLLA TVFFVIVFGI ALTYVRAQKP ELAGRVDGVF
GAFKIGAEAM FVVVRGVLEY GVIGVFALMA VGIGTEGVGV FSSLGALVLA VGVAVVVHIA
FTYLFVLMRV VAGVSPVAFL RGAKDAMLTA FATRSSSGTL PVTMTNAEED LRIEERVYSF
ALPVGATANM DGAAIRQAIT VMFAANAVGQ PLALTEQFLV LVVAVLISIG TAGVPGAGLV
MLTVVLSQVG LPLAVVGFVA GVDPILGRIA TMNNVTGDLA VATVVGKWND AVDFGDGVWA
R
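
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCCAGTT TCATCGGATC GCTGTGGCGT"))  # MASFIGSLWR
# Last 15 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GTGTGGGCC AGATAG"))                  # VWAR
```

The two outputs match the first ten and last four residues of the protein sequence shown above.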