Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0216 |
Symbol | |
ID | 7402145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 232217 |
End bp | 233482 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707279 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_002564891 |
Protein GI | 222478654 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.013646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGTT TCATCGGATC GCTGTGGCGT CGGTATCGAT CGGTACCGCT CATCTACCGC ATCGCGGTCG CGTTCCTCCT CGGATCGCTC GCTGGCGCGG TCTTCGGTGA GCGGATGACC GTCGTCAAAC CGTTCGGTGA CCTCTTTTTG CGCCTGCTCA ACATGCTCGC CGTCCCGATC ATCGTCTTCA CGCTGCTCAC CGGGATCAGA CAGCTCTCGC CGGCGAAGCT CGGGCGCATC GGCGGGGCGA CGGTCGGGCT CTACGCCGTG ACGACGACGT TCGCCGGGCT GATCGGGCTC GCGGTCGCGA ACCTGCTGCG CCCGGGTCGC GGCGTGGAGT TCACGGGCGG TGAGGCGCAG TCGCAGGCGC CGCCGTCGCT GACCGAGGTC GTCCTTGGAA TCGTCCCGAA CAACCCGGTG GCCGCGATGG CCGAGGGGAA CCTGCTCGCG ACGGTCTTTT TCGTGATCGT GTTCGGTATC GCGCTCACCT ACGTCCGGGC GCAGAAACCG GAACTCGCGG GCCGCGTCGA CGGCGTGTTC GGGGCGTTCA AGATCGGAGC CGAGGCGATG TTCGTGGTCG TCCGCGGCGT GTTGGAGTAC GGGGTCATCG GCGTGTTCGC CCTCATGGCG GTCGGGATCG GCACCGAGGG CGTCGGCGTG TTCTCCTCGC TCGGCGCGCT CGTGCTGGCG GTCGGCGTCG CGGTCGTCGT CCACATCGCG TTCACGTACC TGTTCGTACT CATGCGCGTG GTCGCCGGCG TCTCCCCGGT CGCGTTCCTT AGGGGCGCAA AAGACGCGAT GCTCACCGCC TTCGCGACGC GCTCGTCCAG CGGGACGCTC CCCGTGACGA TGACGAACGC CGAAGAGGAT CTCCGGATCG AAGAGCGGGT GTACTCGTTC GCGCTCCCCG TCGGCGCCAC CGCCAACATG GACGGCGCCG CGATTCGACA GGCGATCACC GTGATGTTCG CGGCCAACGC CGTGGGACAG CCGCTCGCGC TCACCGAGCA GTTCCTCGTG TTGGTCGTCG CCGTGCTGAT CAGCATCGGG ACCGCCGGCG TCCCGGGCGC CGGACTCGTC ATGTTGACCG TCGTATTGAG TCAGGTCGGC CTCCCGCTGG CGGTCGTCGG CTTCGTCGCC GGCGTCGACC CCATCCTCGG GCGCATCGCG ACGATGAACA ACGTCACCGG CGACCTGGCG GTCGCGACCG TGGTCGGCAA GTGGAACGAC GCCGTCGACT TCGGCGACGG CGTGTGGGCC AGATAG
|
Protein sequence | MASFIGSLWR RYRSVPLIYR IAVAFLLGSL AGAVFGERMT VVKPFGDLFL RLLNMLAVPI IVFTLLTGIR QLSPAKLGRI GGATVGLYAV TTTFAGLIGL AVANLLRPGR GVEFTGGEAQ SQAPPSLTEV VLGIVPNNPV AAMAEGNLLA TVFFVIVFGI ALTYVRAQKP ELAGRVDGVF GAFKIGAEAM FVVVRGVLEY GVIGVFALMA VGIGTEGVGV FSSLGALVLA VGVAVVVHIA FTYLFVLMRV VAGVSPVAFL RGAKDAMLTA FATRSSSGTL PVTMTNAEED LRIEERVYSF ALPVGATANM DGAAIRQAIT VMFAANAVGQ PLALTEQFLV LVVAVLISIG TAGVPGAGLV MLTVVLSQVG LPLAVVGFVA GVDPILGRIA TMNNVTGDLA VATVVGKWND AVDFGDGVWA R
|
| |