Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0197 |
Symbol | |
ID | 4710961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 229587 |
End bp | 230675 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854655 |
Product | bile acid:sodium symporter |
Protein accession | YP_001001793 |
Protein GI | 121997006 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0279028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGTCCA TCCTCTACGC CCTTCAGAAG CGACTGACCT GGGCCATACC GGCCATGCTC GCGCTGGGTT TCCTCTACGG CGCTCTCCTG CCGGAAGAGC CGCTGCGCTG GCTGATCCTG CCGCTGACGT TCCTGATGGT CTACCCGATG ATGGTCACCC TGCGCGTATC CCACCTGACC GGGTGGGATG ACCTGCGCGC GCAGCTTCTG ACCCAGGGCG TCAACTTTGC AGTCATCCCC TTCATCGCTT TCGGTCTGGG GCTGCTGTTC TTCGCGGATC AGCCCTATAT GGCGCTCGGG CTCCTGCTGG CCGGCCTGGT GCCGACCAGC GGCATGACCA TCTCCTGGAC GGGCTTCGCG GGCGGCAACC TCGCCGCAGC GGTGAAGATG ACGGTCCTCG GCCTGCTGAT CGGGGCGCTG GCGACCCCCT TCTATGTCGA GGCCCTTCTG GGCGCCCAGA TCGACGTGGA CTTCTGGGCC ATCGCCCAAC AGGTGCTCGT GATCGTTGTC CTGCCCCTTG TCCTCGGCCT ACTCACCCAG CGCGCACTGG TGCGCCGCTT CGGTCAGGAA GACTTCCGCA ATCGCTGGGC GTTTCGCTTC CCGGCACTCT CCACACTGGG CGTGCTGGGC ATCGTCTTCA TCGCCATGGC CCTCAACGCG GAGACGATCC TCGCGGCGCC CGAGCAATTG CTCTACATCC TGATTCCCGT CGCCCTGCTC TACGCCATCA ACTTCCCGCT CAGCACGGCG CTGGGCAAGG CACTCCTGCC CCGCGGTGAC GCCATCGCCC TGGTCTACGG CACGGTGATG CGCAACCTCT CCATTGCCCT GGCGATCGCC ATGAACGCAT TCGGTGCCGA GGGCGCCAAC GCCGCTCTGG TGGTGGCCAT CGCCTTCGTC ATCCAGGTCC AGGCCGCCGC CTGGTACGTG CGGGTCACCC GGCGTGTCTT TGGCCCGCCC GACGACGAGG TGCGCGAGGC CCGTCGGTCG GATACGGCCG AGTCTGCCGA CAACGGCGGC GCTCAGGGGA GCGTCGGGTC GGAAACCGGA GCAGAGAGCG AGTCGGGCTC CCGGCGGCCC TCGGGCTGA
|
Protein sequence | MWSILYALQK RLTWAIPAML ALGFLYGALL PEEPLRWLIL PLTFLMVYPM MVTLRVSHLT GWDDLRAQLL TQGVNFAVIP FIAFGLGLLF FADQPYMALG LLLAGLVPTS GMTISWTGFA GGNLAAAVKM TVLGLLIGAL ATPFYVEALL GAQIDVDFWA IAQQVLVIVV LPLVLGLLTQ RALVRRFGQE DFRNRWAFRF PALSTLGVLG IVFIAMALNA ETILAAPEQL LYILIPVALL YAINFPLSTA LGKALLPRGD AIALVYGTVM RNLSIALAIA MNAFGAEGAN AALVVAIAFV IQVQAAAWYV RVTRRVFGPP DDEVREARRS DTAESADNGG AQGSVGSETG AESESGSRRP SG
|
| |