Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4024 |
Symbol | |
ID | 8744652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 280909 |
End bp | 281901 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646514593 |
Product | Bile acid:sodium symporter |
Protein accession | YP_003405540 |
Protein GI | 284167262 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACGAC TCTCGAAACA GTGGATTCAA CACAACCAAG TCGGATTATA CGCCGTCGCC GTCCTCCTTG CAATCGGGGT CGGCCTCGGG CAACCGAGTG CAGGCTCACT CTTGGAACTG CTTATCAATC CCATCCTGGC GGTGTTGTTG TACGTAACCT TTCTGGAGAT ACCGTTCGTC CGGATCAGAC GAGCGTTCCG GAATGGGCGG TTCATGATAG CTGCTTTCGG AATGAACTTC GTGGTCGTCC CAGTGGTTGT ATTCGGTCTC ACGCGGTTCC TGCCGCAGGA GCCGGTGCTG CTCGTTGGGG TGTTCATGGT ATTGTTGACG CCGTGTATCG ACTACGTCAT CACGTTCACG GATCTGGCAG GTGGTGACGC CGAGCAGATC ACTGCCGCGA CGCCGGCGCT GATGCTCGTG CAATTGCTGT TGCTCCCCGT GTACCTCTGG CTGTTCATGG GCCAGCAGGT GGCTGAGTTC ATCGAGGCTG GACCGTTCAT CGAGGCGTTC GTCGTGATCA TTGCGCTGCC GTTAGCGCTC GCCTGGGCGA CCGAACTCTG GGCAGATCGG TCGAAACGTG TCGAAGAGTG CCAGGACGTA ATGAGATGGT TGCCGGTGCC GATGATGGGT GTGACGCTGT TCGTCGTCAT CGCCTCCCAA CTGCCACGTG TCCAGAATTC GATCGGTCAA ATCGCGGCTG TCGTTCCGGT GTACGTAGTG TTCCTCGTCA TCATGCCCCT GCTCAGTCGA CTTGCTGCCG GACTTCTCGG GATGGATGTC GGTGAGAGTC GTGCTCTCGT GTTTACATCC GTGACGCGAA ACTCACTAGT TATTCTGCCG CTGGCGCTGG CGTTGCCGTC AGGGTATGCG CTCGCGCCAG CGGTCGTCGT GACGCAGACG CTCATCGAAC TGACCGGGAT GGTCGTCCTG ACGCGAGTTG TTCCGGGATG GCTCCTGCCG AACGCACCAT CCCAGACTCC GTCAGGCACA TAA
|
Protein sequence | MIRLSKQWIQ HNQVGLYAVA VLLAIGVGLG QPSAGSLLEL LINPILAVLL YVTFLEIPFV RIRRAFRNGR FMIAAFGMNF VVVPVVVFGL TRFLPQEPVL LVGVFMVLLT PCIDYVITFT DLAGGDAEQI TAATPALMLV QLLLLPVYLW LFMGQQVAEF IEAGPFIEAF VVIIALPLAL AWATELWADR SKRVEECQDV MRWLPVPMMG VTLFVVIASQ LPRVQNSIGQ IAAVVPVYVV FLVIMPLLSR LAAGLLGMDV GESRALVFTS VTRNSLVILP LALALPSGYA LAPAVVVTQT LIELTGMVVL TRVVPGWLLP NAPSQTPSGT
|
| |