Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0071 |
Symbol | |
ID | 7401426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 76604 |
End bp | 77593 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643707132 |
Product | Bile acid:sodium symporter |
Protein accession | YP_002564747 |
Protein GI | 222478510 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCG TCGAGAAGTA CCAGTCGGCG CTCGTTCTTG CCGCCATCGC TGTCGGTCTC GCGGTTGGGC AGATCGACGG CGTCGCGTCC GTCGCGGACC GACTCATTCT CCCGTTCCTG ATGGTCATGC TGTTCGCCGC ATTCGTCGGT ATCCCGCTCT CCCACCTCCG GGAGGCGTTC GGGAACCGTC GCGTGGTGGG ATCGAGCCTG CTGGTTAACT TCGTCTGGAG TCCGCTGCTG GCGGTCGGAC TCGGGGCAGT CTTCCTCCGC GACCAGCCCG CGCTCTGGGT CGGACTCATC ATGCTTCTGG TGACGCCGTG TACGGACTGG TATCTCGTCT TCACCGATAT CGCCGACGGC GACGTGCCGC TGGCGACATC CGTGCTGCCG TACAACCTCG TCCTCCAACT GGTGTTGCTA CCGGTGTATC TCTACGCGTT CGCGGGGACG CTCGTCGACC TCCCGCTCGC GCTCTTAGTC GAGAGCGTCT TGCTCGTGCT GGTCGTCCCG CTGATTTTCG GCGGCATCGC CCGGTGGGGC CTCACCCAGA CGAAGGGAGA GACGTGGTTC AGAGAGCGAT TCCTGCCGCG ACTGAGCCCG ATCCAGATCG TCTTCCTCGC GCTGGCCATC GGAGCGATGT TCGCGTCGCA AGGCGAGGTG GTCGTCGAGA ACCCCCGCCT GTTGGCGCTG CTTGCGGTTC CCGTCGTGCT GTTCTACGCG ATCAACCTCG CGGTCGGGTT CGCCGCCGGC CGCCTGCTTT CCTTCTCGTA CGGCGAGATG GTCTGTTTCA ACAACACCAT CCTCTCGCGG AACTCCCCCA CCGCGCTCGC GATCGCCGTC GTCGCGTTCC CCAACGAGCC GCTGATCCCG CTGGCGCTCG TGATCGGGCC ACTGCTCGAA CTCCCGCTTT TAGGCGTTAT CGCGCAGGTC CATCTCGCGG TCAGGGAGCG CTGGACGACC CGCTTCGACG TCTCGTCGAA CGACCGATAG
|
Protein sequence | MDLVEKYQSA LVLAAIAVGL AVGQIDGVAS VADRLILPFL MVMLFAAFVG IPLSHLREAF GNRRVVGSSL LVNFVWSPLL AVGLGAVFLR DQPALWVGLI MLLVTPCTDW YLVFTDIADG DVPLATSVLP YNLVLQLVLL PVYLYAFAGT LVDLPLALLV ESVLLVLVVP LIFGGIARWG LTQTKGETWF RERFLPRLSP IQIVFLALAI GAMFASQGEV VVENPRLLAL LAVPVVLFYA INLAVGFAAG RLLSFSYGEM VCFNNTILSR NSPTALAIAV VAFPNEPLIP LALVIGPLLE LPLLGVIAQV HLAVRERWTT RFDVSSNDR
|
| |