Gene Htur_4024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4024 
Symbol 
ID8744652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp280909 
End bp281901 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content59% 
IMG OID646514593 
ProductBile acid:sodium symporter 
Protein accessionYP_003405540 
Protein GI284167262 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACGAC TCTCGAAACA GTGGATTCAA CACAACCAAG TCGGATTATA CGCCGTCGCC 
GTCCTCCTTG CAATCGGGGT CGGCCTCGGG CAACCGAGTG CAGGCTCACT CTTGGAACTG
CTTATCAATC CCATCCTGGC GGTGTTGTTG TACGTAACCT TTCTGGAGAT ACCGTTCGTC
CGGATCAGAC GAGCGTTCCG GAATGGGCGG TTCATGATAG CTGCTTTCGG AATGAACTTC
GTGGTCGTCC CAGTGGTTGT ATTCGGTCTC ACGCGGTTCC TGCCGCAGGA GCCGGTGCTG
CTCGTTGGGG TGTTCATGGT ATTGTTGACG CCGTGTATCG ACTACGTCAT CACGTTCACG
GATCTGGCAG GTGGTGACGC CGAGCAGATC ACTGCCGCGA CGCCGGCGCT GATGCTCGTG
CAATTGCTGT TGCTCCCCGT GTACCTCTGG CTGTTCATGG GCCAGCAGGT GGCTGAGTTC
ATCGAGGCTG GACCGTTCAT CGAGGCGTTC GTCGTGATCA TTGCGCTGCC GTTAGCGCTC
GCCTGGGCGA CCGAACTCTG GGCAGATCGG TCGAAACGTG TCGAAGAGTG CCAGGACGTA
ATGAGATGGT TGCCGGTGCC GATGATGGGT GTGACGCTGT TCGTCGTCAT CGCCTCCCAA
CTGCCACGTG TCCAGAATTC GATCGGTCAA ATCGCGGCTG TCGTTCCGGT GTACGTAGTG
TTCCTCGTCA TCATGCCCCT GCTCAGTCGA CTTGCTGCCG GACTTCTCGG GATGGATGTC
GGTGAGAGTC GTGCTCTCGT GTTTACATCC GTGACGCGAA ACTCACTAGT TATTCTGCCG
CTGGCGCTGG CGTTGCCGTC AGGGTATGCG CTCGCGCCAG CGGTCGTCGT GACGCAGACG
CTCATCGAAC TGACCGGGAT GGTCGTCCTG ACGCGAGTTG TTCCGGGATG GCTCCTGCCG
AACGCACCAT CCCAGACTCC GTCAGGCACA TAA
 
Protein sequence
MIRLSKQWIQ HNQVGLYAVA VLLAIGVGLG QPSAGSLLEL LINPILAVLL YVTFLEIPFV 
RIRRAFRNGR FMIAAFGMNF VVVPVVVFGL TRFLPQEPVL LVGVFMVLLT PCIDYVITFT
DLAGGDAEQI TAATPALMLV QLLLLPVYLW LFMGQQVAEF IEAGPFIEAF VVIIALPLAL
AWATELWADR SKRVEECQDV MRWLPVPMMG VTLFVVIASQ LPRVQNSIGQ IAAVVPVYVV
FLVIMPLLSR LAAGLLGMDV GESRALVFTS VTRNSLVILP LALALPSGYA LAPAVVVTQT
LIELTGMVVL TRVVPGWLLP NAPSQTPSGT