Gene Hhal_0197 details


Gene Information

Locus tag: Hhal_0197
Symbol:
ID: 4710961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 229587
End bp: 230675
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 67%
IMG OID: 639854655
Product: bile acid:sodium symporter
Protein accession: YP_001001793
Protein GI: 121997006
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID:
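
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(229587, 230675)
print(length, protein_length(length))  # prints: 1089 362
```

Both values match the Gene Length and Protein Length fields of this record.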


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0279028
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGTCCA TCCTCTACGC CCTTCAGAAG CGACTGACCT GGGCCATACC GGCCATGCTC 
GCGCTGGGTT TCCTCTACGG CGCTCTCCTG CCGGAAGAGC CGCTGCGCTG GCTGATCCTG
CCGCTGACGT TCCTGATGGT CTACCCGATG ATGGTCACCC TGCGCGTATC CCACCTGACC
GGGTGGGATG ACCTGCGCGC GCAGCTTCTG ACCCAGGGCG TCAACTTTGC AGTCATCCCC
TTCATCGCTT TCGGTCTGGG GCTGCTGTTC TTCGCGGATC AGCCCTATAT GGCGCTCGGG
CTCCTGCTGG CCGGCCTGGT GCCGACCAGC GGCATGACCA TCTCCTGGAC GGGCTTCGCG
GGCGGCAACC TCGCCGCAGC GGTGAAGATG ACGGTCCTCG GCCTGCTGAT CGGGGCGCTG
GCGACCCCCT TCTATGTCGA GGCCCTTCTG GGCGCCCAGA TCGACGTGGA CTTCTGGGCC
ATCGCCCAAC AGGTGCTCGT GATCGTTGTC CTGCCCCTTG TCCTCGGCCT ACTCACCCAG
CGCGCACTGG TGCGCCGCTT CGGTCAGGAA GACTTCCGCA ATCGCTGGGC GTTTCGCTTC
CCGGCACTCT CCACACTGGG CGTGCTGGGC ATCGTCTTCA TCGCCATGGC CCTCAACGCG
GAGACGATCC TCGCGGCGCC CGAGCAATTG CTCTACATCC TGATTCCCGT CGCCCTGCTC
TACGCCATCA ACTTCCCGCT CAGCACGGCG CTGGGCAAGG CACTCCTGCC CCGCGGTGAC
GCCATCGCCC TGGTCTACGG CACGGTGATG CGCAACCTCT CCATTGCCCT GGCGATCGCC
ATGAACGCAT TCGGTGCCGA GGGCGCCAAC GCCGCTCTGG TGGTGGCCAT CGCCTTCGTC
ATCCAGGTCC AGGCCGCCGC CTGGTACGTG CGGGTCACCC GGCGTGTCTT TGGCCCGCCC
GACGACGAGG TGCGCGAGGC CCGTCGGTCG GATACGGCCG AGTCTGCCGA CAACGGCGGC
GCTCAGGGGA GCGTCGGGTC GGAAACCGGA GCAGAGAGCG AGTCGGGCTC CCGGCGGCCC
TCGGGCTGA
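
The record reports a GC content of 67% for this gene. A minimal sketch of how such a figure could be computed from the sequence as listed is shown below; it assumes the sequence is pasted with its 10-base block spacing and line breaks, which are stripped before counting. The function name `gc_content` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; whitespace (block spacing, newlines) is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence listed in this record.
print(gc_content("ATGTGGTCCA"))  # prints: 0.5
```

Passing the full gene sequence instead of the first block gives the fraction for the whole gene, which can be compared with the reported value.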
 
Protein sequence
MWSILYALQK RLTWAIPAML ALGFLYGALL PEEPLRWLIL PLTFLMVYPM MVTLRVSHLT 
GWDDLRAQLL TQGVNFAVIP FIAFGLGLLF FADQPYMALG LLLAGLVPTS GMTISWTGFA
GGNLAAAVKM TVLGLLIGAL ATPFYVEALL GAQIDVDFWA IAQQVLVIVV LPLVLGLLTQ
RALVRRFGQE DFRNRWAFRF PALSTLGVLG IVFIAMALNA ETILAAPEQL LYILIPVALL
YAINFPLSTA LGKALLPRGD AIALVYGTVM RNLSIALAIA MNAFGAEGAN AALVVAIAFV
IQVQAAAWYV RVTRRVFGPP DDEVREARRS DTAESADNGG AQGSVGSETG AESESGSRRP
SG
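
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it builds the codon table from the standard 64-codon layout, which gives the same amino acid assignments as translation table 11 (the two tables differ only in which codons may serve as start codons). It does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Assumption: codon assignments of table 11 equal those of the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGTGGTCCA TCCTCTACGC CCTTCAGAAG"))  # prints: MWSILYALQK
print(translate("TCGGGCTGA"))                         # prints: SG
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above.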