Gene Hlac_1191 details

Gene Information

Locus tag: Hlac_1191
Symbol:
ID: 7399458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1196302
End bp: 1197354
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 69%
IMG OID: 643708256
Product: periplasmic solute binding protein
Protein accession: YP_002565855
Protein GI: 222479618
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin
TIGRFAM ID:
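
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1196302
end_bp = 1197354

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1053
print(protein_length_aa)  # 350
```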


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000411642
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000625153
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACACGG ACGCGAACGG CTTGTCACGG GTCTCACGTC GCCGGTTCGC TGCGCTCGGC 
GCCGGCGCTC TCGCGGGAGG GCTCGCGGGT TGCACCGGGA ACGCGACCAA CGCGGGCTCG
ACCGGAGATG GGGACGGCAA CGGCGACGGC TACACCGTCG TCGCCTCGTT TTTCACGTTC
TACGACTTCG CCGACTCGCT CGCCGAGGGA ACGGACGTTA CGGTCGAGAA CTTGGTCCCG
ACCGGGCTCC ACGGGCACGG CTGGGAGCCG GACCCCTCGA TCCAGCGCCG AATCACCGAC
GCCGACGCCC TCGTCCACGT CGGCCCCGAT TTCCAGCCGT GGGTCGACCG CGCGATCGAC
GCGCTCGCGG CCGAGTCGAC CGAGACGGCG CTTATCAACG CCCGAGCGGG CGTCGACCTC
ATCGATCTCG CCGACTCGCT GACGGAAGAC GAGGCGGTCG AGGGGGCGAA GGACCCGCAC
TTCTGGCTCG ACCCGCAGCG CGCGAAGATT GCGGTCGAGA ACATCGCCGA CGGGCTGGCC
GCCGTCGATC CCGATCACGA GGCGACGATC CGGGAGAACG CGACCGCGCT CAAGGCCGAA
CTCGACGCCC TCGACGACGA GTGGCAGGCG GTCTTCGACG CGGCCGAGCG CGACGTGGCG
TTCCTCGCCG CACACAACGC CTTCGCGTAC GTCTCTCACC GATACGACGC GACGATCGAG
CCGCTCGTGG TGAACCTCGC GGCCAGCAAC GACGTCCGAC CGGCCGACAT GCAGCGAGCG
CAGGAGACGA TCGCCGACCA CGGCATCGAA CACATCGGCG CCGCCGTCTT CGAGCCGATT
CGGCCGGCAC AACAGCTGCT GGCGCAGACC GATGTCGAGG CGTACTACCC CGTGACGCCA
TACGCGGGCA CCGCCGAGTC ATGGGTCGAG CGCGGGTGGG GGTACTTCGA GATCGCCCGC
GAGGTGAACC TCCCGACGTT CCGGATCCTT CTCGGCGTCG ACGACCCCGA GGACGTAACG
TTCGCCGACT ACGGTCGGAA CTTCCAGCCA TGA
 
Protein sequence
MHTDANGLSR VSRRRFAALG AGALAGGLAG CTGNATNAGS TGDGDGNGDG YTVVASFFTF 
YDFADSLAEG TDVTVENLVP TGLHGHGWEP DPSIQRRITD ADALVHVGPD FQPWVDRAID
ALAAESTETA LINARAGVDL IDLADSLTED EAVEGAKDPH FWLDPQRAKI AVENIADGLA
AVDPDHEATI RENATALKAE LDALDDEWQA VFDAAERDVA FLAAHNAFAY VSHRYDATIE
PLVVNLAASN DVRPADMQRA QETIADHGIE HIGAAVFEPI RPAQQLLAQT DVEAYYPVTP
YAGTAESWVE RGWGYFEIAR EVNLPTFRIL LGVDDPEDVT FADYGRNFQP
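
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block might be cleaned, translated, and checked for GC content is given below. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons). Only the first and last lines of the gene sequence are used here as examples.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = clean_sequence(
    "ATGCACACGG ACGCGAACGG CTTGTCACGG GTCTCACGTC GCCGGTTCGC TGCGCTCGGC"
)
last_line = clean_sequence("TTCGCCGACT ACGGTCGGAA CTTCCAGCCA TGA")

print(translate(first_line))  # MHTDANGLSRVSRRRFAALG
print(translate(last_line))   # FADYGRNFQP
print(round(gc_fraction(first_line), 3))
```

Both translated fragments agree with the start and end of the protein sequence listed above. Running the same functions on the full gene sequence would allow the reported protein length and GC content to be checked in the same way.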